Data Engineer Lead

Photon
TX, United States
Full-time

Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases.

Experience building and optimizing big data’ data pipelines, architectures, and data sets.

Experience in cloud services such as AWS EMR, EC2, EKS, Juypter notebooks

Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.

Strong analytic skills related to working with unstructured datasets.

Build processes supporting data transformation, data structures, metadata management, dependency, and workload management.

A successful history of manipulating, processing, and extracting value from large disconnected datasets.

Working knowledge of message queuing, stream processing, and highly scalable big data’ data stores.

Strong project management and organizational skills. Knowledge of agile methods is a plus.

Experience supporting and working with cross-functional teams in a dynamic environment.

Candidate should also have experience using the following software / tools :

Strong experience with AWS cloud services : EC2, EMR, EKS, Snowflake, Elastic-Search

Experience with stream-processing systems : Storm, Spark-Streaming, Kafka etc.

Experience with object-oriented / object function scripting languages : Python, Java, C++, Scala, etc.

Experience with relational SQL and NoSQL databases, including Postgres and Cassandra.

Experience with devops, data pipeline and workflow management tools : Concourse, Terraform, Luigi, Airflow, etc.

Bachelor’s in computer science or related field with at least 10 years for relevant experience is desired.

3 hours ago
Related jobs
Promoted
JPMorganChase
Plano, Texas

As a Senior Lead Software Engineer at JPMorgan Chase within the Corporate Sector - AI/ML & Data Platforms team, you are an integral part of an agile team that works to enhance, build, and deliver trusted market-leading technology products in a secure, stable, and scalable way. Actively contribut...

Promoted
Prodapt
Irving, Texas

The Lead Data Engineer will be responsible for understanding business requirements, proposing scalable solutions, performing code reviews, and ensuring seamless delivery. Lead and guide the data engineering team. The ideal candidate will have extensive experience working on GCP (Google Cloud Platfor...

Promoted
AgileEngine
Austin, Texas
Remote

Work with data lakes to develop data pipelines. US companies, we are always open to talented software, UX, and data experts in the Americas, Europe, and Asia. Collaborate closely and build rapport with product, research, and engineering teams. Experience working with Data Lakes. ...

CBase Inc
San Antonio, Texas
Remote

You will be the Revenue Management domain Data Engineering subject matter expert (SME) to be consulted by Data & Analytics, Data Governance, as well as other EM engineers, and Leadership, on the use of Data Assets to meet business needs. Actively leads, coaches, mentors, and teaches other engineers;...

The Friedkin Group
Sugar Land, Texas

This position demands a blend of systems engineering, data integration, and data analytics skills to enhance TFG's data capabilities, supporting advanced analytics, machine learning projects, and real-time data processing needs. As a Lead Data Engineer within the Trailblazer initiative, you will pla...

Capital One
Plano, Texas

Plano 1 (31061), United States of America, Plano, TexasLead Data Engineer. We are seeking Data Engineers who are passionate about marrying data with emerging technologies. As a Capital One Lead Data Engineer, you’ll have the opportunity to be on the forefront of driving a major transformation within...

Prudential Financial
TX, US

As a Lead Software Engineer on/in Data Management & Governance you will partner with product owners, tech leads, designers, engineers and delivery professionals to improve Data Management and Governance services. Experience in building scalable and stronger data pipelines to support data integra...

NexTier Oilfield Solutions
Houston, Texas

Experience with data processing platforms such as Spark, Hadoop, Hive, Sqoop, Airflow, Google Cloud Platform (Dataproc, Dataflow, BigQuery, Compute Engine, Data Fusion), AWS (EMR, Kinesis, Lambda, Glue), Azure (HDInsight, Data Factory). The position is responsible for capturing requirements and perf...

Amerit Consulting
Austin, Texas
Remote

As the Senior Engineer, you will lead a team of data engineers in designing, building, and maintaining high-performance software system to manage analytical data pipelines that fuel the organization’s data strategy using software engineering best practices. Experience leading a data engineering team...

Anblicks
Dallas, Texas

Design and implement Snowflake data pipelines and data warehouses to support business intelligence, analytics, and machine learning initiatives. Optimize data models and query performance to ensure efficient data access and analysis. Develop and maintain data governance policies and procedures to en...