Search jobs > San Francisco, CA > Data engineer

Data Engineer (The Data Pipeline Architect)

Unreal Gigs
San Francisco, California, US
Full-time

Are you passionate about designing, building, and maintaining data pipelines that support robust data architectures and facilitate seamless data flow?

Do you excel in creating scalable solutions that empower data-driven decision-making? If you’re ready to develop and optimize data systems that drive impactful analytics, our client has the perfect role for you.

We’re seeking a Data Engineer (aka The Data Pipeline Architect) to build and manage cloud-based data infrastructures that support analytical needs and operational efficiencies.

While professional experience and qualifications are key for this role, make sure to check you have the preferable soft skills before applying if required.

As a Data Engineer at our client , you’ll collaborate with data scientists, analysts, and software engineers to construct data pipelines and storage solutions that are both efficient and secure.

Your role will be critical in ensuring data systems are optimized for performance, reliability, and scalability.

Key Responsibilities :

Design and Implement Scalable Data Pipelines : Develop and maintain data pipelines that support data ingestion, transformation, and integration using cloud technologies.

You’ll automate data workflows and ensure the seamless movement of data between various systems.

Manage and Optimize Data Storage Solutions : Architect and maintain data lakes and data warehouses using platforms like BigQuery, Redshift, Snowflake, or similar cloud-based solutions.

You’ll ensure data structures are built for performance and scalability.

Collaborate with Data Teams for Strategy Development : Work closely with data scientists, analysts, and business stakeholders to understand data requirements and align data solutions with business goals.

You’ll provide input on data models and storage strategies.

Ensure Data Quality and Reliability : Implement and manage processes for data validation, error handling, and consistency checks.

You’ll ensure the quality of data is maintained through robust testing and monitoring practices.

Develop and Automate ETL Processes : Build ETL (Extract, Transform, Load) workflows to handle complex data transformations.

You’ll automate data extraction and transformation to support efficient data integration and reporting.

Monitor and Maintain Data Infrastructure : Use monitoring tools to track the performance and reliability of data systems.

You’ll proactively identify and resolve potential issues to maintain system health and performance.

Optimize Data Processing and Resource Management : Implement strategies for efficient resource allocation and cost-effective data processing.

You’ll leverage parallel processing and cloud capabilities to enhance performance.

Required Skills :

Cloud Data Platform Expertise : Experience with cloud data platforms such as AWS (Redshift, S3, Glue), GCP (BigQuery, Dataflow), or Azure (Azure Data Lake, Synapse).

You’re proficient in handling cloud-based data solutions.

Programming and Scripting Knowledge : Proficiency in Python, Java, or Scala for building data pipelines and data processing tasks.

You can write clean, efficient code for automation.

ETL and Data Pipeline Management : Proven ability to develop, maintain, and optimize ETL processes that handle large volumes of data.

You’re experienced with orchestration tools like Apache Airflow or Luigi.

  • SQL and Database Management : Strong ability to write complex SQL queries and work with relational and NoSQL databases.
  • Problem-Solving and Critical Thinking : Excellent problem-solving skills with a proactive approach to identifying and resolving data-related challenges.

Educational Requirements :

  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field. Equivalent experience in data engineering and cloud technologies may be considered.
  • Certifications in cloud data engineering (e.g., Google Professional Data Engineer, AWS Certified Big Data Specialty, Microsoft Certified : Azure Data Engineer Associate) are a plus.

Experience Requirements :

  • 3+ years of experience in data engineering, with a proven track record of building and managing cloud-based data systems.
  • Experience with real-time data processing frameworks like Apache Kafka or AWS Kinesis is advantageous.
  • Familiarity with containerization and microservices architecture is a plus.

J-18808-Ljbffr

3 days ago
Related jobs
Promoted
The San Francisco Standard
San Francisco, California

By joining The Standard as a Senior Data & Analytics Engineer, you will play a crucial role in advancing our data infrastructure and capabilities, driving data-informed decisions, and shaping the future of local journalism. As a Senior Data & Analytics Engineer at The Standard, you will be i...

Promoted
VirtualVocations
Oakland, California

A company is looking for a Software Engineer, Data Transformation and Movement. ...

Promoted
Entertainment Data Oracle (EDO)
South San Francisco, California

A Machine Learning Engineer at EDO is part of the Data Science team and focuses on production deployment/maintenance of internal data and modeling products. The role requires planning and architecture skills along with solid engineering experience and intuition to build maintainable and extensible m...

Promoted
VirtualVocations
Oakland, California

A company is looking for a Data Operations Engineer to ensure smooth data flow and support AI initiatives for local businesses. ...

Promoted
Genmo
San Francisco, California

We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the boundaries of what's possible in video generation. We're seeking an experienced Senior/Staff AI Infra En...

Promoted
VirtualVocations
Oakland, California

A company is looking for a Marketing Data Engineer (Remote). ...

Promoted
Tbwa Chiat/Day Inc
San Francisco, California

Work cross-functionally with engineers and partners from the Product, Data Science, Data Engineering, and AI teams to deliver short- and long-term impact. Notion is an in-person company, currently requiring employees to come to the office for two Anchor Days (Mondays & Thursdays) and requests th...

Promoted
Plus Power
San Francisco, California

In order for the global energy system to make the transition from a carbon-based grid, to a renewable energy grid, large scale energy storage must be introduced into the electricity system to balance intra-hour supply and demand. Plus Power recruits outstanding energy industry professionals who are ...

Promoted
Guidewire Software
San Mateo, California

Do you have prior experience in building data platforms/data lake and analytics SaaS services in the public cloud with expertise in modern Big Data stack (Apache Spark, Hadoop, Kafka, Flink, Hive etc. Senior Software Engineer role within a data platform/data engineering team. If so then come join ou...

Tredence
San Bruno, California

Hands-on of one or more scripting language is a plusABOUT YOU:You are self-motivated, collaborative, eager to learn, and hands onYou love trying out new apps, and find yourself coming up with ideas to improve themYou stay ahead with all the latest trends and technologiesYou are particular about foll...