ETL Developer

Smart IT Frame LLC

Pittsburgh, PA, United States

Full-time

Role : ETL Engineer

Location : Remote

No. of years of experience : 8+ years

Core Technical Skills - ETL

ETL experience : Strong, 8+ years
Hadoop : Strong, 8+ years
PySpark : Strong, 8+ years
Spark : Strong, 8+ years
Dask : Required; should be able to set up the Dask framework
JupyterHub Setup : Experience required

Location : Pittsburgh preferred, but remote is fine

Job Description :

We are seeking an experienced ETL Engineer to join our team. The ideal candidate will have 8 to 10 years of experience in designing, developing, and optimizing ETL processes, with strong expertise in Hadoop, PySpark, Spark, and Dask.

The role involves setting up and managing data workflows, ensuring data integrity and efficiency, and collaborating with cross-functional teams to meet business needs.

If you are passionate about data engineering and have a track record of delivering high-quality ETL solutions, we encourage you to apply.

Key Responsibilities :

ETL Processes : Design, develop, and optimize ETL pipelines to efficiently extract, transform, and load data from various sources.
Hadoop : Implement and manage Hadoop-based solutions for large-scale data processing and storage, ensuring optimal performance and scalability.
PySpark : Develop and maintain PySpark applications for processing and analyzing big data, leveraging Spark's capabilities for distributed computing.
Spark : Utilize Apache Spark for data processing, including batch and streaming data applications, ensuring high performance and reliability.
Dask : Set up and manage the Dask framework for parallel computing and distributed data processing, optimizing workflows and handling large-scale data tasks.
JupyterHub Setup : Configure and maintain JupyterHub environments for collaborative data analysis and notebook sharing, ensuring a seamless user experience.

Must-Have Skills :

ETL Expertise : Strong experience in designing and managing ETL processes, including data extraction, transformation, and loading.
Hadoop : Proven proficiency with Hadoop ecosystem tools and technologies for big data processing and storage.
PySpark : Extensive experience with PySpark for data processing, including writing and optimizing Spark jobs.
Spark : Deep understanding of Apache Spark for both batch and real-time data processing.
Dask : Hands-on experience setting up and managing the Dask framework for distributed computing and large-scale data processing.
JupyterHub Setup : Experience configuring and maintaining JupyterHub for data analysis and notebook collaboration.
Communication Skills : Strong verbal and written communication skills, with the ability to articulate complex technical concepts to diverse audiences.
Independent Work : Ability to work independently, manage multiple tasks, and deliver high-quality results with minimal supervision.

Good-to-Have Skills :

Cloud Platforms : Familiarity with cloud-based data platforms for deploying and managing big data solutions.
Data Visualization : Experience with data visualization tools (e.g., Tableau, Power BI) for creating insightful visualizations and reports.
Data Engineering Tools : Knowledge of additional data engineering tools and frameworks, including ETL and data integration technologies.
Agile Methodologies : Experience with Agile development practices and methodologies for managing data projects and tasks.

Qualifications :

Bachelor’s or Master’s degree in Computer Science, Data Engineering, Software Engineering, or a related field.
8 to 10 years of experience in data engineering, with strong expertise in ETL processes, Hadoop, PySpark, Spark, and Dask.
Proven experience setting up and managing JupyterHub environments.
