Responsibilities
- Ensure the quality and reliability of data pipelines, data warehousing, and analytics solutions driven through automated testing.
- A strong background in data engineering, ETL & ELT (Extract, Transform, Load) orchestration tools, SQL, & Tableau.
- Identify and report defects and work closely with data engineers to resolve issues.
- Collaborate with data engineers to validate and optimize data warehousing solutions.
- Ensure data consistency, accuracy, and efficient storage.
- Utilize ETL orchestration tools to automate and schedule data workflows.
- Create tests to validate functionality and reliability of these workflows.
- Develop and maintain test suites for Tableau dashboards and reports; verify data accuracy and dashboard functionality to ensure data visualizations provide meaningful insights.
- Write and execute SQL queries to validate data transformations, data loading, & data retrieval processes. Ensure data consistency and correctness at each stage of the pipeline.
- Test data streaming processes for correctness, data integrity, & performance.
- Implement and maintain regression test suites to ensure that changes or updates to data pipelines do not introduce new issues or regressions.
- Develop and maintain automated test scripts and frameworks for data engineering processes to improve testing efficiency and coverage.
- Collaborate closely with data engineers, data analysts, and other stakeholders to understand data requirements and ensure data quality and reliability.
- Document test cases, test plans, and test results; create and maintain documentation on data pipelines, ETL processes, & data structures.
- Stay up to date with industry best practices, emerging technologies, and trends related to data engineering and testing;
identify opportunities for process improvement and automation.
Skills & Requirements :
- 5+ years of experience with software QA and / or data engineering.
- Proven experience in software testing, specifically in data engineering, ETL processes, and data warehousing.
- Strong SQL skills and experience working with databases such as SQL Server or similar, and data warehousing such as Snowflake or Amazon Redshift.
- Proficiency in ETL orchestration tools such as SnapLogic, Fivetran, similar.
- Experience with data visualization tools, particularly Tableau.
- Strong problem-solving skills, attention to detail, & excellent communication / collaboration skills.
- Proven ability to work independently and in a team.
- Knowledge of scripting languages for test automation is a plus (Python and / or JavaScript preferred).
- Experience with real-time data streaming technologies is preferred (Amazon Kinesis or Apache Kafka preferred).
- Bachelors in Computer Science, Information Technology, or related field is preferred.
30+ days ago