Data Engineer
Location: Palo Alto, CA
Hire type: Contract
Job Responsibilities:
Build and maintain an ecosystem of services and applications that support the order-to-result workflow in a multi-product clinical laboratory.
Build data ingestion pipelines to gather data from various sources, e.g., Salesforce Health Cloud, lab systems, and financial and billing applications.
Develop services to extract, load, and transform data into purpose-built data stores.
Initiate and lead technical design discussions within and across technical teams.
Create artifacts, such as design and implementation documents, to guide development, implementation, and support.
Code for efficiency, reusability, and scalability by following existing frameworks and tools.
Work with DevOps to develop and maintain automated deployments for a regular release cadence.
Provide second-tier production support.
Job Qualifications:
5+ years of experience developing production-quality data pipelines in Python, Scala, or Java.
3+ years of hands-on experience building custom ETL, with a focus on design, data modeling, implementation, and maintenance, to meet the reporting needs of Data Analysts and Data Scientists.
3+ years of experience developing in AWS and other cloud environments, e.g., AWS S3, AWS Glue / EMR, Apache Spark, Redshift, and Athena.
Good understanding of analytics-ready data formats such as Parquet, ORC, and JSON, and of open table formats such as Apache Iceberg and Apache Hudi.
Experience working in a fast-paced environment using an agile development framework, with an understanding of test automation and continuous integration.
Bachelor’s degree or higher in software engineering, computer science, or a related field.
Experience in the healthcare industry is highly desired.