A company is looking for a Principal Data Engineer (PySpark).
Key Responsibilities :
Design and evolve scalable, distributed data infrastructure across cloud platforms
Build and maintain real-time and batch data processing pipelines supporting analytics and AI / ML workloads
Mentor and support data engineers, establishing best practices and code quality standards
Required Qualifications :
10+ years of software development and data engineering experience with ownership of production-grade data infrastructure
Bachelor's degree in Computer Science or a related field, or equivalent practical experience
Deep expertise in scaling Spark in production (Databricks, EMR, etc)
Proficient in Python with experience implementing software engineering best practices
Hands-on experience with both relational (MySQL / PostgreSQL) and NoSQL (MongoDB, DynamoDB, Cassandra) databases
Principal Data Engineer • Norfolk, Virginia, United States