Principal Data Engineer, Machine Learning

SmithRx
San Francisco, CA, US
Full-time

Job Description

Job Description

Who We Are :

SmithRx is a rapidly growing, venture-backed Health-Tech company. Our mission is to disrupt the expensive and inefficient Pharmacy Benefit Management (PBM) sector by building a next-generation drug acquisition platform driven by cutting edge technology, innovative cost saving tools, and best-in-class customer service.

With hundreds of thousands of members onboarded since 2016, SmithRx has a solution that is resonating with clients all across the country.

We pride ourselves for our mission-driven and collaborative culture that inspires our employees to do their best work. We believe that the U.

S healthcare system is in need of transformation, and we come to work each day dedicated to making that change a reality.

At our core, we are guided by our company values :

  • Integrity : Always operate with honesty and transparency so we earn the trust of our clients.
  • Courage : Demonstrate the courage needed to take on a broken industry and continuously improve what we offer to optimize health outcomes.
  • Together : Foster a collaborative and inclusive environment that values teamwork, respect, and open communication, and encourages creativity and diversity of thought.

Job Summary :

SmithRx is leading the transformation of pharmacy benefit management (PBM) with a cutting-edge platform that delivers real-time insights, cost efficiencies, and exceptional customer experiences.

As we continue to expand, we are seeking an experienced Principal Data Engineer with expertise in data engineering and AI / ML.

In this key role, you will take ownership of driving innovation and leading the technology strategy for modern data platforms across data warehouse, tooling, integrations, and AI / ML.

You will collaborate with cross-functional leaders to deliver impactful data solutions that directly influence our business outcomes.

What you will do :

  • Lead the design and development of robust data architectures that support scalable, secure, and efficient data pipelines.
  • Architect, develop an enterprise data warehouse (EDW) and tooling that encompasses design patterns to scale and expand through integrations and automation of ETL / ELT pipelines as well as analytic layer to scale reporting and insights.
  • Develop strategies across the entire AI / ML project lifecycle. This includes seamless integration with data platforms, spanning from problem definition and data preparation to model deployment and performance monitoring.
  • Drive innovation by evaluating and implementing new technologies and tools that enhance our data platform's capabilities.
  • Drive excellence and standardization e.g. Optimize the performance of database systems, ensuring best practices in data security, access control, and compliance.
  • Ensure data quality, lineage, and resilience across production environments including monitoring, alerting, and recovery mechanisms to ensure 99% uptime and quick resolution of data pipeline issues.
  • Provide technical leadership, mentoring, and guidance to team members, establishing and enforcing best practices in data engineering and data science.
  • Influence and Collaborate with cross-functional teams & leadership, including product managers, engineers, data analysts, and business stakeholders

What you will bring to SmithRx :

  • BS, MS, or PhD in Computer Science, Information Systems, or a related field, with 15+ years of experience in data engineering, data science, or a similar role.
  • Strong expertise in data architecture, database design, and optimization, with experience in OLTP, OLAP, NoSQL, and cloud-based data warehouses (e.

g., AWS Snowflake, PostgresDB, DymanoDB, etc ).

  • Proficiency in programming languages such as Python, SQL, and tools like Spark, PySpark, Airflow, DBT, Snowflake, Cortext, OpenAI, and Terraform.
  • Proven experience architecting and designing AI / ML initiatives with a deep understanding of AI / ML algorithms and frameworks.

Nice to have - experience in developing and deploying ML models in production

  • Ability to lead cross-functional teams, influence stakeholders, and manage complex projects in a fast-paced environment.
  • Strong analytical and problem-solving skills, with the ability to handle evolving requirements and ambiguous challenges.
  • Excellent communication and presentation skills, capable of conveying complex technical concepts to both technical and non-technical audiences.

What SmithRx Offers You :

  • Highly competitive wellness benefits including Medical, Pharmacy, Dental, Vision, and Life Insurance and AD&D Insurance
  • Flexible Spending Benefits
  • 401(k) Retirement Savings Program
  • Short-term and long-term disability
  • Discretionary Paid Time Off
  • 12 Paid Holidays
  • Wellness Benefits
  • Commuter Benefits
  • Paid Parental Leave benefits
  • Employee Assistance Program (EAP)
  • Well-stocked kitchen in office locations
  • Professional development and training opportunities
  • 1 day ago
Related jobs
Promoted
Genentech
South San Francisco, California

Our Engineering team is seeking engineers with strong skills and hands-on experience in designing, constructing, and optimizing large-scale distributed systems, with a particular emphasis on machine learning infrastructure. By joining us as a Principal Machine Learning Engineer, Infrastructure (LLM)...

Promoted
University of California-Berkeley
Berkeley, California

The project scientist will make significant and creative contributions in the area of machine learning & data analytics. Develop machine learning approaches, computer vision tools to help pre-process dataset and annotations to generate groundtruth benchmarks. Expertise in databases, data infrastruct...

Promoted
HireIO Inc
San Francisco, California

Experience in one or more of the following areas: NLP, Ranking, Ads, search engine, recommender system, distributed system, and machine learning. Data construction, instruction tuning, preference alignment, and model optimization;. Excellent coding ability, data structures, and fundamental algorithm...

Adobe
San Francisco, California

We are looking for a world-class ML engineerof computer vision (CV)specialtyto lead the data quality, performance, and optimization of this platform. Deep expertise in computer vision, image processing, video processing,and machine learning for GenerativeAI. This work is powered by a platform compri...

Karkidi
San Francisco, California

Master’s Degree in “STEM” field (Science, Technology, Engineering, or Mathematics) plus 3 years of experience in data analytics, or PhD in “STEM” field (Science, Technology, Engineering, or Mathematics). Our areas of research include reinforcement learning, recommender systems, causal inference, and...

The Learning Experience #351
San Francisco, California

We're looking for Machine Learning Engineers to join our Product Engineering Team focusing on the full Project Experience for our customers. You'll be a crucial part of our product team, working closely with your engineering manager, team lead, designer, and other engineers to evolve the feature set...

Recruiting from Scratch
San Francisco, California

As a Senior Machine Learning Engineer, you'll play a crucial role in developing and implementing ML-driven features that enhance our platform's capabilities in the audit and advisory industry. This is an opportunity to join as an early engineer at a company with product-market fit that still has hug...

Skyrocket Ventures
CA, United States

Staff Machine Learning Engineer - Mission Driven Health Startup. Every day and week and month is slightly different, but it will generally be the traditional machine learning work flow of collecting data, processing it, pruning it, training models, then evaluating. Expertise in at least one of: 1) c...

EvenUp
San Francisco, California

Provide technical leadership and mentorship for a highly skilled team of data scientists and machine learning engineers, guiding them in solving complex business problems. Expertise in one or more areas of machine learning, such as deep learning, reinforcement learning, probabilistic modeling, or op...

Greylock
San Francisco, California

This will be a hands-on / applied machine learning role with a focus on Generative AI, RAG, and document understanding (Taxonomy, Digitization, Classification, Extraction, Validation, etc. ...