Principal DevOps Engineer - Machine Learning Data Engineering

Workday
Pleasanton, California, US
Full-time
We are sorry. The job offer you are looking for is no longer available.

Your work days are brighter here.

Applying for this role is straight forward Scroll down and click on Apply to be considered for this position.

At Workday, it all began with a conversation over breakfast. When our founders met at a sunny California diner, they came up with an idea to revolutionize the enterprise software market.

A culture driven by our value of putting our people first is central to who we are. Our Workmates believe a healthy employee-centric, collaborative culture is essential for success in business.

We look after our people, communities, and the planet while still being profitable. Feel encouraged to shine; you don’t need to hide who you are.

About The Team

This is an opportunity to be part of a growth team focused on ML DevOps and ML Ops. We build ML capabilities into our products, and you will be building part of the next generation of Workday technology.

As a DevOps engineer, you will help develop ML powered features and experiences for every user across our HR & Talent product portfolio.

About The Role

In this role, you would :

  • Work with multi-functional teams to deliver scalable, secure and reliable solutions.
  • Effectively engage with data scientists, ML engineers, PMs, and architects in requirements elaboration and drive technical solutions.
  • Own and develop features from end to end including infrastructure as code.
  • Design and build solutions for efficient organization, storage and retrieval of data to enable substantial scale.
  • Build systems and dashboards to monitor service & ML health.
  • Lead in architecture reviews, code reviews and technology evaluation.
  • Research, evaluate, prototype and drive adoption of new ML tools with reliability and scale in mind.

About You

Basic Qualifications

  • 6 or more years of validated industry experience.
  • Bachelor’s and / or Master’s degree, preferably in CS, or equivalent experience.
  • Design, implement, and maintain robust DevOps pipelines for deploying, monitoring, and scaling machine learning development and data engineering.
  • Stay abreast of industry trends and emerging technologies, providing recommendations for continuous improvement of our DevOps and machine learning practices.
  • Troubleshoot and resolve performance bottlenecks, system outages, and other operational issues in collaboration with the ML engineering teams.
  • Optimize public cloud-based infrastructure (AWS, Azure, or GCP) to support the computational requirements of machine learning workloads.
  • Implement and manage CI / CD workflows to automate testing, integration, and delivery of machine learning components.
  • Develop and maintain monitoring and alerting systems for proactively identifying and addressing issues within the machine learning infrastructure.
  • Ensure the security and compliance of machine learning platforms, implementing best practices for encryption, data protection, and access controls.
  • Experience in managing relevant tools like Databricks and Sagemaker to perform efficient computation and management of large scale data lakes.
  • Experience in supporting your work in production.
  • 6 or more years of DevOps or programming experience preferably in Python, Java or Scala.

Other Qualifications

  • Implementation and operation of distributed systems.
  • Experience of data and / or ML systems with ability to think across layers of the stack.
  • Experience with Databricks, Sagemaker, & Apache-Spark.
  • Experience in leading or mentoring other team members.

Workday Pay Transparency Statement

The annualized base salary ranges for the primary location and any additional locations are listed below. Workday pay ranges vary based on work location.

As a part of the total compensation package, this role may be eligible for the Workday Bonus Plan or a role-specific commission / bonus, as well as annual refresh stock grants.

Recruiters can share more detail during the hiring process. Each candidate’s compensation offer will be based on multiple factors including, but not limited to, geography, experience, skills, job duties, and business need, among other things.

Our Approach to Flexible Work

With Flex Work, we’re combining the best of both worlds : in-person time and remote. Our approach enables our teams to deepen connections, maintain a strong community, and do their best work.

We know that flexibility can take shape in many ways, so rather than a number of required days in-office each week, we simply spend at least half (50%) of our time each quarter in the office or in the field with our customers, prospects, and partners (depending on role).

J-18808-Ljbffr

16 days ago
Related jobs
Unity
Remote, CA
Remote

As a machine learning and data science expert, you will guide and drive the application of machine learning solutions across the whole organization. Making requirements and communicating data and machine learning capability needs to the product and engineering teams as the authority in Data for ML. ...

Harnham
CA, United States

Machine Learning Engineering - Tech Lead. TEAM: Machine learning and data science is the core part of the company. Work with data scientists to build end-to-end machine learning models using Generative AI and LLMs. Join a strong team of 5 data scientists and engineers and work directly with the foun...

Square
Remote, CA, US
Remote

Machine Learning and Engineering industry experience (full stack ML experience). We use Machine Learning and Generative AI as an important part of our toolkit. Our machine learning systems monitor and surface suspicious activity (money laundering, illegal activity and terms of service violations) fo...

Deeproute.ai
Fremont, California

Develop Learning based planning algorithms for trajectories (deep learning, reinforcement learning, decision trees, etc) to ensuring that the vehicle behavior is natural, safe and smooth. Experience in at least one of: robotics research in motion planning, trajectory optimization, planning under unc...

Block
California, United States
Remote

Machine Learning and Engineering industry experience (full stack ML experience). We use Machine Learning and Generative AI as an important part of our toolkit. Our machine learning systems monitor and surface suspicious activity (money laundering, illegal activity and terms of service violations) fo...

CoStar Group
CA, Orange County

Machine Learning Engineer, Data Scientist or related role. Experience building data pipelines to collect data, train and test models, measure model performance, run inference on large datasets, and output results. Collaborate on the continued improvement of CoStar’s cloud-based machine learning envi...

Glocomms
CA, United States

Over 7 years of relevant development experience with a strong understanding of machine learning technologies (RecSys/NLP/CV). ...

Harnham
CA, United States

Lead Machine Learning Engineer. TEAM: Machine learning and data science is the core part of the company. Join a strong team of 5 data scientists and engineers and work directly with the VP of Engineering. As a Lead Machine Learning engineer you will…. ...

Prudential Financial
CA, US

Data Scientists, Data Engineers, Data Analysts and other professionals to implement machine learning models that will deliver stability, producibility, scalability and integration with other products and services. As a Lead, Machine Learning Engineer. Machine Learning and Deep Learning:. Understandi...

Angi
California
Remote

This is a leadership role, developing a team of dedicated machine learning scientists using state-of-the-art machine learning techniques with both structured and unstructured data. Angi is seeking an exceptional Director of Data Science and Machine Learning to be a driving force behind Angi's transf...