Our Technical Operations team manages the infrastructure, DevOps, and Site Reliability of our platform. We are looking for a Staff Cloud DevOps / Site Reliability Engineer to join our team.
Before applying for this role, please read the following information about this opportunity.
Qualifications
Bachelor's degree in Computer Science, Engineering, or a related field
7+ years of experience as a DevOps, Infrastructure, Operations, or Site Reliability Engineer (or as a software engineer with relevant experience)
At least 2 years of experience with each of the following:
Terraform
CI/CD using modern tools (GitOps)
Optional (not required, but considered a plus):
MLOps (building, orchestrating, and maintaining Machine Learning Pipelines)
Multi-cloud deployments (2 or more)
ArgoCD
Network management and VPNs
Responsibilities
Infrastructure: Maintain and contribute to Infrastructure-as-Code (Terraform)
DevOps and CI/CD Pipelines: Orchestrate pipelines using GitHub Actions, Helm, and ArgoCD
Site Reliability: Measure and monitor availability, latency, and overall service health; drive incident management and post-mortem analysis
The US base salary range for this full-time position is $180,000 to $280,000. In addition to base pay, total compensation includes equity and benefits.
Within the range, individual pay is determined by work location, level, and additional factors, including competencies, experience, and business needs.
The base pay range is subject to change and may be modified in the future.