Proven work experience as a Site Reliability Engineer, Systems Engineer, or similar software engineering role. Site Reliability Engineering(SRE) at TikTok combines software and systems engineering to build and run large-scale, massively distributed, and fault-tolerant systems. The teams within USDS ...
Candidates must have prior experience working as a reliability engineer in the semiconductor industry, with an in-depth understanding of semiconductor device reliability. IC device failures, analyzing reliability test results, tracking device parameter trends, and using acceleration models to predic...
Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the Ads data platform area, you will have the opportunity to manage the services and infrastructures in one...
Balance feature development speed and reliability with well-defined service level objectives. Previous success in technical engineering. ...
TikTok mobile reliability team ensures TikTok apps have reliability, uptime appropriate to our users and have a fast rate of improvement. Develop and maintain our automation tools like static analyzers, reliability automation tests, data pipelines and monitors, reliability analysis tools. You will a...
As Reliability Engineer, you will play a key role in enabling new technologies in the early stages of development. Electrical Engineering, Mechanical Engineering, or related discipline. The ideal candidate has strong knowledge of engineering fundamentals, experience with failure modes of hardware in...
As a Secure Development Factory (SDF) Site Reliability Engineer - DevOps, you will be at the heart of Western Digital’s engineering process, delivering the software development tools and infrastructure that empowers engineering teams to develop and deliver high quality products quickly. Site Reliabi...
Join our team at NVIDIA as a Senior Site Reliability Engineer focused on HPC storage and play a crucial role in designing, implementing, and optimizing on-prem High-Performance Computing (HPC) storage solutions while harnessing the power of cloud computing. You will collaborate closely with engineer...
Senior) Site Reliability Engineer (m/f/d). Profound experience in the field of site reliability engineering or comparable activities. Profound experience in the field of site reliability engineering or comparable activities. Responsibility for the reliability, availability and performance of our Int...
We are looking for a Lead Reliability Engineer to spearhead reliability efforts specifically tailored for datacenter and high-performance computing (HPC) applications. Stay abreast of emerging technologies and industry trends in datacenter and HPC reliability engineering, leveraging this knowledge t...
Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and availability using the combination of software and systems engineering practices. Senior Site Reliability Engineer - DGX Cloud. SRE at NVIDI...
Participate in technical operations and rotations in response to performance and reliability issues. Graduate with Bachelor's or Master's degree in Software Development, Computer Science, Computer Engineering, or a related technical discipline. ...
We are hiring a Site Reliability Engineer to join our newly established SRE team. You will work closely with our cloud engineering and software development teams to design, implement, and maintain systems that ensure the high availability, performance, and security of our platform. Additionally, you...
We’re looking for great SREs, as well as software engineers interested in production engineering, to help us scale the largest enterprise security cloud infrastructure in the world. You will design and enhance software architecture to improve scalability, service reliability, capacity, and performan...
As a Staff Site Reliability Engineer, you’ll be the subject matter expert with operating systems and networking. You can plan, lead, and execute strategic objectives for the team or all of engineering. SRE or Software Engineering role. You’re passionate about mentoring junior engineers and you belie...
Work with electrical engineers (EE), mechanical engineers (ME), and process engineers to analyze failure modes and identify risks in the optical, electrical, and mechanical systems of LiDAR products. Bachelor’s or Master’s degree in Optical Engineering, Mechanical Engineering, Electrical Engineering...
Enabling the movement towards advanced chip design, KLA's Global Products Group (GPG), which is responsible for creating all of KLA’s metrology and inspection products, is looking for the best and the brightest research scientist, software engineers, application development engineers, and senior pro...
Digital: DevOps, Digital: Site Reliability Engineering (SRE). ...
At least 5 years in a Site Reliability Engineering, DevOps, or infrastructure-focused role. The Apple Services Engineering (ASE) team is one of the most exciting examples of Apple's long-held passion for combining art and technology. Engineers here collaborate to uphold a unified vision that include...
This position will liaise closely with our engineering, research and development, and information security teams to ensure that the IT department is working safely with the latest technologies. ...
Currently we are looking for Site Reliability Engineers to join our team to support and advance that mission What You'll Do Site Reliability Engineering (SRE) of AML (Applied Machine Learning) team combines system engineering and the art of machine learning to develop and run massively distributed A...
Phone and Core Technologies Operations Reliability Engineers are responsible for guiding development and operations teams toward generating reliable designs for Apple's new technology components, modules, and overall products. Summarize Reliability results and share findings with cross-functional pa...
The Reliability/Failure Analysis Engineer will perform technical planning, integration, verification and validation, cost and risk, and reliability and effectiveness analyses for microwave high power amplifier (HPA) products (SSPA < TWTA). Santa Clara, CA for a Reliability/Failure Analysis Engineer....
Location RTP/NC and SanJose CA.MustHaveTechnical/Functional Skills:.Docker Kubernetes Ansible Python Shell scriptingetc.Candidate should have good knowledge inK8s.Mandatory and good knowledge with K8sstorage and networking.Should have deployed applications inKubernetes.Good knowledge in Linux andAdv...
We are looking for a passionate Embedded Site Reliability Engineer who will lead the technical strategy and vision for our underpinning infrastructure, alerting & monitoring, infrastructure provisioning, networking, and development tooling in collaboration with other engineering teams and leadership...