Site Reliability Engineer - AML

ByteDance
San Jose
Full-time

Responsibilities

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join Us

At ByteDance, our people are humble, intelligent, compassionate and creative. We create to inspire - for you, for us, and for millions of users across all of our products.

We lead with curiosity and aim for the highest, never shying away from taking calculated risks and embracing ambiguity as it comes.

Here, the opportunities are limitless for those who dare to pursue bold ideas that exist just beyond the boundary of possibility.

Join us and make impact happen with a career at ByteDance.

The mission of our AML team is to push next-generation recommendation-based algorithms and platforms for the company. We also drive substantial impact for the core businesses of the company. Currently we are looking for Site Reliability Engineers to join our team to support and advance that mission.

What You'll Do

The Site Reliability Engineering (SRE) group of the AML (Applied Machine Learning) team combines systems engineering and the art of machine learning to develop and run massively distributed AI / recommendation systems around the world.

On the SRE team, you'll have the opportunity to sharpen your expertise in coding, performance analysis and large system operation, and get heavily involved in the process of hardware / capacity decision-making.

SRE ensures that the core machine learning services at ByteDance have the highest level of availability, and builds highly automated systems and pipelines.

Qualifications

1. Expertise in analyzing and troubleshooting distributed systems.
2. Bachelor's / Master's degree in Computer Science or a related technical field involving software development or systems engineering.
3. Experience programming in at least one of the following languages: Python, C / C++, or Go.
4. Solid background in algorithms and data structures.

Preferred qualifications:

1. Ability to design and maintain large-scale systems.
2. Strong understanding of code optimization and automation of routine tasks.
3. SRE experience with large-scale distributed systems.

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives.

Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life.

To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach.

We are passionate about this and hope you are too.

ByteDance Inc. is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws.

If you need assistance or a reasonable accommodation,

30+ days ago
Related jobs
TikTok
Mountain View, California

The teams within USDS that deliver on this commitment daily span across Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions and more. Participate as part of a global team to support site-up issues to ensure that services are reliable, fault-tolerant, ...

Zoom
San Jose, California

Be familiar with chaos engineering and fault injection tools (Chaos Monkey, Amazon Fault Injection Service). ...

TikTok
Mountain View, California

The teams within USDS that deliver on this commitment daily span across Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions and more. About the TeamThe Global E-commerce SRE team of US Tech Services works with engineering and product teams to build and run large-...

TikTok
San Jose, California

Our infrastructure team is seeking experienced site reliability engineers to build globally distributed edge platform for provisioning and deploying edge services. Our team operates a large network of edge POPs around the world to accelerate site traffic and cache CDN content. Collaborate with softw...

Apple
Cupertino, California

The Apple Service Engineering - Redis SRE team is looking for Site Reliability Engineers with experience in developing processes, tools, and automation for managing distributed systems in production environments. This role is for engineers who enjoy deep technical engineering that spans large cross-...

https:/www.energyjobline.com/sitemap.xml
Sunnyvale, California

Digital: DevOps, Digital: Site Reliability Engineering (SRE). ...

YO HR CONSULTANCY
San Jose, California

Location: RTP/NC and San Jose, CA. Must-have technical/functional skills: Docker, Kubernetes, Ansible, Python, shell scripting, etc. Candidate should have good knowledge of K8s. Mandatory: good knowledge of K8s storage and networking. Should have deployed applications in Kubernetes. Good knowledge in Linux ...

Dunhill Professional Search & Government Solutions
San Jose, California
Remote

The Site Reliability Engineer will be joining a team responsible for developing and maintaining tools, alerts, and dashboards to support the Technical Operations team in monitoring application health and performance. The engineer will be responsible for implementing improvements to processes to impr...

General Motors
Palo Alto, California

Lead Site Reliability engineering effort to improve anomaly detection, platform stability and resilience using modern best practice. Collaborate with engineering teams to analyze and provide inputs in architecture, infrastructure resources, observability to achieve reliability and scalability goals....

JPMorgan Chase Bank, N.A.
Palo Alto, California

QUALIFICATIONS: Minimum education and experience required: Bachelor's degree in Computer Applications, Computer Science, Electronic Engineering, or related field of study plus 5 years of experience in the job offered or as Site Reliability Engineer, Systems Engineer, Senior Technical Analyst, o...