Site Reliability Engineer - Video Platform - USDS Base pay range : $118,657 / yr - $187,200 / yr
Responsibilities :
Responsible for overall reliability of TikTok's video system, including video publishing and distribution.
Perform lifecycle management of production systems including change management, service deployment, operations and emergency response.
Monitor the system and respond to incidents to maintain system service level agreement (SLA), review and follow up all production incidents.
Perform capacity management of compute, storage and network bandwidth resources to ensure system stability and save infrastructure costs.
Provide strong support during big events to ensure the system is capable of consuming a large volume of Internet traffic.
Build tools, automations, visualizations and monitors to facilitate the operation and optimization of the global infrastructure.
Qualifications :
Bachelor\'s degree in Computer Science or a related technical background involving software / system engineering, or equivalent working experience.
Programming experience with at least one of the following languages : C, C++, Java, Python, C# or Go.
Knowledge of networking, operating system, database systems and container technology.
Understanding of microservice architecture and troubleshooting in large-scale distributed systems.
Experience with open-source systems such as Linux, MySQL, MongoDB, Redis and ELK.
Experience building solutions with AWS, Google Cloud, Azure and other cloud services is a plus.
Strong teamwork, self-motivation and good communication.
Other information :
As a condition of employment, all successful candidates must be able to establish authorization to work in the United States. Sponsorship or immigration-related benefits are not provided.
#J-18808-Ljbffr
Site Reliability Engineer • Mountain View, CA, United States