About the team
TikTok video system is a world-leading video platform that provides multi-media storage, delivery, transcoding a part of US Tech Service department, we are responsible for building the next generation video processing platform which provides excellent experiences for billions of users around the world.
In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager / department.
We regularly review our hybrid work model, and the specific requirements may change at any time. About the role : This is a Site Reliability Engineer role, focusing on the data pipeline reliability for the Video Platform team in USDS.
Data SREs monitor data and keep production batch and realtime processing jobs up and running with the highest level of availability, ensuring our users have the freshest, complete and correct data possible.
Responsibilities : Manage day-to-day operations of data service, realtime / batch data pipelines, such as Service Level Agreement management, pipeline deployment, performance tuning and troubleshootingProactively monitor and troubleshoot data pipelines and systems for performance issues, errors, or anomalies Create tools, build alarms and dashboards, drive internal process improvements, and automation to monitor and improve data engineering operationsImprove systems reliability, efficiency, and velocity through scaling, optimization of both resources and data processing workflows, potentially refactoring code or implementing new solutionsDevelop and deploy new reliable and scalable data pipelines and infrastructure components as required by business needsWork closely with data engineering and various vertical teams within the Video Architecture platform
Minimum QualificationsBachelor's in Computer Science or a related technical background involving software / system engineering, or equivalent working experienceGood programming experience with SQL and at least one of the following languages : Java, Python, Go, or ScalaExperience in data engineering, with a focus on data systems reliability, scalability, and performance Preferred QualificationsSolid experience with big data technologies (.
Hadoop, Spark, Flink, YARN) and databases (SQL, NoSQL)Knowledge of data pipeline and workflow management tools (., Airflow, Luigi)Demonstrated independent thinking capabilities and troubleshooting skills in large scale distributed systemsGood communication and coordination skillsExperience in building data solutions with AWS, Azure and other cloud services is a plus Candidates for this position must be legally authorized to work in the United States.
This position is not eligible for visa sponsorship or support.