Search jobs > Redwood City, CA > Senior site reliability

Senior Site Reliability Engineer - Moloco Commerce Media

Moloco, Inc.
Redwood City, California, US
Full-time

The Impact You’ll Be Contributing to Moloco :

Apply (by clicking the relevant button) after checking through all the related job information below.

  • Build a state-of-the-art ad serving infrastructure for our commerce media platform
  • Manage the infrastructure that serves real time ad decisions based on our machine learning (ML) models and self manageable ad campaigns
  • Maintain and improve the CI / CD pipeline to deploy infrastructure and code updates in live environments
  • Develop infrastructure tools and processes that improve the productivity of engineering teams
  • Traditional SRE / Operational support areas such as tooling and automation, monitoring, workflow management, maintaining and improving data pipelines, etc.

The Opportunity :

  • Customer Facing : Design, implement, and maintain highly available infrastructure directly facing customer requests with high levels of traffic
  • Large-Scale Server : Design and implement large scale clusters capable of handling a wide range of requests with automatic scaling and resistance against cascading failures
  • Deployment Automation : Design and implement deployment pipelines tightly integrated with code development that can test, monitor, and decide to refuse or accept new deployments automatically
  • End to End Infrastructure Management : Collaborate with SWEs to develop end to end infrastructure solutions to minimize operating cost without compromising on high availability and scalability

How Do I Know if the Role is Right For Me?

  • Bachelor's Degree or above in Computer Science or equivalent technical degree
  • Hands-on experience working with GCP or other cloud platforms (e.g. AWS, Azure)
  • Practical, proven knowledge of a high-level language (e.g. Go, Java, Python, etc.)
  • 5+ years of experience in large-scale software development environment
  • Experience working with infrastructure-related software (e.g. Kubernetes, Helm, Terraform, etc.)
  • Experience developing infrastructure, configuration and deployment scripting and automation for large scale / high complexity services in a microservices environment
  • Experience working with large-scale distributed systems.
  • Passionate about operational excellence and thrive in an environment where you are able to provide extremely high levels of customer support
  • High level of verbal and written communication skills to collaborate effectively not only within the team but also with other infrastructure engineers across the organization
  • Tenacious problem solver who takes ownership of issues from end-to-end to full resolution

J-18808-Ljbffr

3 days ago
Related jobs
Promoted
Moloco, Inc.
Redwood City, California

The Impact You’ll Be Contributing to Moloco:. You will be responsible for developing an ML-based online advertising platform for the rapidly growing retail media industry. Collaborate with other teams, including but not limited to Infra, Machine Learning, Data Science and Analytics, and production, ...

Promoted
Google
San Bruno, California

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. Master's degree in Computer ...

Promoted
Zoox
San Mateo, California

Zoox is looking for a site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous vehicles. Bachelor's degree in an engineering, mathematics, or related field and 2+ years of relevant experience. M...

Promoted
Zipline
South San Francisco, California

Interested in this role You can find all the relevant information in the description below.Do you want to change the world? Zipline is on a mission to transform the way goods move.Our aim is to solve the world’s most urgent and complex access challenges by building, manufacturing and operating the f...

Promoted
Apple Inc.
Cupertino, California

The Apple Service Engineering - Edge & Messaging SRE team is looking for Site Reliability Engineers to build and run the services that hundreds of millions of customers use every day. We're looking for a talented and passionate person who loves designing, engineering and running systems and infr...

Promoted
MOLOCO
Redwood City, California

As an entrepreneurial Product Designer, you will spearhead and expedite the product design efforts of our Moloco Commerce Media (MCM) product. Lastly, Moloco is a 2024 certified Great Place to Work! Check us out on Glassdoor and be sure to get an inside look at working at Moloco on Instagram, Twitte...

Promoted
Verkada
San Mateo, California

We are actively looking for a talented Site Reliability Engineer to join the Infrastructure team. Designed with simplicity and scalability in mind, Verkada gives organizations the real-time insight to know what could impact the safety and comfort of people throughout their physical environment, whil...

Rubrik
Palo Alto, California

Senior Site Reliability Engineers at Rubrik are systems/software engineers who ensure that Rubrik's infrastructure services run smoothly and have the capacity for future growth. As a Senior Site Reliability Engineer, you will be responsible for:. Minimum 3-5 years of experience as a Development, Dev...

TikTok
Mountain View, California

About the role:This is a Site Reliability Engineer role, focusing on the data pipeline reliability for the Video Platform team in USDS. TikTok video system is a world-leading video platform that provides multi-media storage, delivery, transcoding a part of US Tech Service department, we are responsi...

TikTok
Mountain View, California

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the Ads data platform area, you will have the opportunity to manage the services and infrastructures in one...