Senior Site Reliability Engineer - Moloco Commerce Media

Moloco, Inc.
Redwood City, California, US
Full-time

The Impact You’ll Be Contributing to Moloco:

  • Build a state-of-the-art ad serving infrastructure for our commerce media platform
  • Manage the infrastructure that serves real-time ad decisions based on our machine learning (ML) models and self-manageable ad campaigns
  • Maintain and improve the CI/CD pipeline to deploy infrastructure and code updates in live environments
  • Develop infrastructure tools and processes that improve the productivity of engineering teams
  • Cover traditional SRE and operational support areas such as tooling and automation, monitoring, workflow management, and maintaining and improving data pipelines

The Opportunity:

  • Customer Facing: Design, implement, and maintain highly available infrastructure that directly serves high volumes of customer requests
  • Large-Scale Server: Design and implement large-scale clusters capable of handling a wide range of requests with automatic scaling and resistance to cascading failures
  • Deployment Automation: Design and implement deployment pipelines, tightly integrated with code development, that can test, monitor, and automatically accept or reject new deployments
  • End-to-End Infrastructure Management: Collaborate with SWEs to develop end-to-end infrastructure solutions that minimize operating cost without compromising high availability or scalability

How Do I Know if the Role is Right For Me?

  • Bachelor's Degree or above in Computer Science or equivalent technical degree
  • Hands-on experience working with GCP or other cloud platforms (e.g. AWS, Azure)
  • Practical, proven knowledge of a high-level language (e.g. Go, Java, Python)
  • 5+ years of experience in a large-scale software development environment
  • Experience working with infrastructure-related software (e.g. Kubernetes, Helm, Terraform)
  • Experience developing infrastructure, configuration, and deployment scripting and automation for large-scale, high-complexity services in a microservices environment
  • Experience working with large-scale distributed systems
  • Passion for operational excellence and the ability to thrive in an environment where you provide extremely high levels of customer support
  • Strong verbal and written communication skills to collaborate effectively not only within the team but also with other infrastructure engineers across the organization
  • Tenacious problem solver who takes end-to-end ownership of issues through to full resolution

3 days ago