Senior Site Reliability Engineer

Motion Recruitment
San Diego, California, United States
$140K-$170K a year
Full-time

This company is a leading global online fashion and lifestyle retailer dedicated to making fashion accessible and affordable for everyone.

With a strong commitment to innovation and sustainability, they operate on-demand production to ensure a smarter and more future-ready industry.

They are searching for a Senior Site Reliability Engineer with expertise in managing large-scale, mission-critical systems with zero downtime.

As part of their SRE team, you will collaborate with diverse teams to ensure their data services remain highly available and performant.

Your role will involve designing scalable solutions, implementing automation, and contributing to the reliability and security of their database infrastructure.

Required Skills & Experience

  • 3+ years of experience supporting mission-critical applications in a cloud environment
  • Proficiency in continuous integration/build systems, Java, SQL, and NoSQL databases
  • Familiarity with observability tools like Grafana, Prometheus, and Zabbix
  • Strong scripting/programming skills in Python or Go
  • Experience with Elasticsearch, Kafka, RabbitMQ, and Redis
  • Knowledge of container technologies like Docker, Kubernetes, or Mesos

What You Will Be Doing

  • Collaborating with cross-functional teams to optimize operational data handling
  • Participating in an on-call rotation to ensure 24/7 system availability
  • Managing critical services like Elasticsearch, Kafka, RabbitMQ, and Redis
  • Developing tools and processes to enhance system observability and resilience
  • Triaging site availability incidents and reducing MTTR to limit customer impact
  • Implementing Service Level Metrics & Objectives and enhancing monitoring capabilities
  • Creating and maintaining technical documentation, runbooks, and procedures

You will receive the following benefits:

  • Medical, dental, and vision coverage
  • 401(k) savings plan with company match and financial advisor access
  • Generous vacation, holiday, sick days, and employee discounts
  • Free weekly catered lunches, office snacks, and beverages
  • Dog-friendly office and free gym access

Applicants must be authorized to work in the US on a full-time basis, now and in the future.

30+ days ago