Site Reliability Engineer - Senior (NE)

Ursus
San Diego, CA
Full-time

Description

  • Hands-on application management and support for AWS cloud environments, including full-stack diagnosis, fault resolution and root cause analysis.
  • Proactive monitoring of production systems and identify issues before service impact.
  • Drive and Implement monitoring tools / metrics / reports for tracking application / service performance.
  • Collaborate with engineering and system teams to drive changes and ensure optimal application performance and resiliency.
  • Lead service and system performance analysis, service capacity planning, and service continuity validation for multiple applications.
  • Identify areas for process automation, and develop automated scripts / tools to for regular operational activities.
  • Review and influence design, architecture, standards, and methods for deploying, monitoring and operating services and applications.
  • Actively participate and / or commit in the execution of tasks required to meet milestones and deliverables set by the SCRUM team throughout the release cycle.
  • Provide rotational on-call support.

Qualifications :

  • BS in Computer Science or equivalent experience
  • 3+ years professional Site Reliability experience operating at scale in high pace environment
  • 4+ years hands-on with AWS, Kubernetes, Infrastructure as Code, monitoring and alerting
  • Experience with building out Kubernetes cluster from scratch preferably using EKS
  • Extensive use of automation for Infrastructure as Code preferably via Terraform
  • Strong development experience in one of these languages Python or Go
  • Experienced user of one or more source code management tools, preferably Git
  • Should have experience with continuous integration, continuous delivery / deployment tools like Jenkins and ArgoCD

IND123

30+ days ago
Related jobs
Promoted
Staff Agency.com LLC (formerly Delta Hire, LLC)
CA, United States

Join a dynamic and innovative team as a Senior Site Reliability Engineer (SRE) and play a crucial role in shaping the future of our infrastructure. Another critical objective will be to develop and deploy new automation tools that streamline the DevOps pipeline. You will be free to bring your ideas,...

Promoted
VirtualVocations
San Diego, California

...

Promoted
Apple
San Diego, California

We are looking for a Site Reliability Engineer to be a member of our team. The successful candidate will write code to automate our processes to ensure reliability and manage thousands of compute and storage instances across large heterogeneous infrastructure. Experience with one or more of the foll...

Promoted
VirtualVocations
San Diego, California

A company is looking for a Midlevel Site Reliability Engineer. ...

Promoted
Indotronix Avani Group
San Diego, California

The expectation for a Senior Systems / Reliability Engineer is that they regularly serve on the systems engineering team and interact with electrical, mechanical, and firmware engineers to achieve verification goals, act as a leader of technical excellence, and participate as an agent for continuous...

Promoted
VirtualVocations
San Diego, California
Remote

A company is looking for a Site Reliability Engineer for a national remote position. ...

Motion Recruitment
San Diego, California

They are searching for a Senior Site Reliability Engineer with expertise in managing large-scale, mission-critical systems with zero downtime. This company is a leading global online fashion and lifestyle retailer dedicated to making fashion accessible and affordable for everyone. Knowledge of conta...

GEICO
San Diego, California

Our Senior Engineer is a key member of the engineering staff working across the organization to collaboratively design creative solutions to complex problems using automation. You will help drive our insurance business transformation as we transition from a traditional IT model to a tech organizatio...

Splunk Inc
California, United States

You will partner with senior engineers to solve difficult problems. Join us as we pursue our disruptive vision to make machine data accessible, usable and valuable to everyone. Learn more aboutSplunkcareers and how you can become a part of our journey!Role:Splunk is looking for a TechOps Engineer wi...

GEICO
San Diego, California
Remote

Engineers to innovate and build new systems, improve, and enhance existing systems as well as identify new opportunities to apply your knowledge to solve critical problems. You will help drive our insurance business transformation as we transition from a traditional IT model to a tech organization w...