Search jobs > Washington, DC > Sr site reliability

Sr. Site Reliability Engineer

Celonis
Washington, District of Columbia, US
Full-time

The Role :

Read the overview of this opportunity to understand what skills, including and relevant soft skills and software package proficiencies, are required.

  • You will be part of a highly technical, collaborative and creative team, with a focus on SRE & Software Engineering.
  • Responsible for the design, implementation, reliability and management of cloud-based FedRAMP-compliant applications and platforms.
  • Responsible for application incident management escalations which involve troubleshooting complex technical problems and resolving application issues within defined service level objectives.
  • Design, write, and deliver software that enhances the availability, scalability, and efficiency of our services.
  • Partner with platform and application development teams to learn from incidents and improve the platform resiliency.
  • Share acquired knowledge and document accordingly while implementing SRE best practices.

The qualifications you need :

  • A bachelors or masters degree in a technical field (e.g. Computer Science, Software Engineering) or a comparable education.
  • Experience programming with Java, the Spring framework, and Python (or a similar scripting language in Linux environment).
  • A minimum of 5 years experience developing cloud based software applications.
  • Experience working with public cloud providers (AWS, Azure, or GCP) and modern cloud monitoring system observability frameworks (e.g., Datadog).
  • Experience in developing and running large-scale production services with elastic cloud services and Kubernetes.
  • Project experience of operation within the SRE domain.
  • Familiarity with CI / CD processes and tools (ArgoCD, GitAction, etc.).
  • Experience with infrastructure as code (Terraform, Kustomization).
  • Strong problem-solving skills and the ability to troubleshoot complex technical issues.
  • Excellent English verbal and written communication skills.

This position will not be eligible for any form of immigration visa sponsorship now or in the future.

J-18808-Ljbffr

8 days ago
Related jobs
Promoted
MetroStar
Washington, District of Columbia

Site Reliability Engineer (SRE). SRE with a strong understanding of SRE principles for highly scalable and reliable systems. Willing to work in downtown Washington, DC on client site at least 3 days per week. ...

Promoted
Palantir Technologies
Washington, District of Columbia

Site Reliability Engineers combine engineering experience and an innate drive to improve existing systems and processes, with the creativity to develop novel solutions to evolving challenges. We’re looking for Site Reliability Engineers who can help us build, operate, and maintain high-performance, ...

Promoted
WEX, Inc.
Washington, District of Columbia
Remote

The WEX Site Reliability Engineering (SRE) team is looking for individuals passionate about developing software and solutions focused on observability, incident response, reliability and performance, operational excellence, and compliance. Site Reliability Engineer or equivalent role. As part of the...

Promoted
TikTok
Washington, District of Columbia

BS degree in Computer Science, Computer Engineering, Electrical Engineering or relevant majors with 2+ years of working experience. The teams within USDS that deliver on this commitment daily span Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions an...

Promoted
Splunk Inc.
Washington, District of Columbia

We are looking for a Site Reliability Engineer to join our Splunk Cloud's Traffic Engineering team to help scale and secure the global Cloud networking infrastructure. Ability to mentor junior engineers on the team, provide technical direction, perform design/code reviews, and champion engineering b...

Promoted
System One
Washington, District of Columbia

As a Site Reliability Engineer (SRE), you’ll continuously drive improvements in observability, performance, and reliability, with the goal to make an impact across the federal government. Minimum of 8 years of experience as a Site Reliability Engineer with a strong understanding of SRE principles fo...

Promoted
Varada Consulting, LLC
Washington, District of Columbia

Varada Consulting, LLC is seeking a full-time highly skilled and experienced Site Reliability Engineer (SRE) to join our team. Apply Site Reliability Engineering (SRE) principles to design, build, and operate highly scalable and reliable systems that meet the needs of our customers. Minimum of 8 yea...

Computer World Services (CWS)Corporation
Washington, District of Columbia

The Senior Systems Engineer - Observability (SSE) will define and implement infrastructure and application observability, set up governance, optimization, monitoring, and control for a consolidated common operating picture for IT operations. The role will work with engineering, application, security...

Computer World Services
Washington, District of Columbia
Remote

The Senior Systems Engineer - Observability (SSE) will define and implement infrastructure and application observability, set up governance, optimization, monitoring, and control for a consolidated common operating picture for IT operations. The role will work with engineering, application, security...

Snapx
Washington, District of Columbia

Qualifications:</p> <ul> <li>10+ years of overall experience in IT including, with hands-on Development and Systems engineering background</li> <li>3-5 years of experience in a Site Reliability Engineering role</li> <li>Experience with Enterprise Cloud trans...