Site Reliability Engineer

Black Rock GroupsWashington, DC, United States

job_description.job_card.variable_hours_ago

serp_jobs.job_preview.job_type

serp_jobs.job_card.full_time

serp_jobs.filters_job_card.quick_apply

job_description.job_card.job_description

Randstad is seeking a skilled and proactive Site Reliability Engineer (SRE) to join our client in the Washington D.C. area, focusing on optimizing the availability, performance, and scalability of critical production services. The ideal candidate will bridge the gap between development and operations by applying software engineering principles to infrastructure and operational problems. This role requires a strong background in CI / CD pipeline development, infrastructure automation using Infrastructure-as-Code (IaC), incident response, and deep experience with cloud platforms, preferably AWS. The SRE will collaborate across engineering teams to drive automation, enhance observability, and ensure the continuous, secure delivery of high-quality software.

Responsibilities

Deployment & Automation : Design, build, and maintain robust Continuous Integration / Continuous Delivery (CI / CD) pipelines utilizing tools suchs as GitHub Actions, Jenkins, or AWS CodePipeline.
Infrastructure-as-Code (IaC) : Automate the provisioning and management of cloud infrastructure using IaC tools like Terraform, CloudFormation, or AWS CDK.
Monitoring & Observability : Develop comprehensive monitoring dashboards, alerting rules, and logging configurations using platforms such as AppDynamics, CloudWatch, or Dynatrace to proactively ensure systems meet defined Service Level Objectives (SLOs).
Incident Response & Remediation : Participate in a rotating on-call schedule, triage and resolve high-priority incidents, and conduct blameless postmortem reviews to identify and implement root cause remediations.
Security & Compliance : Contribute to a DevSecOps culture by assisting with secrets management and integrating security scanning tools (e.g., AWS ECR, Checkmarx, Synk) directly into CI / CD pipelines.
Documentation & Knowledge Sharing : Create and maintain high-quality technical documentation, runbooks, and escalation procedures to ensure system readiness and operational efficiency.
Cross-Functional Collaboration : Partner with application developers, infrastructure engineers, and security teams to successfully deploy and sustain production-grade services.
Database Management : Apply knowledge of relational (MySQL, PostgreSQL) and NoSQL (MongoDB) databases to optimize database structures and contribute to data modeling efforts.

QualificationsRequired Experience & Technical Skills

2+ years of hands-on experience in a Site Reliability Engineering, DevOps, or Infrastructure support role.

Proficiency with at least one major cloud platform (AWS experience is strongly preferred).

Experience with building and managing CI / CD pipelines (e.g., Jenkins, GitHub Actions, AWS CodePipeline).

Proficiency in automating infrastructure with an IaC tool (e.g., Terraform, CloudFormation).

Strong working knowledge of Linux-based systems and shell scripting.

Familiarity with version control systems, particularly Git.

Understanding of core monitoring and alerting principles and experience with common observability tools.

Basic understanding of core cloud services (e.g., AWS S3, EFS, Kinesis) and basic troubleshooting techniques.

Willingness to participate in on-call rotations and take ownership of service reliability.

Education & Soft Skills

Bachelor's degree in Computer Science, Information Systems, Engineering, or a related technical field-or equivalent hands-on professional experience.

Strong desire to learn and continually grow expertise in automation, observability, and SRE best practices.

Excellent problem-solving, analytical, and communication skills to work effectively with diverse teams.

Required Skills :

Basic Qualification :

Additional Skills :

This is a high PRIORITY requisition. This is a PROACTIVE requisition

Background Check : No

Drug Screen : No

serp_jobs.job_alerts.create_a_job

Site Reliability Engineer • Washington, DC, United States

Job_description.internal_linking.related_jobs

serp_jobs.job_card.promoted

Staff Site Reliability Engineer

VisaAshburn, VA, United States

serp_jobs.job_card.full_time

Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30

serp_jobs.job_card.promoted

Reliability Engineer

JobotFrederick, MD, US

serp_jobs.job_card.full_time

Manufacturing company hiring Reliability Engineer in Frederick County!.This Jobot Job is hosted by : Christine McNamara.Are you a fit? Easy Apply now by clicking the "Apply Now" buttonand ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days

serp_jobs.job_card.promoted
serp_jobs.job_card.new

Sr Site Reliability Engineer - Remote

SitusAMCWashington, DC, United States

serp_jobs.filters.remote

serp_jobs.job_card.full_time

SitusAMC is where the best and most passionate people come to transform our client’s businesses and their own careers.Whether you’re a real estate veteran, a passionate technologist, or looking to ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours

serp_jobs.job_card.promoted

Staff Site Reliability Engineer (Federal)

OktaWashington, DC, United States

serp_jobs.job_card.full_time

Okta is The World's Identity Company.We free everyone to safely use any technology, anywhere, on any device or app.Our flexible and neutral products, Okta Platform and Auth0 Platform, provide secur...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30

serp_jobs.job_card.promoted

Site Reliability Engineer (Pipeline)

Technica CorporationWashington, DC, United States

serp_jobs.job_card.full_time

At Technica Corporation, our goal is to provide exceptional professional services and innovative technology solutions that meet or exceed our customer’s expectations. We specialize in a wide range o...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day

Site Reliability Engineer (req-174)

CATHEXISTysons, VA, US

serp_jobs.job_card.full_time

serp_jobs.filters_job_card.quick_apply

Team CATHEXIS elevates the government contracting experience through rapid response, deep skill, and thoughtful problem-solving and communication. Our core capabilities are our top-tier program and ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days

serp_jobs.job_card.promoted

Site Reliability Engineer - Redmond WA

Redis EnterpriseWashington, DC, United States

serp_jobs.job_card.full_time

We built the product that runs the fast apps our world runs on.If you checked the weather, used your credit card, or looked at your flight status online today, you’re welcome.At Redis, you’ll work ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30

serp_jobs.job_card.promoted

Site Reliability Engineer

VirtualVocationsRockville, Maryland, United States

serp_jobs.job_card.full_time

A company is looking for a Mid-Sr.Site Reliability Engineer with a focus on on-prem Kubernetes / K8s.Key Responsibilities Manage and maintain on-premise containerized environments Deploy resources...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30

serp_jobs.job_card.promoted

Sr. Manager - Site Reliability Engineer

VisaAshburn, VA, United States

serp_jobs.job_card.full_time

serp_jobs.job_card.promoted

Principal Site Reliability Engineer - Cloud (Remote)

Donnelley Financial, LLCRockville, MD, United States

serp_jobs.filters.remote

serp_jobs.job_card.full_time

Join a dynamic team at the pulse of global markets, where we deliver innovative software and service solutions for essential financial reporting and capital markets transactions.At DFIN, we are a v...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30

serp_jobs.job_card.promoted

Site Reliability Engineer

CSCI ConsultingQuantico, VA, United States

serp_jobs.job_card.full_time

CSCI Consulting is looking for a.Site Reliability Engineer (SRE).This role combines deep systems engineering knowledge with DevOps automation, proactive monitoring, and incident response practices....serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days

serp_jobs.job_card.promoted

Site Reliability Engineer - Developer, Connected Warfare

Anduril IndustriesWashington, DC, United States

serp_jobs.job_card.full_time

Site Reliability Engineer, Connected Warfare.Washington, District of Columbia, United States.Anduril Industries is a defense technology company with a mission to transform U.By bringing the experti...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30

serp_jobs.job_card.promoted

Software Reliability Engineer

RaftMcLean, VA, United States

serp_jobs.job_card.full_time

All of the programs we support require.All work must be conducted within the continental U.Distributed Data Systems, Platforms at Scale, and Complex Application Development, with headquarters in Mc...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days

serp_jobs.job_card.promoted

Site Reliability Engineer

Karsun SolutionsWashington, DC, United States

serp_jobs.job_card.full_time

Summary : As a Site Reliability Engineer, you will help build out and run production environments, automate operations and maintain and support infrastructure. Drive and establish Service level objec...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day

serp_jobs.job_card.promoted

Site Reliability Engineer, Home

Google Inc.Washington, DC, United States

serp_jobs.job_card.full_time

Experience completing work as directed, and collaborating with teammates; developing knowledge of relevant concepts and processes. At Google, we have a vision of empowerment and equitable opportunit...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day

serp_jobs.job_card.promoted
serp_jobs.job_card.new

Cloud Site Reliability Engineer

Ford Motor CompanyWashington, DC, United States

serp_jobs.job_card.full_time

Enterprise Technology is the engine driving the future of transportation.If you’re looking for the chance to leverage advanced technology to redefine the mobility landscape, enhance the customer ex...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours

serp_jobs.job_card.promoted

Site Reliability Engineer III

VerisignReston, Virginia, United States

serp_jobs.job_card.full_time

Verisign helps enable the security, stability, and resiliency of the internet.We are a trusted provider of internet infrastructure services for the networked world and deliver unmatched performance...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30

serp_jobs.job_card.new

Principal Site Reliability Engineer

DMV IT ServiceWashington, DC, US

serp_jobs.job_card.full_time

serp_jobs.filters_job_card.quick_apply

Principal Site Reliability Engineer.DMV IT Service LLC, founded in 2020, is a trusted IT consulting firm specializing in IT infrastructure optimization, cybersecurity, networking, and staffing solu...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours