Site Reliability Engineer

Motion Recruitment
Arlington, Virginia, United States
Full-time

This company is looking for a Site Reliability Engineer to lead a team responsible for building, managing, maintaining, and scaling the centralized infrastructure services that support our mission-critical operations.

The company is located in Herndon, VA, and will remain remote-friendly, requiring a couple of days on site each month.

What You Will Be Doing:

  • Oversee the design of software solutions that integrate Open Source, Commercial Off-The-Shelf (COTS), and custom-developed components.
  • Deploy, configure, and manage services across production, QA, and development environments on platforms such as OpenStack and Docker.
  • Build and manage infrastructure using Terraform.
  • Develop deployment automation tools using Ansible.
  • Create automation and configuration management solutions with SaltStack and Jenkins.
  • Implement encryption solutions with HashiCorp Vault.
  • Contribute to the development of a large-scale Software-Defined Network (SDN) using Guardicore.
  • Document processes, procedures, configurations, and deployment plans.
  • Collaborate with technical teams to implement systems and software.
  • Occasionally provide operational support, including troubleshooting and problem resolution.
  • Offer technical leadership in operational processes and change management, while mentoring less experienced engineers.
  • Provide regular progress updates to management.
  • Participate in a 24x7 on-call rotation.

Required Skills & Experience:

  • Bachelor’s degree in Computer Science, a related technical field, or equivalent education and experience.
  • 8+ years of experience in developing and managing mission-critical systems.
  • In-depth knowledge of Linux configuration and administration.
  • Proficiency in a high-level scripting language such as Python.
  • Extensive experience with automation, including not only building it but also understanding its purpose and the key areas where it should be applied.
  • Strong grasp of infrastructure-as-code principles.
  • Excellent written and verbal communication skills, with the ability to clearly explain complex issues.
  • Solid understanding of network protocols and security practices.
  • Experience building and optimizing monitoring and reporting solutions using tools like Grafana and Splunk.
  • Familiarity with development tools such as GitHub, Jira, and Confluence.

Preferred Skills and Experience:

  • Expertise in deployment automation using tools like Ansible.
  • Hands-on experience with Jenkins in a continuous integration and delivery environment.
  • Experience with Docker or Kubernetes in a production setting.
  • Familiarity with OpenStack in production environments.
  • Knowledge of HTTP proxies like Squid.
  • Experience working with Red Hat Enterprise Linux and/or FreeBSD.
  • Familiarity with CMDB and ITIL platforms such as ServiceNow.
  • Experience with Red Hat Identity Management and/or FreeIPA.
  • Administration of Linux and Unix systems in large-scale environments.
  • Experience with VMware in a production environment.
  • Familiarity with Agile methodologies, including Kanban and/or Scrum.
  • Experience in Registry Services, E-commerce, or ISP environments is a plus.

Applicants must be currently authorized to work in the United States on a full-time basis now and in the future.

This position doesn’t provide sponsorship.
