Site Reliability Engineer (SRE) - 2

Akina, Inc.
Annapolis Junction, Maryland, USA
Full-time

TS/SCI w/ Polygraph required

Approved for 60% telework

06-10-SRE

Description:

DevOps is a software development approach that brings developers and IT operations staff together. It relies on small, frequent changes to code.

This means frequent updates and testing of software, which result in very quick releases. DevOps combines two practices: Development and Operations.

A Site Reliability Engineer is an expert who utilizes the DevOps methodology and integrates IT operations into software management and deployment.

They ensure that the DevOps strategy is well implemented.

The Site Reliability Engineer is expected to have a good understanding of the software development lifecycle, know automation tools for building CI/CD (Continuous Integration / Continuous Deployment) pipelines, and have classical system administration experience.

They are expected to work across departments with managers, developers, and administrators to improve our software products for the customer.

Position Specific Skills:

Experience with automating the deployment, scaling, and management of containerized applications using Kubernetes and related tools.

Collaborate with developers to create and maintain CI/CD pipelines to ensure fast and efficient delivery of software. Troubleshoot and resolve issues related to Kubernetes infrastructure, applications, and networking.

Experience coordinating with development teams to streamline code deployment. Conduct system tests for security, performance, availability, and reliability.

Ensure the stable performance of the infrastructure in a large-scale setting, and know how to scale that infrastructure.

Required Skills:

  • Design and deploy Kubernetes clusters in a highly available and scalable manner. This includes unit testing, deployment, monitoring, and reporting.
  • Create container images and Helm charts.
  • Deploy Docker images and configure them on Kubernetes.
  • Implement and maintain monitoring, logging, and alerting solutions to ensure visibility and control over the Kubernetes environment.
  • Evaluate new technologies and tools to improve the Kubernetes-based infrastructure and provide recommendations to the team.
  • Troubleshoot issues with pods deployed in Kubernetes.
  • Document internal processes and procedures related to duties and responsibilities.
  • Automate the deployment, scaling, and management of containerized applications using Kubernetes and related tools.
  • Collaborate with developers to create and maintain CI/CD pipelines to ensure fast and efficient delivery of software.
  • Troubleshoot and resolve issues related to Kubernetes infrastructure, applications, and networking.
  • Coordinate with development teams to streamline code deployment.
  • Conduct system tests for security, performance, availability, and reliability.
  • Ensure code quality, test and distribute code updates, and monitor the health and stability of deployed products.
  • Ensure the stable performance of the infrastructure in a large-scale setting and know how to scale that infrastructure.
  • Have the ability to multitask and adapt to changes quickly.
  • Have high-level problem-solving and excellent communication skills.

LCAT Qualifications:

Five (5) years of experience in programs and contracts of similar scope, type, and complexity and a Master’s degree in Computer Science or related discipline from an accredited college or university are required;

OR Eight (8) years of experience in programs and contracts of similar scope, type, and complexity and a Bachelor’s degree in Computer Science or related discipline from an accredited college or university are required.

Two (2) years of additional SRE experience on projects with similar software processes may be substituted for a Bachelor’s degree.

A minimum of three (3) years of experience administering Docker, programming with C++ and/or Python, and programming on and administering Linux servers is required.

Akina is a Woman-Owned, Service-Disabled Veteran-Owned Small Business looking for talented and ambitious individuals to join our team.

We offer a generous compensation package that includes 24 days of PTO accrued annually and 11 federal holidays. Our 401(k) is 100% vested on your start date, and the company makes a direct contribution worth 10% of your salary.

Akina covers 100% of healthcare costs for employees and 50% for dependents. We offer educational assistance toward college classes and will cover costs associated with job-related training and certifications. Akina is committed to excellence and to creating innovative and flexible solutions for our clients.

We are a small company with an open ear to our employees' needs in order to attract and retain quality talent that enables our customer's mission.

www.akina-inc.com/careers
