Sr. Systems Development Engineer (DevOps), AWS Managed Operations (MO)

Amazon
Herndon, VA, United States
Full-time

Description

Do you love decomposing problems to develop products that impact millions of people around the world? Would you enjoy identifying, defining, and building software solutions that revolutionize how businesses operate?

AWS Utility Computing (UC) provides product innovations from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS’s services and features apart in the industry.

As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Internet of Things (IoT), Platform, and Productivity Apps services in AWS.

Within AWS UC, Amazon Dedicated Cloud (ADC) roles engage with AWS customers who require specialized security solutions for their cloud services.

Would you enjoy diving deep into operating and improving some of the largest software systems humanity has ever built? Do the challenges that come with driving technical, business, and cultural change to improve reliability, performance, and efficiency excite you?

The AWS Managed Operations (MO) organization was founded in April 2023 with the objective of reducing operational load and toil through long-term engineering projects.

Managed Operations (MO) is building a best-in-class engineering and operations team that will own the day-to-day operations for AWS Regions, improving the availability, reliability, latency, performance, and efficiency of operating those Regions.

Amazon is looking for highly motivated Senior Systems Development Engineers who can balance the day-to-day operations of AWS’ software systems with long-term software engineering to reduce operational toil.

We need engineers who enjoy constantly learning and diving deep into the wide range of systems and technologies that make up one of the world’s largest cloud providers.

Key job responsibilities

You’ll spend roughly 50% of your time operating production systems and 50% making long-term improvements to the reliability, availability, and performance of those software systems.

An example week could look like this: Monday morning you root-caused why some deployments recently failed, and in the afternoon you made fixes for those bugs.

Tuesday you realized there was a common thread to the previous day’s bug fixes, and designed a solution to that class of problem, seeking feedback from your team.

On Wednesday you investigated a Service Level Objective (SLO) that recently became less than useful. You dove deep, talked with the partner team, and found out the thresholds no longer made sense, so you updated their infrastructure as code (IaC) to fix it.

Then on Thursday and Friday you were developing software with your team on a system you designed, which safely replaces the fleets in your team’s care with a more optimal hardware type, increasing performance while decreasing carbon emissions.

A day in the life

You’ll spend roughly 50% of your time operating production systems and 50% making long-term improvements to the reliability, availability, and performance of those software systems.

Over the course of a week, this could look like the following: Monday morning you root-caused deployments that recently failed, and in the afternoon you made fixes for those bugs.

Tuesday and Wednesday you executed a highly sensitive, time-critical change to production. Thursday and Friday you were developing software with your team to remove humans from the loop on problems like the ones you worked on over the previous two days, driving a common source of error out of the system and improving its reliability.

About the team

Diverse Experiences

AWS values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply.

If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.

About AWS

Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating; that’s why customers, from the most successful startups to Global 500 companies, trust our robust suite of products and services to power their businesses.

Inclusive Team Culture

Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empowers us to be proud of our differences.

Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.

Mentorship & Career Growth

We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.

Work / Life Balance

We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture.

When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.

Basic Qualifications

6+ years of experience deploying and operating in a Linux / Unix environment
Experience with Linux / Unix
Experience programming with at least one modern language such as Python, Ruby, Golang, Java, C++, C#, Rust
Experience leading the design, automation, deployment, and support of large-scale infrastructure
Experience with CI / CD pipelines and build processes

Preferred Qualifications

10+ years of experience deploying and operating in a Linux / Unix environment
3+ years of experience with a development, programming, or scripting language (Python / Java / Bash / Perl)
Experience taking a leading role in building complex software or computing infrastructure that has been successfully delivered to customers
Proficiency in one or more scripting languages (Bash, Python, Ruby, Perl)
Experience developing distributed service applications and user experience scenarios
Understanding of the AWS environment, including VPC, EC2, EBS, S3, SQS, CloudFormation, and Lambda
Proven ability to troubleshoot and identify the root cause of issues
A history of dealing well with ambiguity, prioritizing needs, and delivering measurable results in a dynamic environment
Experience with maintaining distributed systems and web services
Automation, testing, or monitoring framework development
Experience in a 24x7 production environment, especially one based on Linux

Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.

For individuals with disabilities who would like to request an accommodation, please visit
