
FedNow Senior Site Reliability Engineer

Federal Reserve System
St. Louis, MO
Remote
Full-time
Part-time

Company

Federal Reserve Bank of Boston

Federal Reserve Financial Services (FRFS) delivers a suite of payments services to financial institutions via FedLine® Solutions, FedNow℠, Fedwire®, National Settlement Service (NSS), FedCash®, FedACH® (Automated Clearing House), and Check Services.

We are currently leading a strategic effort to transform FRFS to a national, enterprise-focused organization. Through our evolved structure, we will meet the needs of the marketplace for new products and services more quickly, seek to provide a more robust and unified customer experience across our financial service offerings, and create new career growth opportunities for FRFS staff.

The Federal Reserve has developed a new interbank 24x7x365 real-time gross settlement (RTGS) service with integrated clearing functionality, called the FedNow Service.

This service enables financial institutions to provide their customers with the ability to send and receive payments any time, any day, and have full access to those funds within seconds.

This position is a unique opportunity to be part of this mission-critical Federal Reserve initiative that is transforming the payments landscape in the United States.

While the position is flexible on location and open to remote work, residence near a Federal Reserve facility is preferred.

Responsibilities

As a Senior Engineer of the SRE / Production Operations team for FedNow, you will operate the production environment for the program.

You will architect, implement, and leverage solution monitoring and tooling to be used for capacity planning, utilization reporting, and scaling.

The team uses open source and proprietary software to support Engineering, DevOps, and DevSecOps tools, services, and solutions.

CI / CD and IaC pipeline automation design and development.

Resiliency, DR and BCP (including testing)

The SRE / Production Operations team is part of the Technical Operations (TechOps) department and has the overall responsibility for the design, management and execution of operations required to support the ongoing technical and delivery needs of the FedNow Program, as well as the transition to production support and operations.

This team interfaces with internal stakeholders and customers for planning, delivery, and service management.

It owns ongoing ITIL processes, and it implements and drives continuous improvement initiatives.

You will work closely with Engineers and Architects of the FedNow program to maintain seamless automation across the entire platform.

You will proactively identify suspected gaps in system architecture and design experiments to expose them.

The ideal candidate is someone who loves building and maintaining reliable and scalable systems and CI / CD tooling, and automating cloud-based, highly available, high-performing applications.

Key Skills

Strong communication and collaboration skills

Confluence, Jira / Octane

Experiment analysis and documentation

Technical / functional expertise in tooling for ITIL, Agile, Project Management and SDLC

Extensive knowledge and understanding of working in AWS environments & services

EC2, EBS, RDS, Aurora, S3, Route 53, ELB, IAM, etc.

HashiCorp Terraform, Consul, Vault, and Ansible

Automation experience, preferably with GitLab

Experience with scripting languages, preferably Python, for automated processes

Monitoring / measuring of KPIs with focus on RCA and corrective action

Experience supporting infrastructure for large multi-services applications

Experience working with continuous deployment in micro-services architectures

Experience in fault injection / experimentation and system attacks

Familiarity with Fault Injection tooling

e.g., AWS Fault Injection Simulator, Gremlin, Chaos Toolkit, Chaos Monkey

Best practices in chaos engineering process and implementation

Chaos game days, business-critical KPIs, etc.

Observability

CloudWatch, Dynatrace, Grafana, Prometheus

Automation mindset to enable consistency and dependability in common actions

Test development and debugging experience

Full Time / Part Time

Full time

Regular / Temporary

Regular

Job Exempt (Yes / No)

Job Category

Work Shift

First (United States of America)

Always verify and apply to jobs on Federal Reserve System Careers or through verified Federal Reserve Bank social media channels.

30+ days ago