FedNow Senior Site Reliability Engineer

Federal Reserve System
Cleveland, OH
Remote
Full-time
Part-time

Company

Federal Reserve Bank of Boston

Federal Reserve Financial Services (FRFS) delivers a suite of payments services to financial institutions via FedLine® Solutions, FedNow℠, Fedwire®, National Settlement Service (NSS), FedCash®, FedACH® (Automated Clearing House), and Check Services.

We are currently leading a strategic effort to transform FRFS to a national, enterprise-focused organization. Through our evolved structure, we will meet the needs of the marketplace for new products and services more quickly, seek to provide a more robust and unified customer experience across our financial service offerings, and create new career growth opportunities for FRFS staff.

The Federal Reserve has developed a new interbank 24x7x365 real-time gross settlement (RTGS) service with integrated clearing functionality, called the FedNow Service.

This service enables financial institutions to provide their customers with the ability to send and receive payments any time, any day, and have full access to those funds within seconds.

This position is a unique opportunity to be part of this mission-critical Federal Reserve initiative that is transforming the payments landscape in the United States.

While the position is open to any location and to remote work, residence near a Federal Reserve facility is preferred.

Responsibilities

As a Senior Engineer on the SRE / Production Operations team for FedNow, you will operate the production environment for the program.

You will architect, implement, and leverage solution monitoring and tooling to be used for capacity planning, utilization reporting, and scaling.

The team uses open source and proprietary software to support Engineering, DevOps, and DevSecOps tools, services, and solutions.

CI / CD and IaC pipeline automation design and development.

Resiliency, DR, and BCP (including testing).

The SRE / Production Operations team is part of the Technical Operations (TechOps) department and has the overall responsibility for the design, management and execution of operations required to support the ongoing technical and delivery needs of the FedNow Program, as well as the transition to production support and operations.

This team interfaces with internal stakeholders and customers for planning, delivery, and service management.

It owns ongoing ITIL processes and implements and drives continuous improvement initiatives.

You will work closely with Engineers and Architects of the FedNow program to maintain seamless automation across the entire platform.

You will proactively identify suspected gaps in system architecture and design experiments to expose them.

The ideal candidate loves building and maintaining reliable, scalable systems and CI / CD tooling, and automating cloud-based, highly available, high-performing applications.

Key Skills

Strong communication and collaboration skills

Confluence, Jira / Octane

Experiment analysis and documentation

Technical / functional expertise in tooling for ITIL, Agile, Project Management and SDLC

Extensive knowledge and understanding of working in AWS environments and services (EC2, EBS, RDS, Aurora, S3, Route 53, ELB, IAM, etc.)

HashiCorp Terraform, Consul, Vault, and Ansible

Automation experience, preferably with GitLab

Experience with scripting languages, preferably Python, for automated processes

Monitoring / measuring of KPIs with focus on RCA and corrective action

Experience supporting infrastructure for large multi-services applications

Experience working with continuous deployment in micro-services architectures

Experience in fault injection / experimentation and system attacks

Familiarity with fault injection tooling (e.g. AWS Fault Injection Simulator, Gremlin, Chaos Toolkit, Chaos Monkey)

Best practices in chaos engineering process and implementation (chaos gamedays, business critical KPIs, etc.)

Observability (CloudWatch, Dynatrace, Grafana, Prometheus)

Automation mindset to enable consistency and dependability in common actions

Test development and debugging experience

Full Time / Part Time

Full time

Regular / Temporary

Regular

Job Exempt (Yes / No)

Job Category

Work Shift

First (United States of America)

Always verify and apply to jobs on Federal Reserve System Careers or through verified Federal Reserve Bank social media channels.

30+ days ago