Talent.com
FedNow Principal Site Reliability Engineer

FedNow Principal Site Reliability Engineer

Federal ReserveSan Francisco, CA, United States
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
  • serp_jobs.job_card.part_time
job_description.job_card.job_description

Overview

Federal Reserve Bank of Boston - Federal Reserve Financial Services (FRFS) delivers a suite of payments services to financial institutions via FedLine Solutions, FedNowS M, Fedwire, National Settlement Service (NSS), FedCash, FedACH (Automated Clearing House), and Check Services. We are currently leading a strategic effort to transform FRFS to a national, enterprise-focused organization. Through our evolved structure, we will meet the needs of the marketplace for new products and services more quickly, seek to provide a more robust and unified customer experience across our financial service offerings, and create new career growth opportunities for FRFS staff.

The Federal Reserve has developed a new interbank 24x7x365 real-time gross settlement (RTGS) service with integrated clearing functionality, called the FedNow Service. This service enables financial institutions to provide their customers with the ability to send and receive payments any time, any day, and have full access to those funds within seconds. This position is a unique opportunity to be part of this mission-critical Federal Reserve initiative that is transforming the payments landscape in the United States.

The position will be primarily on-site with residency commutable to one of our offices required.

Responsibilities

  • As a Principal Engineer of the SRE / Production Operations team for FedNow, you will operate the production environment for the program.
  • You will architect, implement, and leverage solution monitoring and tooling to be used for capacity planning, utilization reporting, and scaling.
  • The team uses open source and proprietary software to support Engineering, DevOps, and DevSecOps tools, services, and solutions.
  • CI / CD and IaC Pipeline automation design and development.
  • Resiliency, DR and BCP (including testing).
  • The SRE / Production Operations team is part of the Technical Operations (TechOps) department and has the overall responsibility for the design, management and execution of operations required to support the ongoing technical and delivery needs of the FedNow Program, as well as the transition to production support and operations.
  • This team interfaces with internal stakeholders, customers for planning, delivery, and service management.
  • It owns ongoing ITIL processes, and the implementation and driving of continuous improvement initiatives.
  • You will work closely with Engineers and Architects of the FedNow program in order to maintain seamless automation across the entire platform.
  • Proactively identify suspected gaps in system architecture and design experiments to expose them.
  • The ideal candidate is someone who loves building and maintaining reliable and scalable systems, CI / CD tooling, and automating cloud-based highly available, high performing applications.

Key Skills

  • Strong communication and collaboration skills
  • Extensive knowledge and understanding of working in AWS environments & services
  • EC2, EBS, EKS, RDS, Aurora, S3, Route 53, ELB, IAM, etc.
  • Hashicorp Terraform, Consul, Vault, and Ansible
  • Automation experience preferably using GitLab
  • Experience with scripting languages preferably Python for automated processes
  • Experience working in Linux environment and shell scripting
  • Experience supporting infrastructure for large multi-services applications
  • Experience working with continuous deployment in micro-services architectures
  • Experience working with Docker, Containers, ECR and EKS.
  • Observability - CloudWatch, OpenSearch, Dynatrace, Grafana, Prometheus
  • Familiarity with Fault Injection tooling (i.e. AWS Fault Injection Simulator, Gremlin, ChaosToolkit, Chaos Monkey)
  • Automation mindset to enable consistency and dependability in common actions
  • The Federal Reserve Bank of Boston is committed to provide equal employment opportunities to all persons without regard to race, color, religion, national origin, sex, sexual orientation, gender identity, age, genetic information, disability, or military service.

    All employees assigned to this position will be subject to FBI fingerprint / criminal background and Patriot Act / Office of Foreign Assets Control (OFAC) watch list checks at least once every five years.

    For this job, any offer of employment is contingent upon successfully passing a two-phase security screening. The first phase consists of the satisfactory completion of a physical examination (including a drug screening), reference checks, and a security investigation consisting of credit and criminal history checks.

    The second phase, which might not be complete until after you begin working at the Reserve Bank, is an additional risk-based security screening determined by the risk rating of the position. Depending upon the sensitivity of the position, this phase may include, and is not limited to, work and residency eligibility verification, and personal interviews with the candidate, references, and prior employers.

    All applicants must have resided in the United States for at least three (3) years.

    Full Time / Part Time Full time Regular / Temporary Regular Job Exempt (Yes / No) Yes Job Category Information Technology Family Group Work Shift First (United States of America)

    The Federal Reserve Banks are committed to equal employment opportunity for employees and job applicants in compliance with applicable law and to an environment where employees are valued for their differences.

    Always verify and apply to jobs on Federal Reserve System Careers or through verified Federal Reserve Bank social media channels.

    Privacy Notice

    #J-18808-Ljbffr

    serp_jobs.job_alerts.create_a_job

    Site Reliability Engineer • San Francisco, CA, United States

    Job_description.internal_linking.related_jobs
    • serp_jobs.job_card.promoted
    Site Reliability Engineer - SRE at Descope Los Altos, CA

    Site Reliability Engineer - SRE at Descope Los Altos, CA

    Itlearn360Los Altos, CA, United States
    serp_jobs.job_card.full_time
    Site Reliability Engineer - SRE job at Descope.Descope R&D group is a skilled team of developers with a unique DNA of creativity,flexibility,anopen mindset. We are looking for a passionate SRE to jo...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Site Reliability Engineer

    Site Reliability Engineer

    VirtualVocationsSan Francisco, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Site Reliability Engineer.Key Responsibilities Ensure near-zero downtime through monitoring, alerting, self-healing automation, and continuous improvement Create autom...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    DevOps Engineer / Site Reliability Engineer

    DevOps Engineer / Site Reliability Engineer

    HyperFiSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    We're building the kind of platform we always wanted to use : fast, flexible, and built for making sense of real-world complexity. Behind the scenes is a robust, event-driven architecture that connec...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Site Reliability Engineer I

    Site Reliability Engineer I

    prosper.comSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    As a Site Reliability Engineer I at Prosper, you will play a crucial role in enhancing the reliability, scalability, and maintainability of our technology platform. This entry-level position is desi...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
    • serp_jobs.job_card.promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PsiQuantumPalo Alto, CA, United States
    serp_jobs.job_card.full_time
    Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    VirtualVocationsConcord, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Lead Site Reliability Engineer (M365).Key Responsibilities Lead the team in identifying system and service issues, and manage the deployment of new versions Oversee pr...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Senior Site Reliability Engineer, Storage

    Senior Site Reliability Engineer, Storage

    Epoch BiodesignSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Crusoe Energy is on a mission to unlock value in stranded energy resources through the power of computation.Take a look at what we do! - https : / / www. We aim to align the long term interests of the c...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    • serp_jobs.job_card.new
    Senior Site Reliability Engineer (SRE) (Mountain View)

    Senior Site Reliability Engineer (SRE) (Mountain View)

    ACL DigitalMountain View, CA, United States
    serp_jobs.job_card.full_time
    Senior Site Reliability Engineer (SRE).Responsibilities : Design, develop, and maintain automation frameworks for performance testing and monitoring of QuickBooks infrastructure.Ensure the scalabili...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
    • serp_jobs.job_card.promoted
    Senior Software Engineer - Site Reliability

    Senior Software Engineer - Site Reliability

    Ironclad Inc.San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Every dollar earned, relationship formed, and advantage gained comes down to the contract that makes it real.But getting a contract done is more complicated than it should be.And when contract data...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Software Engineer, Infrastructure Reliability

    Software Engineer, Infrastructure Reliability

    OpenAISan Francisco, CA, United States
    serp_jobs.job_card.full_time
    We’re hiring Software Engineers to join our Applied Infrastructure organization, and more specifically for our Database Systems and Online Storage teams. These teams operate with a high degree of au...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    VirtualVocationsSanta Clara, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Principal Site Reliability Engineer.Key Responsibilities Define and drive the SRE strategy, aligning reliability practices with business and technology goals Lead inci...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Site Reliability Engineer

    Site Reliability Engineer

    FortinetSunnyvale, California, United States
    serp_jobs.job_card.full_time
    At Fortinet, we strive to provide a supportive, collaborative environment where people are empowered to do the best work of their careers. Our team members enjoy solving complex problems, and obsess...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Site Reliability Engineer (SRE) - grok.com & API

    Site Reliability Engineer (SRE) - grok.com & API

    Pantera CapitalPalo Alto, CA, United States
    serp_jobs.job_card.full_time
    AI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Senior Software Engineer, Site Reliability Engineer (SRE)

    Senior Software Engineer, Site Reliability Engineer (SRE)

    harvey.aiSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    At Harvey, we’re transforming how legal and professional services operate — not incrementally, but end-to-end.By combining frontier agentic AI, an enterprise-grade platform, and deep domain experti...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Site Reliability Engineer

    Site Reliability Engineer

    WritemedSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Would you like to join one of the fastest-growing organizations with a goal of using the latest AI, GenAI, LLM, Cloud, and Digital Technologies to advance drug development and improve patient care ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    Air AppsSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    At Air Apps, we believe in thinking bigger—and moving faster.We’re a family-founded company on a mission to create the world’s first AI-powered Personal & Entrepreneurial Resource Planner (PRP), an...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Founding Site Reliability Engineer

    Founding Site Reliability Engineer

    Relevance AISan Francisco, CA, United States
    serp_jobs.job_card.full_time
    San Francisco, USA (Hybrid 3 days / week).At Relevance AI, our mission is to empower anyone to delegate work to the AI workforce. We’re building a new category of AI automation, enabling teams to crea...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    BasetenSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Site Reliability Engineer (SRE).Baseten powers inference for the world's most dynamic AI companies, like OpenEvidence, Clay, Mirage, Gamma, Sourcegraph, Writer, Abridge, Bland, and Zed.By uniting a...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30