Talent.com
VP, Reliability Engineer - OnePay
VP, Reliability Engineer - OnePayZipRecruiter • Bentonville, AR, US
serp_jobs.error_messages.no_longer_accepting
VP, Reliability Engineer - OnePay

VP, Reliability Engineer - OnePay

ZipRecruiter • Bentonville, AR, US
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Overview

Job Description

Job Description :

Role Summary / Purpose :

As the pace and complexity of software development accelerates, maintaining the reliability, performance, and scalability of critical financial systems is more vital than ever. OnePay, as a strategic partner, expects seamless integration across APIs, batch processes, file transmissions, and operational workflows with rigorous

SLAs, blazing speed, and superior customer experience.

The VP, Reliability Engineer for OnePay will lead efforts to embed reliability engineering principles across the full Synchrony technology stack that enables OnePay, ensuring high availability, fault tolerance, and rapid incident response. This leadership role bridges software engineering, system administration, and DevOps to build resilient, self-healing processes and platforms that meet and push boundaries beyond Synchrony & OnePay standards.

Your mission is to drive innovation and continuous improvement in monitoring, alerting, automation, and root cause analysis to reduce downtime, improve system resiliency, and enhance end-user satisfaction through flawless operational performance.

Our Way of Working

We're proud to offer you choice and flexibility. At Synchrony, our way of working allows you to have the option to work from home, near one of our Hubs or come into one of our offices. Occasionally you may be required to commute to our nearest office for in person engagement activities such as business or team meetings, training and culture events.

Essential Responsibilities :

Lead reliability engineeringinitiatives for OnePay's full integration landscape, including APIs, batch processes, file transmissions, and operational workflows, ensuring they meet or exceed SLAs and service reliability objectives (SLOs).

Design and implementscalable, self-healing systemsthat promote fault tolerance and rapid recovery, minimizing customer impact while supporting OnePay's appetite for speed and responsiveness.

Collaborate closely with OnePay & Synchrony development teams, platform engineers, and product stakeholders to embed reliability best practices throughout the software lifecycle.

Establish rigorousmonitoring, alerting, and real-time telemetryfocusing on key KPIs that measure availability, latency, throughput, and error rates aligned with OnePay SLAs.

Own root cause analysis (RCA) and continuous improvement processes, systematically eliminating recurring issues and driving a culture of operational excellence and resilience.

Drive automation efforts to simplify operational complexity, including CI / CD pipeline integration, automated rollbacks, and canary deployments.

Coordinate cross-functional engagements between infrastructure, security, application teams, and Synchrony + OnePay business leaders to ensure unified reliability strategies and transparent communication.

Lead incident response efforts, including on-call leadership, rapid diagnostics, remediation, and stakeholder reporting to meet client expectations for responsiveness.

Influence and coach development teams on reliability design patterns such as graceful degradation, rate limiting, retry mechanisms, and service mesh integrations.

Stay current with fintech industry trends, OnePay-specific regulatory requirements, and emerging technology to anticipate and mitigate risk.

Facilitate & influence best practice sharing across our Reliability Engineering Community of Practice aimed to enhance enterprise & OnePay resiliency

Manage special projects and other duties as assigned.

Qualifications / Requirements

Bachelor's degree in Computer Science, Engineering, or related field, or equivalent experience (minimum 5 years combined experience in software development and systems reliability) OR in Lieu of degree, 10+ years of experience within Software development and Systems reliability.

Minimum 5 years' experience in full-stack development with technologies including Spring Framework, Java, REST APIs, and front-end UI frameworks.

Extensive experience designing, deploying, and troubleshootinglarge-scale distributed systemsand service-oriented architectures, ideally within payment or fintech ecosystems.

Proven track record in systems engineering, reliability engineering, and infrastructure operations.

Experience influencing development teams in planning product implementations to tackle and resolve technical debt

Deep understanding of API gateways, batch processing workflows, file transmission protocols, and operational monitoring best practices.

Strong experience with DevOps, CI / CD tools (Jenkins, Maven, Gradle), container orchestration, and cloud- deployments.

Deep understanding of monitoring, observability and alerting capabilities across a multitude of tools (Splunk, New Relic, Akamai, Grafana, ServiceNow)

Expertise creating OpenAPI / Swagger specifications and integrating with microservices and mobile / web clients.

Exceptional communication skills with demonstrated ability to collaborate across diverse technical teams, product managers, and stakeholders.

Familiarity with financial services compliance and security standards is a plus.

Proactive problem solver with a passion for innovation, automation, and delivering superior customer experiences.

Characteristics

Leadership program experience, such as SYF BLP or GE IT Leadership Program, is but not mandatory.

Deep fintech domain expertise, particularly in consumer payment processing and financing technologies like OnePay.

Agile methodology experience, working in dynamic, fast-paced environments.

Critical thinker with strong analytical, creative problem-solving skills.

Experience building mobile-first solutions and mobile app integrations.

Track record driving culture change toward reliability and operational excellence in large enterprise organizations.

Grade / Level : 12

The salary range for this position is 135,000.00 - 230,000.00 USD Annual and is eligible for an annual bonus based on individual and company performance.

Actual compensation offered within the posted salary range will be based upon work experience, skill level or knowledge.

Salaries are adjusted according to market in CA, NY Metro and Seattle.

Eligibility Requirements :

You must be 18 years or older

You must have a high school diploma or equivalent

You must be willing to take a drug test, submit to a background investigation and submit fingerprints as part of the onboarding process

You must be able to satisfy the requirements of Section 19 of the Federal Deposit Insurance Act.

New hires (Level 4-7) must have 9 months of continuous service with the company before they are eligible to post on other roles. Once this new hire time in position requirement is met, the associate will have a minimum 6 months' time in position before they can post for future non-exempt roles. Employees, level 8 or greater, must have at least 18 months' time in position before they can post. All internal employees must consistently meet performance expectations and have approval from your manager to post (or the approval of your manager and HR if you don't meet the time in position or performance expectations).

Legal authorization to work in the U.S. is required. We will not sponsor individuals for employment visas, now or in the future, for this job opening.All qualified applicants will receive consideration for employment without regard to , , , , , , , , or veteran status.

Our Commitment :

When you join us, you'll be part of an inclusive culture where your individual skills, experience, and voice are not only heard – but valued. Together, we're building a future where we can all belong, connect, and turn ideals into action. More than 50% of our workforce is engaged in our Employee Resource Groups (ERGs), where community and passion intersect to offer a safe space to learn and grow.

This starts when you choose to apply for a role at Synchrony. We ensure all qualified applicants will receive consideration for employment without regard to , , , , , , , , , or veteran status. We're proud to have an award-winning culture for all.

Reasonable Accommodation Notice :

Federal law requires employers to provide reasonable accommodation to qualified individuals with disabilities. Please tell us if you require a reasonable accommodation to apply for a job or to perform your job. Examples of reasonable accommodation include making a change to the application process or work procedures, providing documents in an alternate format, using a sign interpreter, or using specialized equipment.

If you need special accommodations, please call our Career Support Line so that we can discuss your specific situation. We can be reached at 1-866-301-5627. Representatives are available from 8am – 5pm Monday to Friday, Central Standard Time

Job Family Group :

Information Technology

J-18808-Ljbffr

serp_jobs.job_alerts.create_a_job

Reliability Engineer • Bentonville, AR, US