
Senior Staff Site Reliability Engineer

WEX Health, Inc.
San Francisco, CA
$156K-$208K a year
Full-time

About the Role

The WEX Site Reliability Engineering (SRE) team is seeking a Senior Staff SRE who is passionate about developing software and solutions focused on observability, incident response, reliability and performance, operational excellence, and compliance.

The team will be part of the Benefits Reliability organization, which supports our internal stakeholders and our Benefits Platform teams.

As part of the Benefits Reliability organization, you’ll have the opportunity to solve complex challenges and improve the quality of life of our engineering teams as well as our ability to serve our customers.

The ideal candidate will be a technical leader with a proven track record of designing, implementing, and managing complex systems at scale.

They will have a deep understanding of software development, cloud computing, and operational best practices. The Senior Staff SRE will work closely with engineering teams to ensure that our systems are reliable, performant, and secure.

How you’ll make an impact

Technical Leadership: Provide technical guidance and mentorship to other SREs and engineers. Lead the design and implementation of complex systems and solutions. Drive the adoption of SRE best practices across the organization.

System Design: Architect and implement highly available, scalable, and fault-tolerant systems. Optimize system performance and resource utilization. Proactively identify and mitigate risks to system reliability.

Incident Response: Lead incident response efforts, driving efficient resolution and post-incident analysis. Develop and implement processes to improve incident response capabilities.

Automation and Tooling: Design and develop automation tools to streamline operational tasks, improve system reliability, and reduce toil. Utilize monitoring and observability tools to gain deep insights into system behavior.

Collaboration: Work closely with development teams to ensure software design meets operational requirements. Foster a culture of collaboration and knowledge sharing across teams.

Capacity Planning & Performance Optimization: Forecast future capacity needs and implement strategies to ensure systems scale efficiently. Continuously identify performance bottlenecks and lead efforts to optimize system performance.

Security & Compliance: Champion security best practices and ensure that systems are designed and operated in compliance with industry standards and regulations.

Innovation: Stay current with emerging technologies and industry trends. Evaluate and introduce new tools and techniques to improve SRE practices and system reliability.

Experience you’ll bring

7+ years of hands-on experience as a Site Reliability Engineer or equivalent role

7+ years of development experience with at least one major programming language

Expert-level knowledge of cloud computing platforms (AWS and Azure)

Proven ability to lead complex technical projects and initiatives

Strong communication and collaboration skills, with the ability to influence and build consensus

Deep understanding of observability, logging, and monitoring technologies

Experience with a variety of RDBMS and NoSQL data stores

Expertise in containerization technologies such as Docker and Kubernetes

Expertise in infrastructure as code

Experience designing and building RESTful APIs

Extensive hands-on experience with Datadog, Splunk, or other tooling

Familiarity with Agile methodologies and practices

Extensive experience in providing and leading critical application support in a 24/7/365 high-availability environment

Experience with GitOps

BA/BS degree in Computer Science or related technical field, or equivalent job experience

This Senior Staff SRE role offers a unique opportunity to make a significant impact on the reliability and performance of WEX's critical Benefits systems.

You will play a key role in shaping the future of SRE at WEX and driving innovation across the organization.

The base pay range represents the anticipated low and high end of the pay range for this position. Actual pay rates will vary and will be based on various factors, such as your qualifications, skills, competencies, and proficiency for the role.

Base pay is one component of WEX's total compensation package. Most sales positions are eligible for commission under the terms of an applicable plan.

Non-sales roles are typically eligible for a quarterly or annual bonus based on their role and applicable plan. WEX's comprehensive and market competitive benefits are designed to support your personal and professional well-being.

Benefits include health, dental, and vision insurance, retirement savings plan, paid time off, health savings account, flexible spending accounts, life insurance, disability insurance, tuition reimbursement, and more.

For more information, check out the "About Us" section.

Pay Range: $156,000.00 - $208,000.00

1 day ago