Search jobs > Austin, TX > Site reliability engineer

Staff Site Reliability Engineer (SRE)

H-E-B, L.P.
Austin, Texas, US
Full-time

H-E-B Digital is seeking new team members (Partners)! Since our inception, we’ve been investing heavily in our customers’ digital experience, reinventing how they find inspiration from food, how they make food decisions, and how they ultimately get food into their homes.

This is an exciting time to join H-E-B Digital, and we’re hiring across the stack : front-end web and mobile, full-stack, and backend engineering.

We’re using the best available technologies to deliver modern, engaging, reliable, and scalable experiences to meet the needs of our growing audience.

Our digital solutions are growing in popularity and adoption like Curbside and Home Delivery so you’ll get the opportunity to define the user experience for millions of customers and hundreds of thousands of Partners.

If you’re someone who enjoys taking on new challenges, working in a rapidly changing environment, learning new skills, and applying it all to solve large and impactful business problems, we want you as part of our team.

HEART FOR PEOPLE : you can organize multiple engineers, negotiate solutions, and provide upward communication.

Applying for this role is straight forward Scroll down and click on Apply to be considered for this position.

HEAD FOR BUSINESS : you consistently demonstrate and uphold the standards of coding, infrastructure, and process.

PASSION FOR RESULTS : you’re capable of high-velocity contributions in multiple technical domains.

As a Site Reliability Engineer , you’ll use your engineering skills to maximize reliability, availability, and efficiency of our systems, and improve workflow, automation, and scalability.

We are looking for :

  • 7+ years of related experience
  • Extensive experience applying software engineering principles in the context of infrastructure, reliability, and scalability
  • Expertise in one or more programming languages suited for SRE work (e.g., Python, Go, Java, Rust) with the ability to mentor others

What is the work?

Design & Development :

  • Design and lead the implementation of fault-tolerant architectures. Employ redundancy, self-healing mechanisms, and graceful degradation techniques to maintain service continuity despite unexpected failures.
  • Influence code architecture, development practices, and deployment approaches to enhance service reliability throughout the organization.
  • Act as a key technical leader in establishing reliability standards and procedures across projects.
  • Partner with senior architects to inform the design of large-scale, highly available distributed systems.
  • Coach and guide teams to independently enhance resilience capabilities.
  • Collaborate with development teams to design service architectures, software platforms and frameworks, perform capacity planning, and conduct launch reviews.

What is your background?

  • M.S. or B.S. in Computer Science or related field (or equivalent experience in large-scale distributed systems).
  • Extensive experience applying software engineering principles in the context of infrastructure, reliability, and scalability.
  • Expertise in one or more programming languages suited for SRE work (e.g., Python, Go, Java, Rust) with the ability to mentor others.

Do you have what it takes to be a fit as an H-E-B SRE?

  • Industry-recognized expertise in building solutions to reliability challenges that include systems and network engineering principles.
  • Strategic, visionary, and collaborative leader : Drive cross-organizational initiatives, influencing technology direction and setting reliability standards across teams.
  • Exceptional analytical and problem-solving skills, focused on systemic improvement : Identify systemic patterns, champion process innovation, and shape data-driven reliability roadmaps.
  • Proven effectiveness in high-stakes, high-growth environments : Excel at independently navigating ambiguity, balancing rapid execution with well-reasoned risk assessment.
  • Ability to translate complex technical concepts for diverse audiences : Effectively communicate across multiple levels of the organization, from engineers to senior leadership, to explain SRE concepts and advocate for strategic investments.
  • Dedication to fostering a culture of innovation and continuous learning : Mentor SRE leaders, promote technical excellence, and disseminate cutting-edge industry practices to continuously raise the bar for reliability at H-E-B.

Can you

  • Function in a fast-paced, retail, office environment?
  • Travel by car or plane with overnight stays?
  • Work extended hours; sit for extended periods; work rotating and on-call schedules?

What are the Perks?

  • A robust Benefits plan with coverage starting Day One.
  • Dental, vision, life, and other insurance plans; flexible spending accounts; short term / long term disability coverage.
  • Telehealth offers 24 / 7 access to board-certified doctors by phone.
  • Partner Guidance allows free counselor visits.
  • Funeral leave, jury duty, and military pay (subject to applicable law).
  • Maternal / paternal leave for new parents, including adoptions.
  • 10% off H-E-B brand products in-store and online.
  • Eligibility to participate in 401(k).
  • Opportunity to become a Partner-Owner after 12 months.

Who We Are

H-E-B is one of the largest, independently owned food retailers in the nation, operating over 400 stores throughout Texas and Mexico, with annual sales generating over $25 billion.

We hire talented people (109,000+ Partners) and give them autonomy to be creative in how they impact the business. We’re a Partner-driven company with a Bold Promise Because People Matter.

We embrace Diversity and Inclusion as core values, and support them with thriving company-wide programs. We’re a truly original Texas-based company that created the Spirit of Giving to help Texas communities every day.

Once eligible, our Partners become Owners in the company. Partner-owned means our most important resources People drive the innovation, growth, and success that make H-E-B The Greatest Retailing Company.

J-18808-Ljbffr

7 hours ago
Related jobs
Promoted
Liquibase
Austin, Texas
Remote

Site Reliability Engineer (SRE). Educate and guide Engineering teams on best practices wrt reliability, resiliency, security, etc. SRE experience supporting AWS-based, cloud-native applications. Develop, implement, and maintain robust monitoring and alerting solutions to ensure the reliability and p...

Promoted
Apple Inc.
Austin, Texas

Site Reliability Engineer – Software CSG. The Hardware Technology Compute and Storage Group is looking for a customer service oriented, self-driven, and motivated SRE to join our operations team with an emphasis on software automation. Support and improve the Hardware Technology engineering environm...

Unreal Gigs
Austin, Texas
Remote

Site Reliability Engineer (SRE). As a Site Reliability Engineer at. You’ll collaborate to implement reliability engineering practices such as service level indicators (SLIs) and service level objectives (SLOs) and enforce best practices for system reliability. Equivalent experience in site reliabili...

VISA
Austin, Texas

Recommend necessary changes to the system to DAP platform engineering by checking system activity and user logs for triaging and troubleshooting. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in t...

Circle
Austin, Texas

Staff Site Reliability Engineer (IV). Staff Site Reliability Engineer (IV). Staff Site Reliability Engineer. As a Senior Site Reliability Engineer at Circle, you will design, build, and maintain Circle's infrastructure estate to meet the growing worldwide customer base on public cloud providers acro...

Jesica.ai
Austin, Texas

Collaborate with software engineers, system administrators, and other SREs to define and implement reliability initiatives. Strong understanding of site reliability engineering principles and best practices. Conduct system reliability assessments, identify potential issues, and implement solutions t...

Galaxy i Technologies, Inc
Austin, Texas

PLM TEAMCENTER DEVELOPER   Roles & Responsibilities: • Specialist in Teamcenter PLM and Active Workspace (AWC) upgrade • Experience in Teamcenter 13.Experience in setting up replica/test/prototype environments based on Production • Experience in Te...

ServiceNow
Austin, Texas
Remote

As a Staff Hardware Reliability Developer you will be responsible for building hardware specific tools that scale systems through automation and evolve ServiceNow’s infrastructure by pushing for changes that improve reliability and ensure optimal performance of the hardware that powers ServiceNow’s ...

LogicMonitor
Austin, Texas

Ready to step into the spotlight and play a pivotal role in enhancing the reliability and growth of the LogicMonitor platform? You'll be at the helm, overseeing a worldwide network of hybrid cloud computing services, ensuring they operate seamlessly. Collaborating closely with developers, you'll spe...

Oracle
Austin, Texas

As a Site Reliability Engineer, you will solve interesting technical challenges by defining, designing, deploying, and solving key Oracle Cloud services, platforms, and infrastructure, always thinking about reliability, scalability, resilience, security, and performance. As an SRE, you will be a gui...