Principal Site Reliability Engineer

Care.com
Dallas, TX, US
$180K-$200K a year
Full-time

Job Description

Job Description

About Care.com

Care.com is a consumer tech company with heart. We're on a mission to solve a human challenge we all face : finding great care for the ones we love.

We're moms and dads and pet parents. We have parents and grandparents, so we understand that everyone, at some point in their lives, could use a helping hand.

Our culture and our products reflect that.

Here, entrepreneurs, self-starters, team players, and big thinkers unite behind a common cause. Here, we're applying data analytics, AI, and the latest technologies to solve universal problems and connect people in new ways.

If you like having autonomy, if you thrive on collaboration and building new things, and if you're all about using your talent for good, Care.

com is the place for you.

Work Environment : Hybrid (In Office - Monday, Wednesday & Friday)

Locations : Salt Lake City Austin Dallas

What You'll Be Working On

As a Principal Site Reliability Engineer (SRE), you will be responsible for ensuring the reliability, scalability, and performance of our critical systems.

You'll lead incident response, manage releases, improve observability, and collaborate across development and operations teams to drive continuous improvements.

Key Responsibilities

  • Release Management : Coordinate releases for applications, ensuring efficient deployment and smooth rollbacks.
  • Incident Response : Lead incident management, facilitate root cause analysis, and continuously update response processes.
  • Monitoring & Alerting : Implement proactive monitoring, create dashboards, and set up real-time alerts for critical services.
  • Hypercare : Ensure system stability during critical post-release periods, monitoring performance and preventing incidents.
  • Collaboration with Dev & QA : Work closely with developers and QA teams to ensure performance benchmarks and observability goals are met.
  • SLI / SLA / SLO Management : Define and measure service levels for key workflows and APIs, ensuring alignment with business expectations.
  • Observability Maturity : Continuously assess and improve observability practices across teams, driving data-driven insights.

What You'll Need to Succeed

  • 6+ years of experience in SRE or DevOps roles with a focus on monoliths and distributed microservices in cloud environments (AWS, GCP).
  • Proficiency in CI / CD tools (Jenkins, Terraform, Ansible).
  • Strong experience with Kubernetes, Docker, and JVM-based monoliths.
  • Expertise in monitoring tools (SignalFX, Splunk, Amplitude) and production incident management.
  • Scripting skills (Python, Bash, or Groovy).
  • Strong understanding of cloud-based systems and containerization.
  • Excellent communication skills and a collaborative approach to working cross-functionally.
  • Experience optimizing large-scale, customer-facing platforms in fast-paced environments.

For a list of our Perks + Benefits, click here!

Care.com supports diverse families and communities and seeks employees who are just as diverse. As an equal opportunity employer, Care.

com recognizes the power of a diverse and inclusive workforce and encourages applications from individuals with varied experiences, perspectives, and backgrounds.

Care.com is committed to providing reasonable accommodations for qualified individuals with disabilities. If you need assistance or accommodation, please reach out to [email protected].

Company Overview :

Available in 21 countries, Care.com is one of the largest providers of online services for finding family care and care jobs, spanning in-home and in-center care solutions.

Since 2007, families have relied on Care.com for an array of care for children, seniors, pets, and the home. Designed to meet the evolving needs of today's families and caregivers, the Company also offers customized corporate benefits packages to support working families, household tax and payroll services, and innovations for caregivers to find and book jobs.

Care.com is an IAC company (NASDAQ : IAC).

Salary Range : $180,000 to $200,000.

The base salary range above represents the anticipated low and high end of the national salary range for this position. Actual salaries may vary and may be above or below the range based on various factors including but not limited to work location, experience, and performance.

The range listed is just one component of Care.com's total compensation package for employees. Other rewards may include annual bonuses and short- and long-term incentives.

In addition, Care.com provides a variety of benefits to employees, including health insurance coverage, life, and disability insurance, a generous 401K employer matching program, paid holidays, and paid time off (PTO).

19 days ago
Related jobs
Promoted
Hispanic Technology Executive Council
Irving, Texas

Certification or formal training in site reliability engineering concepts and practices. Engineering excellence and secure by design are important principles for our CISO organization. You will raise the bar on both our existing products but also the frameworks and capabilities that are engineered w...

Promoted
Canonical - Jobs
Dallas, Texas

As a Senior Site Reliability / Gitops Engineer you will. As an Senior SRE & Gitops engineer you'll be in a unique position to drive operations automation to the next level, both in our own private clouds as well as in the public clouds. Provide assistance and work with globally distributed e...

Promoted
Hispanic Technology Executive Council
Irving, Texas

The Site Reliability Engineer is responsible for leading a variety of engineering activities including the design, acquisition and deployment of hardware, software and network infrastructure in coordination with the Technology team. Proficiency in product engineering and administration, application ...

Promoted
Canonical - Jobs
Dallas, Texas

As a Site Reliability / Gitops Engineer engineer you will. As an SRE & Gitops engineer you'll be in a unique position to drive operations automation to the next level, both in our own private clouds as well as in the public clouds. Provide assistance and work with globally distributed engine...

Promoted
Semiconductor Components Industries, LLC
Richardson, Texas

Want to make an application Make sure your CV is up to date, then read the following job specs carefully before applying.In this highly visible role, the candidate will work with multiple business units, multidisciplinary teams and interface with Tier 1 customers to ensure we exceed customers’ needs...

Promoted
https:/wayup.com/sitemap.xml
Irving, Texas

The Site Reliability Engineer (SRE) will be responsible for ensuring the reliability, scalability, and availability of services across cloud and on-prem platforms, with a focus on OpenShift and Grafana. ...

WELLS FARGO BANK
Irving, Texas

Site Reliability Engineers leverage their experience as software and systems engineers to ensure applications onboarded to SRE are available, have full stack observability, introduce continuous improvement through code and automation, provide operational insight through analytics, continuously test,...

Federal Reserve System
Dallas, Texas

The Federal Reserve Bank of Dallas is seeking a highly motivated and experienced Software Engineer to join our - Site Reliability Engineering (SRE) team. Proven work experience as a Site Reliability Engineer or similar role (preferred). Relevant training and/or certifications as a Site Reliability E...

National Life Group
Addison, Texas

You will lead initiatives to improve overall system reliability, scalability, and maintainability. You will need to communicate effectively with stakeholders, including developers, QA engineers, and project managers. Broad engineering awareness of the following technical domains and expert technical...

Splunk Inc
Texas, United States

Learn more aboutSplunkcareers and how you can become a part of our journey!Role:Splunk is looking for a TechOps Engineer with the ability to provide day-to-day technical expertise for our Splunk Cloud Azure TechOps team and the Splunk organization. As a TechOps Engineer, you will be interfacing with...