Search jobs > Mountain View, CA > Site reliability engineer

Staff Site Reliability Engineer (Mountain View, CA)

SmartThings
Mountain View, California, US
$169K-$253.5K a year
Full-time

Staff Site Reliability Engineer (Mountain View, CA)

Department : Behaviors, Execution and Foundation

Employment Type : Full Time

Location : Mountain View, CA

Reporting To : Angela Tan

Ensure you read the information regarding this opportunity thoroughly before making an application.

Description

We’re SmartThings, one of the leading IoT ecosystems in the world, creating the most effortless way for anyone to create a smart home.

As a wholly owned subsidiary of Samsung, our corporate offices are based in Minneapolis and the Bay Area.

More than 270 million people worldwide use SmartThings to control and manage their connected life. SmartThings delivers simple, powerful experiences across Samsung’s leading portfolio of phones, TVs, and appliances, and we offer the most versatile smart home experience as an open platform with a rich partner ecosystem.

As a founding member of Matter, we are a leader in the industry to help make smart homes more secure, reliable and seamless to use.

Like the smartphone revolution, smart home technology is transforming the way we interact with the world around us. With SmartThings products, we’re reducing global emissions, improving service industries, and creating a safer, smarter planet.

Come be a part of the transformation with us!

SmartThings Culture & Ways of Working

SmartThings’ dynamic culture continuously moves forward with agility and determination, providing an opportunity for impactful contributions across all roles.

Our commitment to diversity, equity, inclusion and belonging is deeply ingrained in our core values, fostering a culture that values, celebrates, and honors the unique perspectives and experiences of every individual.

At SmartThings, we’re creating immersive, interconnected experiences for both our customers and our team members. Our workplace mirrors this ethos, offering a versatile hybrid environment that nurtures personal connections and fosters collaborative efforts.

About The Team

SmartThings is seeking a Staff Site Reliability Engineer to be the technical leader on a newly formed SRE team whose mission is to drive platform reliability and operations improvements across critical areas such as availability, latency, efficiency, capacity, change management, monitoring, and incident response.

Key Responsibilities

  • Reliability of the platform KPI (SLI, SLOs)
  • Direct work using data (metrics)
  • Continuous analysis and highlight areas of reliability deficiency
  • Advocate, influence, and follow up on action items regarding reliability
  • Monitoring and metrics
  • Collaborate with global engineering stakeholders to establish a higher-level platform dashboard that will allow a shared centralized view of key platform metrics.
  • Work closely with service teams to build and maintain an SLO framework
  • Support critical hardware and software releases and product launches
  • Create, monitor, and maintain platform monitors and assist in triage
  • Actively participate and improve the SmartThings Incident Commander Group
  • Active participant in on-call rotation for incidents
  • Best practice development, adoption, and evaluation
  • Review post-mortems and look for areas of optimization patterns we should focus on, and work to correct these by working with teams or building company-wide best practices.
  • Facilitate a community of practice for operations and site reliability concepts to extend the capabilities of service teams through a culture of trust and team empowerment
  • Mentor engineers on Site Reliability Engineering principles, practices, and tools
  • Develop Platform Reliability Operational Health Guidance

Skills Knowledge and Expertise

What You Bring Day One (Required Qualifications)

  • Bachelor’s degree in Computer Science or Electrical / Computer Engineering or similar experience
  • 8 years of software engineering experience
  • 5 years of operational experience in improving Service Reliability, Availability, and Performance.
  • Advanced knowledge of distributed systems and network infrastructure protocols.
  • Demonstrated ability to manage, troubleshoot, and resolve incidents in distributed environments.
  • Experience solving problems.
  • Expertise in analyzing and fixing large-scale distributed systems
  • Experience with Observability tooling (e.g. Sumologic and Datadog)
  • Experience with AWS cloud technologies
  • Programming experience with an object-oriented programming language (eg. Java, Kotlin), and scripting languages (eg. Python).
  • Proficiency in Linux Operating Systems
  • Excellent communication skills, including the ability to build trust and influence others

What Could Set You Apart?

  • Experience working across time zones, geographies, languages, and cultures.
  • Experience working in the IoT Industry
  • Experience leading change initiatives or coaching cloud operations
  • A deep understanding of web technologies and site reliability engineering (SRE).
  • Experience as a technical lead
  • Experience working in a multi-cloud service provider environment

SmartThings Benefits

We offer an attractive compensation package with comprehensive health benefits, including medical, dental, vision, and mental health;

an HSA with employer contribution; life & disability insurance; FSAs for health and dependent care expenses; a competitive 401k with a 5% employer match, and more.

  • All of our employees enjoy unlimited PTO, 12 paid holidays, and a generous parental leave policy (8 weeks fully paid parental leave and 8 more fully paid weeks for childbirth recovery leave).
  • Eligible employees benefit from our education reimbursement program, and all employees enjoy access to learning resources through O’Reilly.
  • Our commitment to diversity, equity, inclusion and belonging is embedded into our culture and our work, and everyone has frequent opportunities to join forums and groups and participate in ongoing projects.

Compensation for this role for a candidate based in Minneapolis is expected to be between $169,015 and $253,523.

J-18808-Ljbffr

2 days ago
Related jobs
Promoted
Host Healthcare
San Jose, California

Our recruiters act not only as your dedicated travel career mentor but also become your number one advocate. Host Healthcare is an award-winning travel healthcare company with an immediate opening for this. Passionate and transparent team members have made Host Healthcare the agency of choice for ne...

Promoted
TravelNurseSource
Mountain View, California

TravelNurseSource is working with Triage Staffing to find a qualified Oncology RN in Mountain View, California, 94035!. Travel nurses can bring fresh perspectives to healthcare facilities, contributing ideas and insights that may lead to improvements in patient care, safety, and overall healthcare q...

Promoted
AlliedTravelCareers
Mountain View, California

AlliedTravelCareers is working with MedPro Healthcare Staffing to find a qualified CT Tech in Mountain View, California, 94035!. MedPro Healthcare Staffing is a Joint Commission-certified, leading provider of temporary and contract staffing services to healthcare facilities throughout the United Sta...

Promoted
Coast Medical Service
Mountain View, California

Coast Medical Service is a nationwide travel nursing & allied healthcare staffing agency dedicated to providing an elite traveler experience for the experienced or first-time traveler. Coast is featured on Blue Pipes' 2023 Best Travel Agencies and named a 2022 Top Rated Healthcare Staffing Firm ...

Promoted
AlliedTravelCareers
Mountain View, California

AlliedTravelCareers is working with AHS MedStat to find a qualified Occupational Therapist (OT) in Mountain View, California, 94040!. We keep our overhead low so that we can pay our travelers what they deserve and that is every penny we can get! For more information please visit our website at . App...

Promoted
Synopsys Inc
Mountain View, California

Scroll down to find an indepth overview of this job, and what is expected of candidates Make an application by clicking on the Apply button. We are seeking a talented and experienced professional to join our team as Staff SRE Engineer. The candidate should have an exceptional background in software ...

Promoted
AlliedTravelCareers
San Jose, California

AlliedTravelCareers is working with Triage Staffing LLC to find a qualified Mammography Tech in Mountain View, California, 94035!. Location:         Mountain View. Travel Radiology: Imaging Mountain View. We staff all four major divisions of acute care – nursing, lab, radiology, and rehab ther...

Promoted
Bayside Solutions
Sunnyvale, California

Site Reliability Engineer, Virtualization. Experience in managing, scaling, and troubleshooting Java applications. Working knowledge of common authentication schemes, certificates, and secure secrets management. SRE, virtualization, Linux, IaaS, OpenStack, CloudStack, Libvirt, QEMU, KVM, Java, Golan...

Promoted
TikTok
San Jose, California

At least 2 years of work experience in SRE of large-scale systems deployment with high reliability and scalability. Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience, and location. Deliver tools/s...

Promoted
Google Cloud - Minnesota
Sunnyvale, California

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to c...