Sr. Site Reliability Engineer

Pinterest
San Francisco, CA, United States
$125.6K-$258.5K a year
Full-time

About Pinterest:

Millions of people across the world come to Pinterest to find new ideas every day. It's where they get inspiration, dream about new possibilities and plan for what matters most.

Our mission is to help those people find their inspiration and create a life they love. In your role, you'll be challenged to take on work that upholds this mission and pushes Pinterest forward.

You'll grow as a person and leader in your field, all the while helping Pinners make their lives better in the positive corner of the internet.

Creating a life you love also means finding a career that celebrates the unique perspectives and experiences that you bring.

As you read through the expectations of the position, consider how your skills and experiences may complement the responsibilities of the role.

We encourage you to think through your relevant and transferable skills from prior experiences.

Our new progressive work model is called PinFlex, a term that's uniquely Pinterest to describe our flexible approach to living and working.

Visit our PinFlex landing page to learn more.

The IT Site Reliability Engineering (SRE) team at Pinterest plays a crucial role in ensuring the reliability, scalability, and performance of our internal IT systems and infrastructure.

We work behind the scenes to keep Pinterest's business operations running smoothly, applying software engineering principles to IT operations challenges.

Our team is responsible for:

  • Designing and maintaining robust, scalable IT infrastructure
  • Automating IT processes to improve efficiency and reduce manual toil
  • Implementing monitoring and alerting systems for critical IT services
  • Ensuring high availability and performance of internal tools and applications
  • Collaborating with other IT and EPD teams to improve system reliability and incident response
  • Driving continuous improvement in IT practices through data-driven decision making

What you'll do:

  • Lead critical projects to improve the scalability and reliability of our IT infrastructure
  • Design and implement automation solutions to streamline IT operations
  • Develop and maintain robust monitoring and alerting systems
  • Collaborate with cross-functional teams to solve complex technical challenges
  • Mentor junior team members and share your expertise through documentation and presentations
  • Participate in on-call rotations to ensure 24/7 reliability of our systems
  • Conduct post-incident reviews and implement preventative measures
  • Continuously evaluate and implement new technologies and best practices to enhance our IT operations

What we're looking for:

  • Strong software engineering skills with a focus on production-ready code
  • Deep understanding of IT infrastructure, cloud technologies, and system architecture
  • Proven experience in implementing SRE principles and best practices
  • Excellent problem-solving abilities, especially in complex and ambiguous situations
  • Track record of leading and delivering high-impact projects
  • Strong communication skills and ability to collaborate effectively across teams
  • Experience with monitoring, alerting, and observability tools
  • Passion for automation and continuous improvement
  • Ability to mentor junior team members and contribute to a positive team culture
  • Adaptability and eagerness to learn in a fast-paced environment

We value candidates who demonstrate a proactive approach to identifying and solving problems, a commitment to reliability, and a drive to elevate the entire team's performance.

If you're passionate about applying SRE principles to create robust, scalable IT systems, we want to hear from you!

Relocation Statement:

This position is not eligible for relocation assistance. Visit our PinFlex page to learn more about our working model.

At Pinterest we believe the workplace should be equitable, inclusive, and inspiring for every employee. In an effort to provide greater transparency, we are sharing the base salary range for this position.

The position is also eligible for equity. Final salary is based on a number of factors including location, travel, relevant prior experience, or particular skills and expertise.

Information regarding the culture at Pinterest and benefits available for this position can be found here.

US based applicants only

$125,630-$258,470 USD

Our Commitment to Diversity:

Pinterest is an equal opportunity employer and makes employment decisions on the basis of merit. We want to have the best qualified people in every job.

All qualified applicants will receive consideration for employment without regard to race, color, ancestry, national origin, religion or religious creed, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, age, marital status, status as a protected veteran, physical or mental disability, medical condition, genetic information or characteristics (or those of a family member) or any other consideration made unlawful by applicable federal, state or local laws.

We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you require an accommodation during the job application process, please notify accessibility@pinterest.com for support.

2 days ago