Search jobs > San Francisco, CA > Site reliability engineer

Staff Site Reliability Engineer

Circle
San Francisco, CA, US
$172.5K-$227.5K a year
Full-time

Circle is a financial technology company at the epicenter of the emerging internet of money, where value can finally travel like other digital data globally, nearly instantly and less expensively than legacy settlement systems.

This ground-breaking new internet layer opens up previously unimaginable possibilities for payments, commerce and markets that can help raise global economic prosperity and enhance inclusion.

Our infrastructure including USDC, a blockchain-based dollar helps businesses, institutions and developers harness these breakthroughs and capitalize on this major turning point in the evolution of money and technology.

What you’ll be part of :

Circle is committed to visibility and stability in everything we do. As we grow as an organization, we're expanding into some of the world's strongest jurisdictions.

Speed and efficiency are motivators for our success and our employees live by our company values : Multistakeholder, Mindfulness, Driven by Excellence and High Integrity.

Circlers are consistently evolving in a remote world where strength in numbers fuels team success. We have built a flexible and diverse work environment where new ideas are encouraged and everyone is a stakeholder.

What you’ll be responsible for :

As a Senior Site Reliability Engineer at Circle, you will design, build, and maintain Circle’s infrastructure estate to meet the growing worldwide customer base on public cloud providers across multiple regions.

You will use your experience, knowledge, and skills to ensure Circle’s products and core systems are running consistently, reasonably, and in a performant manner.

This is a unique opportunity to develop your skills, collaborate with cross-functional teams and continuously learn in a dynamic and fast-paced environment.

Join Circle and join a fun, collaborative, and innovative team dedicated to delivering exceptional customer experiences.

What you'll work on :

  • Support multiple development teams with an agile, responsive CI / CD platform to deliver high-quality builds with measurable performance and quality;
  • Build, maintain, improve, scale, and secure cloud infrastructure and resources using IaC tools (Terraform, CloudFormation, Ansible);
  • Automate operational tasks via Go, Python, and serverless solutions (AWS Lambda, Kubernetes Jobs);
  • Design, manage, and monitor Kubernetes clusters for multiple production workloads;
  • Driving forward our blockchain infrastructure by creating and managing blockchain nodes across a wide variety of blockchains that includes Algorand, Ethereum, Hedera, Flow, Solana, Stellar, Tron;
  • Participate in an on-call rotation to mitigate disruption for any production systems and conduct root cause analysis;
  • Plan and test disaster recovery scenarios for a highly available microservices architecture;
  • Collaborate with the Security team to create and maintain security-focused tools and frameworks and exert a top-class security posture;
  • Engaging and mentoring team members and helping grow and scale the team.

Here is our team hierarchy for individual contributors :

Staff Site Reliability Engineer (IV)

Senior Site Reliability Engineer (III)

What you’ll bring to Circle (not all required) Senior Site Reliability Engineer (III)

  • 4+ years in DevOps or SRE roles, with a focus on tooling, automation, and infrastructure on a major public cloud provider;
  • Proficiency with coding and / or scripting with the following languages (Go, Python, Shell);
  • You have at least 3 years of combined experience in building and maintaining CI / CD platforms and supporting agile engineering teams in building microservices;
  • Experience with :
  • Building Docker images and deploying containers in Kubernetes clusters;
  • Any modern CI / CD platform with seemingly complex gates and workflows;
  • Blue-Green, Canary, and A / B Testing deployment strategies;
  • Distributed blockchain systems, running and maintaining blockchain full nodes;
  • Database technologies (PostgreSQL, Redis, OpenSearch);
  • Migrating and transforming large, complex datasets from diverse sources, structures, and formats;
  • Data warehousing tooling and services (Apache Airflow, AWS DMS, Snowflake);
  • Knowledge of networking routing, DNS, load balancing, and edge networking;
  • Knowledge of APM, RUM, monitoring, and telemetry tools;
  • Helm charts and deploying and maintaining Kubernetes clusters;
  • Authoring and maintaining IaC with Terraform and using IaC to deploy resources in AWS, Azure, GCP, or any other public cloud providers;
  • Strong skills around observability, troubleshooting, and performance solutions;
  • Ability and eagerness to deep dive into understanding, debugging, and improving any layer of the tech stack;
  • Exhibit strong communication skills and ability to explain technical concepts to peers and stakeholders.

Staff Site Reliability Engineer (IV)

All the requirements of a Senior Site Reliability Engineer and :

  • 7+ years in DevOps or SRE roles, with a focus on tooling, automation, and infrastructure on a major public cloud provider;
  • Led teams technically on architecture and system design;
  • Deep understanding / experience with :
  • API design and REST principles;
  • Cloud services (AWS, Google Cloud, Microsoft Azure, etc);
  • Containers and Kubernetes;
  • SQL databases and designing schemas;
  • Deep focus on coding standards and code quality a desire to have excellent test coverage.

Additional Information :

This position is eligible for day-one PERM sponsorship for qualified candidates.

Circle is on a mission to create an inclusive financial future, with transparency at our core. We consider a wide variety of elements when crafting our compensation ranges and total compensation packages.

Starting pay is determined by various factors, including but not limited to : relevant experience, skill set, qualifications, and other business and organizational needs.

Please note that compensation ranges may differ for candidates in other locations.

Senior Site Reliability Engineer

Base Pay Range : $147,500 - $195,000

Annual Bonus Target : 12.5%

Staff Site Reliability Engineer

Base Pay Range : $172,500 - $227,500

Annual Bonus Target : 15%

Also Included : Equity & Benefits (including medical, dental, vision and 401(k)). Circle has a discretionary vacation policy.

We also provide 10 days of paid sick leave per year and 11 paid holidays per year in the U.S.

We are an equal opportunity employer and value diversity at Circle. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Additionally, Circle participates in the E-Verify Program in certain locations, as required by law.

J-18808-Ljbffr

1 day ago
Related jobs
Promoted
Crusoe Energy Inc
San Francisco, California

Crusoe Energy is on a mission to unlock value in stranded energy resources through the power of computation.Take a look at what we do! - https://www.We aim to align the long term interests of the climate with the future of global computing infrastructure.As data centers consume an exponentially grow...

Promoted
HashiCorp
San Francisco, California

As a Senior Site Reliability Engineer on the Infrastructure Services team, you will play a pivotal role in designing, building, and maintaining the infrastructure that underpins all HashiCorp cloud products. Have extensive experience in site reliability engineering, cloud infrastructure management, ...

Promoted
Span
San Francisco, California

Span is seeking an experienced and driven electrical engineer to lead the reliability program for Span through all stages of development. In this role, you will rely on your past experience to develop a comprehensive reliability test program for new products and support any reliability issues that a...

Promoted
Cisco Systems, Inc.
San Francisco, California

Senior Site Reliability Engineer, FedRAMP. We’re looking for talented engineers with a software or operations background, experienced in designing and operating large-scale highly available distributed systems in the cloud. You must be willing to work closely with our application development teams t...

Promoted
Arta Finance
San Francisco, California

Arta is on an audacious and incredibly rewarding mission: to pave the way for people everywhere to lead more successful financial lives.We value trust, teamwork, and adaptability.The Infrastructure team within Arta is building the backbone of this mission, from ingesting data and making it accessibl...

Promoted
Tampa Gardens Senior Living
San Francisco, California

In this role, you will join our Site Reliability and Infrastructure Team in deploying, managing, optimizing, and upgrading the systems that run Sight Machine software. Success will take a blend of technical expertise, experience with deployment technology frameworks, customer-centric focus, and a te...

Xero
San Mateo, California

Reliability Enablement (AKA Reliability Rangers) As a member of our Reliability Enablement team at Xero, you’ll help teams deliver a great customer experience through a better understanding of the behaviour and operation of their systems. There will be a lot of variety to your work as a part of reli...

E-Solutions
California, United States

Site Reliability Engineer (SRE). We are seeking a skilled Site Reliability Engineer (SRE) to join our dynamic team. You will be responsible for ensuring the availability and reliability of our SaaS products, which host customer data and require 24x7 uptime. Ensure the reliability, availability, and ...

Splunk Inc
California, United States

Learn more aboutSplunkcareers and how you can become a part of our journey!Role:Splunk is looking for a TechOps Engineer with the ability to provide day-to-day technical expertise for our Splunk Cloud Azure TechOps team and the Splunk organization. As a TechOps Engineer, you will be interfacing with...

WEX Inc
San Francisco Bay Area, California
Remote

The WEX Site Reliability Engineering (SRE) team seeks individuals passionate about developing software and solutions for observability, incident response, reliability, performance, operational excellence, and compliance. Site Reliability Engineer or equivalent role. As part of the Platform Reliabili...