Talent.com
Founding Site Reliability Engineer (Remote - US)
Founding Site Reliability Engineer (Remote - US)Jobgether • San Francisco, CA, United States
Founding Site Reliability Engineer (Remote - US)

Founding Site Reliability Engineer (Remote - US)

Jobgether • San Francisco, CA, United States
job_description.job_card.variable_hours_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
  • serp_jobs.filters.remote
job_description.job_card.job_description

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Founding Site Reliability Engineer in the United States .

This is a unique opportunity to join a rapidly growing AI company as the first SRE hire in the San Francisco office. In this role, you will define and scale the Site Reliability Engineering discipline, ensuring the platform is reliable, secure, and performant at enterprise scale. You will work closely with engineering leads, product teams, and company founders to build infrastructure, establish best practices, and drive the organization’s reliability culture. The role involves hands‑on system design, automation, and observability work, while providing leadership and strategic input to shape long‑term operational excellence. Ideal candidates are technically strong, highly collaborative, and motivated by building world‑class systems from the ground up.

Accountabilities

  • Establish and scale the SRE discipline , including best practices, tooling, and culture.
  • Ensure 99.9% uptime of production systems and maintain global platform reliability.
  • Architect, automate, and manage AWS infrastructure using Terraform, CI / CD pipelines, and Infrastructure as Code.
  • Design and implement observability systems across microservices, APIs, and vector workloads, including metrics, tracing, and logging.
  • Lead incident management , reducing MTTR through runbooks, alerts, and postmortems.
  • Collaborate with engineering teams to embed reliability principles into the software development lifecycle.
  • Influence organizational strategy and culture as a founding voice in the engineering team.

Qualifications

  • 5+ years of experience in SRE, DevOps, or infrastructure roles, ideally in enterprise SaaS environments.
  • Expertise in AWS services (EC2, ECS / EKS, Lambda, RDS, VPC, IAM).
  • Proven experience with Infrastructure as Code (Terraform, Kubernetes / EKS, CDK, or CloudFormation).
  • Hands‑on experience with observability and monitoring stacks (CloudWatch, Grafana, Prometheus, Datadog).
  • Experience in incident management, on‑call responsibilities, and postmortem‑driven reliability improvements.
  • Bonus : exposure to AI / ML platforms, data‑heavy systems, or multi‑agent workloads.
  • Strong problem‑solving, communication, and collaboration skills.
  • Benefits

  • Competitive salary and equity options.
  • Health, dental, and vision insurance, including dependents coverage.
  • Paid time off and holidays, with parental leave benefits.
  • 401(k) plan and other financial perks.
  • Opportunity to shape company culture and systems at a high‑growth AI startup.
  • Thank you for your interest!

    #J-18808-Ljbffr

    serp_jobs.job_alerts.create_a_job

    Site Reliability Engineer • San Francisco, CA, United States

    Job_description.internal_linking.related_jobs
    Customer Reliability Engineer

    Customer Reliability Engineer

    VirtualVocations • Hayward, California, United States
    serp_jobs.job_card.permanent
    A company is looking for a Customer Reliability Engineer to ensure the stability and performance of solutions while providing technical escalation support for customers.Key Responsibilities Serve...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ConductorOne • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Shape the future of identity with the highest-caliber team.If you’re amazing at what you do and want to solve big challenges in identity and security, come on board. Identity is how companies are be...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    VirtualVocations • Santa Clara, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Senior Site Reliability Engineer.Key Responsibilities Design, develop, and implement software to enhance system availability, scalability, latency, and efficiency Lead...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Site Reliability Engineer II

    Site Reliability Engineer II

    VirtualVocations • Santa Clara, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Site Reliability Engineer II- Process Automation.Key Responsibilities Optimize and automate incident and change management processes to enhance system efficiency and re...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    Fortinet • Santa Clara, CA, United States
    serp_jobs.job_card.full_time
    At Fortinet, we strive to provide a supportive, collaborative environment where people are empowered to do the best work of their careers. Our team members enjoy solving complex problems, and obsess...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Latent • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Latent is building the intelligence infrastructure for American healthcare.Our products are already helping hospitals and clinics dramatically increase workflow output, speed up patient access to m...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Bits to Atoms • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Site Reliability Engineer (SRE).You’ll work at the intersection of infrastructure, AI / ML systems, and mission-critical physical operations. You’ll collaborate directly with engineering, AI, and oper...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PsiQuantum • Palo Alto, CA, United States
    serp_jobs.job_card.full_time
    Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Site Reliability Engineer Lead

    Site Reliability Engineer Lead

    VirtualVocations • Fremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Site Reliability Engineer, Team Lead.Key Responsibilities Ensure 24x7 availability of production application systems and drive operational efficiency initiatives Ident...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Site Reliability Engineer

    Site Reliability Engineer

    Fortinet • Sunnyvale, CA, United States
    serp_jobs.job_card.full_time
    At Fortinet, we strive to provide a supportive, collaborative environment where people are empowered to do the best work of their careers. Our team members enjoy solving complex problems, and obsess...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Site Reliability Engineer Team Lead

    Site Reliability Engineer Team Lead

    VirtualVocations • Hayward, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Site Reliability Engineer, Team Lead.Key Responsibilities Ensure 24x7 availability of production application systems Drive initiatives to improve operational efficienc...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Founding Site Reliability Engineer

    Founding Site Reliability Engineer

    Reducto • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Nearly 80% of enterprise data is in unstructured formats like PDFs.PDFs are the status quo for enterprise knowledge in nearly every industry. Reducto helps extract data from complex documents, enabl...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Site Reliability Engineer

    Site Reliability Engineer

    WorkOS • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    About WorkOS 🚀 WorkOS builds tools and services for developers to help them implement authentication, identity, authorization, and overall enterprise readiness. We’re a fully distributed team with ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Site Reliability Engineer - Openstack

    Site Reliability Engineer - Openstack

    Fortinet • Sunnyvale, CA, United States
    serp_jobs.job_card.full_time
    Fortinet is recruiting a Site Reliability Engineer- OPENSTACK to join our FortiStack team.This team is responsible for the management, operation and continued development of our Openstack-based pri...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Primer • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Primer helps B2B products break out of the B2C-centric marketing box.Our platform turns consumer ad channels, data streams, and emerging AI workflows into measurable growth engines for go-to-market...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Writemed • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Would you like to join one of the fastest-growing organizations with a goal of using the latest AI, GenAI, LLM, Cloud, and Digital Technologies to advance drug development and improve patient care ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Site Reliability Engineer

    Site Reliability Engineer

    VirtualVocations • Fremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Site Reliability Engineer to provide engineering and operational support for cloud and application services in Oracle Cloud Infrastructure (OCI).Key Responsibilities De...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Signify Technology • Palo Alto, CA, United States
    serp_jobs.job_card.full_time
    Competitive, based on experience.We are a technology startup advancing healthcare with a safety-focused AI platform that assists medical professionals by managing patient communications, including ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted