Talent.com
Senior Site Reliability Engineer (Remote)

Senior Site Reliability Engineer (Remote)

JobgetherUS
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
  • serp_jobs.filters.remote
  • serp_jobs.filters_job_card.quick_apply
job_description.job_card.job_description

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior Site Reliability Engineer in the United States.

We are seeking a Senior Site Reliability Engineer to help shape and maintain the technical foundations of a high-growth platform. In this role, you will design and operate large-scale, secure, and highly available infrastructure while ensuring teams can deliver product features safely and efficiently. You will champion observability, CI / CD, and automation practices, enabling rapid innovation across engineering. This position offers the opportunity to work closely with senior leaders, partner with cross-functional teams, and have a measurable impact on platform reliability, scalability, and performance in a fast-paced, collaborative environment. The role requires both strategic thinking and hands-on technical expertise.

Accountabilities

  • Design, implement, and maintain scalable and secure cloud infrastructure on AWS.
  • Build and manage automation tools and self-service methods for infrastructure management (e.g., Terraform, CI / CD pipelines).
  • Partner with engineering, QA, and FinOps teams to enable fast, safe, and cost-effective deployments.
  • Own observability systems, establishing best practices for performance monitoring and service reliability across the organization.
  • Contribute to disaster recovery, incident management, and risk mitigation strategies.
  • Collaborate with teams to optimize infrastructure costs and improve operational efficiency.
  • Provide mentorship and guidance to engineering teams, fostering a culture of reliability and engineering rigor.

Requirements

  • BS or MS in Computer Science or related technical field, or equivalent experience.
  • 5+ years of experience in a dedicated Site Reliability Engineering role.
  • Strong experience with distributed systems and cloud infrastructure (AWS : EC2, RDS, EKS, CloudFront, ECR, S3, IAM, Lambda, Route53).
  • In-depth knowledge of Kubernetes, including deployment, scaling, orchestration, and automation for teams.
  • Experience building automation tools and using at least one programming language (e.g., Python, Golang, Rust) for infrastructure solutions.
  • Proven ability to analyze systems, identify performance bottlenecks, and implement improvements.
  • Strong communication and collaboration skills with cross-functional teams.
  • Bonus Points :

  • Familiarity with AWS Well-Architected Framework.
  • Experience with event-driven systems and messaging technologies (Kafka, NATS, Aeron).
  • Knowledge of security frameworks and infrastructure hardening.
  • Experience with diverse database architectures or Elasticsearch management.
  • Understanding of Ruby on Rails ecosystem and its operational considerations.
  • Benefits

  • Competitive base salary of $170,000–$230,000 / year plus equity.
  • Flexible remote work arrangements with optional in-office collaboration.
  • Comprehensive healthcare, dental, and vision benefits.
  • Opportunities for professional growth in a fast-paced, high-impact environment.
  • Hands-on role influencing platform reliability, performance, and engineering culture.
  • Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.

    When you apply, your profile goes through our AI-powered screening process designed to identify top talent efficiently and fairly.

    🔍 Our AI evaluates your CV and LinkedIn profile thoroughly, analyzing your skills, experience, and achievements.

    📊 It compares your profile to the job’s core requirements and past success factors to determine your match score.

    🎯 Based on this analysis, we automatically shortlist the 3 candidates with the highest match to the role.

    🧠 When necessary, our human team may perform an additional manual review to ensure no strong profile is missed.

    The process is transparent, skills-based, and free of bias — focusing solely on your fit for the role. Once the shortlist is completed, we share it directly with the company that owns the job opening. The final decision and next steps (such as interviews or additional assessments) are then made by their internal hiring team.

    Thank you for your interest!

    #LI-CL1

    serp_jobs.job_alerts.create_a_job

    Senior Site Reliability Engineer • US