Talent.com
Senior Site Reliability Engineer

Senior Site Reliability Engineer

VirtualVocationsNewark, Delaware, United States
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

A company is looking for a Systems Software Engineer, AI Infrastructure.

Key Responsibilities

Develop and maintain large-scale systems for AI Infrastructure, ensuring reliability and scalability

Implement SRE fundamentals, design automation tools, and optimize performance

Build observability tools and frameworks, and lead incident response protocols to enhance system resilience

Required Qualifications

Degree in Computer Science or related field, or equivalent experience with 8+ years in Software Development, SRE, or Production Engineering

Proficiency in Python and at least one other programming language (C / C++, Go, Perl, Ruby)

Expertise in systems engineering within Linux or Windows environments and cloud platforms (AWS, OCI, Azure, GCP)

Strong understanding of SRE principles and Infrastructure as Code tools (e.g., Terraform CDK)

Hands-on experience with observability platforms and CI / CD systems (e.g., GitLab)

serp_jobs.job_alerts.create_a_job

Senior Site Reliability Engineer • Newark, Delaware, United States