Talent.com
Senior Site Reliability Engineer (SRE) - (Dublin, CA)
Articul8 • Dublin, CA, US
Job type: Full-time

About Us

Articul8 AI is at the forefront of Generative AI innovation, delivering cutting-edge SaaS products that transform how businesses operate. Our platform empowers organizations to leverage the power of artificial intelligence in a reliable, scalable, and secure environment.

Position Overview

We are seeking an experienced Site Reliability Engineer (SRE) to join our team and help ensure the reliability, performance, and scalability of our GenAI SaaS platform. As an SRE, you will bridge the gap between development and operations, implementing automation and best practices to maintain our service reliability objectives while supporting rapid innovation.

Key Responsibilities

Architect and maintain scalable, highly available infrastructure for our GenAI platform.

Design and implement robust monitoring, alerting, and observability solutions to proactively ensure system health and performance.

Automate deployment, scaling, and management of our cloud-native infrastructure, reducing toil and improving efficiency.

Define, measure, and improve Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to deliver outstanding service quality.

Participate in on-call rotations and provide rapid response to production incidents, minimizing downtime and user impact.

Collaborate closely with development teams to build reliable, scalable, and efficient systems for complex AI workloads.

Lead incident response efforts, conduct thorough post-mortems, and champion continuous improvement initiatives.

Optimize infrastructure for performance, scalability, and cost-effectiveness—especially for high-demand AI workloads.

Implement and enforce security best practices across all systems and environments.

Create and maintain comprehensive documentation, including runbooks and knowledge base articles, to foster a culture of shared knowledge.

Qualifications

Required

Bachelor's degree in Computer Science, Engineering, or related field, or equivalent practical experience

5+ years of experience in DevOps, SRE, or similar roles

Strong experience with cloud platforms (AWS, GCP, or Azure)

Proficiency in at least one programming/scripting language (Python, Go, Bash, etc.)

Hands-on experience with infrastructure as code tools (Terraform, CloudFormation, etc.)

Solid background in containerization technologies (Docker, Kubernetes)

Proven experience with monitoring and observability tools (Prometheus, Grafana, ELK stack, etc.)

Strong understanding of CI/CD pipelines and automation

Exceptional problem-solving skills and the ability to troubleshoot complex systems

Preferred

Experience supporting AI/ML systems in production

Knowledge of GPU infrastructure management and optimization

Familiarity with distributed systems and high-performance computing

Experience with database systems (SQL and NoSQL)

Certifications in cloud platforms (AWS, GCP, Azure)

Experience with chaos engineering and resilience testing

Knowledge of security best practices and compliance requirements

Ready to shape the future of resilient software systems? Apply now and help drive the reliability of tomorrow's AI at Articul8 AI!
