Senior Site Reliability/DevOps Engineer

AutoRABIT Holding Inc.
San Francisco, CA, US
$90K-$120K a year
Full-time

About AutoRABIT: AutoRABIT is a hyper-growth SaaS software company and the leading provider of a Salesforce DevSecOps platform for regulated industries such as financial institutions, insurance, and healthcare.

AutoRABIT solutions enable developers to automate their daily tasks to be more productive and increase the release velocity for their development team, while meeting the stringent security, compliance, and privacy regulations.

About the role: AutoRABIT is looking for a Senior Site Reliability / DevSecOps Engineer to help develop, scale, and operate our cloud services. In this role you will be an experienced business professional able to implement and execute best-practice operations and improvements across teams by providing visibility and recommendations for improved reliability and automation.

You will be responsible for the security, availability, performance, efficiency, change management, monitoring, emergency response, capacity planning, backup, and disaster recovery of our technical ecosystem, and will drive automation while building a robust and agile DevSecOps framework.

Accountability, agility, and strong analytical skills, paired with an obsession for learning, gathering data, and executing on that data, are key to being successful in this role.

Responsibilities:
Broadly, you are a Site Reliability or DevSecOps engineer with a passion for automation, reliability, scalability, monitoring, and capacity planning, but you also have the breadth of knowledge necessary to support a wide variety of software and systems.
Contribute to the development and maintenance of frameworks for monitoring, automation, and code to increase the scalability and reliability of the service.
Assist both internal and customer-facing teams with deployment of new software releases, VPN, and other related security infrastructure interfacing.
Assist with resolution of AutoRABIT service or customer issues as required.
Participate in and practice sustainable incident response and blameless postmortems.
Contribute to the automation of manual tasks, such as the provisioning of users in production and test environments.
Help develop peers' capabilities through knowledge sharing, mentoring, and collaboration.
Work within a small agile team to develop and improve SRE software, support your peers, plan, and self-improve.
Participate in a regular on-call or rotational schedule needed to support AutoRABIT servers, including weekends and holidays.

Required Skills and Experience:
Design, implement, and maintain scalable, resilient, and secure infrastructure using AWS.

Develop and manage infrastructure as code using Terraform.
Implement and manage CI/CD pipelines to automate deployments and ensure smooth delivery of applications.
Monitor system performance, identify bottlenecks, and implement solutions to improve reliability and performance.
Troubleshoot, resolve, and perform RCAs for incidents, while ensuring minimal disruption to services.
Collaborate with development teams to ensure applications are designed for reliability and performance.
Working experience with shell scripting (Bash), Python, or equivalent is required.
Good knowledge of programming languages such as Python, Go, or Java.
Working experience with configuration management tools such as Ansible or Chef.
Implement and maintain monitoring, logging, and alerting systems to ensure the health and performance of our infrastructure.
Ensure security best practices are followed and compliance requirements are met.
Responsibility to adhere to set internal controls.
Can-do attitude: challenging the status quo, leading, and contributing to key improvements and innovations, while maintaining accountability.
Excellent written and verbal US English communication skills for working across a global team environment.

Education and Background:
Bachelor's in Computer Science, Engineering, or equivalent degree or experience.
5+ years of experience in site reliability engineering, DevOps, or a related field.

AWS, GCP, and/or Azure certified.
3+ years of Kubernetes experience.
3+ years of experience managing Linux-based systems in a public cloud such as AWS, GCP, or Azure.
3+ years of experience with systems monitoring and logging; knowledge of ELK is preferred.
Solid understanding of standard TCP/IP networking and common protocols and components such as DNS, load balancers, and HTTP.
Must be a US citizen or permanent resident, capable of obtaining a government security clearance if required, and must live and work in the US.
Green card holders qualify, but H1B or other work visa holders do not qualify for this role.

Salary range for this role is $90,000 to $120,000, depending on experience.

THIS IS A 100% REMOTE JOB

30+ days ago