
Site Reliability Engineer

Cerbo
Portland, OR, US
Full-time

The Company

Cerbo is a high-growth healthcare SaaS company, doing our part in the medical market to support holistic lifestyles and personalized medicine.

Our software, Cerbo EHR, is a cloud-based electronic health records (EHR) and patient portal system. Healthcare offices across the country, and some around the world, use Cerbo for nearly everything they do in their day-to-day operations.

Cerbo originally started as a developer’s nights-and-weekends project and has grown into one of the leading EHR systems for functional or root cause medicine and membership- or cash-based clinics.

Because of our unique origins, we often approach things a bit differently. That is, success for us is not just about the bottom line.

It’s more about providing a great product, operating with integrity, and supporting our clients and our team. During the past four years our team has grown, and thousands of practitioners and patients use our product.

To this end, we’re looking for a Site Reliability Engineer to join our growing team.

What You’ll Do

As the Site Reliability Engineer (SRE), you will play a pivotal role in managing the future of our technology.

You will work with our current SRE and engineering team to tune, optimize, and enhance our Amazon Web Services (AWS) infrastructure.

If you're passionate about building and maintaining highly available, scalable systems and thrive in a fast-paced environment, we'd love to hear from you!

Primary Responsibilities

- Design, implement, and maintain scalable and reliable cloud infrastructure on AWS
- Manage and optimize Kubernetes clusters using Amazon EKS
- Develop and maintain Infrastructure as Code using Terraform
- Implement and improve CI/CD pipelines using GitHub Actions and ArgoCD
- Ensure system security and implement best practices
- Monitor and optimize system performance using Grafana and Prometheus
- Track our AWS spending and suggest ways to cut operating costs
- Troubleshoot and resolve complex issues in production environments
- Collaborate with development teams to improve application reliability and performance
- Participate in the on-call rotation with other SREs and engineering team members

Required Skills

- Extensive experience with AWS services and best practices
- Proficiency in managing Kubernetes clusters, particularly Amazon EKS
- Strong knowledge of Helm for Kubernetes package management
- Extensive experience with Infrastructure as Code, specifically Terraform
- Familiarity with CI/CD pipelines, particularly GitHub Actions
- Advanced Linux administration skills
- Solid understanding of networking concepts and protocols
- Experience in implementing and maintaining security best practices
- Proficiency in using monitoring and observability tools, especially Grafana and Prometheus

Qualifications

- Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience)
- 3+ years of experience in a Site Reliability Engineering or similar role
- Strong problem-solving skills and attention to detail
- Excellent communication skills and ability to work in a team environment
- Certifications in AWS, Kubernetes, or other relevant technologies are a plus

Compensation & Benefits

- Competitive compensation based on experience
- Comprehensive health, dental, and vision benefits
- 401(k) plan with matching company contribution
- Short-term and long-term disability insurance
- Paid time off and company holidays
- Full suite of remote working tools and processes

Location: 100% Remote

We are an equal opportunity employer and value diversity at our company.

We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

30+ days ago