Site Reliability Engineering (SRE) Lead

IDEXX
Virtual, Wisconsin
Full-time

Are you interested in working on a fast-paced Agile team, building modern & global LIMS platform? Do you want to work on a product that makes a difference in the day-to-day life of lab operations, veterinarians, and pet owners?

Are you a self-starter individual? We are looking for a motivated engineer to join the Site Reliability Engineering Team to help drive performance, stability and customer satisfaction with the product and the team.

The right lab information managementsystem (LIMS) is critical to operations, clinical outcomes, client relationships, and more.

Applying system thinking to these challenges supports our desire for change while also creating pull-through innovations that are led by our commercial needs, and the needs of our customers.

IDEXX is looking for a SRETechnical Lead to work on a complex distributed platform that scales globally across the entire Reference Lab operationsecosystem.

You will participate in everything from high-level design, execute product strategies, oversee the management and optimization of our vendor software products and solutions.

You will be working on a platform that is composed of numerous technologies and leverages PAAS infrastructure in both AWS and GCP.

This opportunity offers the chance to tackle complex challenges and build a new solution from the ground up.

As the SRE Lead , you will be responsible for ensuring the reliability, scalability, and performance of our services and solutions.

The ideal candidate should have a strong technical background, excellent problem-solving skills, and a passion for building resilient systems.

In this role...

You will provide Leadership :

  • Mentor team of Site Reliability Engineers.
  • Foster a culture of collaboration, innovation, and continuous improvement within the team.
  • Identifies business needs, assesses available technologies, and develops and presents solutions.

You will be performing Reliability Engineering responsibilities :

  • Design, implement, and maintain systems and processes to ensure high availability and performance of services.
  • Develop and manage SLAs, SLOs, and SLIs to monitor and improve service reliability.
  • Proactively identify and address potential reliability issues and risks.
  • Provides high level of customer service, partners with end users in the resolution of problems or in deployment of new applications.

You will drive Automation, Tooling and Development :

  • Drive automation efforts to reduce manual work and improve operational efficiency.
  • Develop and maintain infrastructure as code and configuration management tools.
  • Implement monitoring, logging, and alerting systems to ensure timely detection and resolution of issues.
  • Design, code, test, debug, and document software applications according to technical specifications developed by analysts and project teams.

Create modules that adhere to these specifications, ensuring efficient and reliable system operation.

You will lead Incident Management :

  • Lead incident response efforts, including root cause analysis and post-mortem reviews.
  • Implement processes to prevent recurrence and improve incident response times.
  • Collaborate with cross-functional teams to ensure effective communication and coordination during incidents.

You will be responsible for Continuous Improvement :

  • Identify opportunities for process improvements and implement best practices for SRE.
  • Stay up-to-date with industry trends and emerging technologies to drive innovation.
  • Foster a culture of learning and knowledge sharing within the team and across the organization.

You will collaborate :

  • Work closely with development, operations, and product teams to ensure seamless integration and delivery of services.
  • Provide technical guidance and support to other teams as needed.
  • Participate in architecture and design discussions to ensure reliability and scalability considerations are addressed.

What You Will Need to Succeed :

  • You will have a solid track record of leading technical initiatives to meet timelines and meet the expectations of various stakeholders.
  • You have a strong understanding of cloud infrastructure (AWS, GCP, Azure) and containerization technologies (Docker, Kubernetes).
  • You have 7+ years of experience in site reliability engineering, software development, or a related role.
  • You have proficiency with languages including Java / Kotlin.
  • Bachelor’s degree in computer science, Engineering, or a related field. Master’s degree preferred.
  • Proven experience leading and mentoring technical teams.
  • Proficiency in scripting and automation using languages such as Python, Go, or similar.
  • Experience with monitoring and logging tools (Splunk, ELK, CloudWatch, etc.)
  • Experience working with RESTful APIs, front end JS frameworks and Jenkins
  • Excellent problem-solving and analytical skills.
  • Experience with relational & NoSQL databases
  • Strong communication and interpersonal skills.
  • Ability to work effectively in a fast-paced, dynamic environment

Location : This position is remote and you can be based anywhere in US, with preference in the CST and EST time zones.

Why IDEXX?

We’re proud of the work we do, because our work matters. An innovation leader in every industry we serve, we follow our Purpose and Guiding Principles to help pet owners worldwide keep their companion animals healthy and happy, to ensure safe drinking water for billions, and to help farmers protect livestock and poultry from diseases.

We have customers in over 175 countries and a global workforce of over 10,000 talented people.

So, what does that mean for you? We enrich the livelihoods of our employees with a positive and respectful work culture that embraces challenges and encourages learning and discovery.

At IDEXX, you will be supported by competitive compensation, incentives, and benefits while enjoying purposeful work that drives improvement.

Let’s pursue what matters together.

LI-REMOTE

15 days ago
Related jobs
IDEXX
US, WI, Virtual

Are you interested in working on a fast-paced Agile team, building modern & global LIMS platform? Do you want to work on a product that makes a difference in the day-to-day life of lab operations, veterinarians, and pet owners? Are you a self-starter individual? We are looking for a motivated engine...

TekStream Solutions
Milwaukee, Wisconsin

The Lead Site Reliability Engineer has a pivotal role at the forefront of our engineering operations, responsible for guiding the Platform Team toward achieving exceptional standards of reliability, performance, and stability across all our applications. Lead Site Reliability Engineer. As a key lead...

iSeatz
Oconomowoc, Wisconsin

The Site Reliability Engineering (SRE) Manager reports to the Manager of Platform Services and leads full-time and contractor team members. Lead, manage and mentor a team of Site Reliability Engineers to ensure the reliability, scalability, and performance of our services. In this role, you will ens...

Promoted
JT Engineering, Inc.
Waterloo, Wisconsin

JT is on a growth spree, and we're on the lookout for talented Civil Engineer Project Managers to join our team throughout Wisconsin. You'll get to lead projects, show off your skills, and shape our services. Lead a project team to ensure quality work. Develop, monitor, and manage project budgets. ...

Promoted
Horizon Develop Build Manage
Stoughton, Wisconsin

Our Project Managers are accountable for the five phases of the project: preconstruction, job cost management, construction, project turnover and post construction. We are seeking a highly motivated and experienced Project Manager to join our team. Minimum of 5 years of experience in project manage...

Promoted
Associated Bank - Corp
Milwaukee, Wisconsin

This role will utilize automation tools like Ansible, Python, Git, Github, Terraform, CI/CD pipelines, and Jenkins to help accelerate our Infrastructure As Code transformation, while bringing Network fundamentals to the table as well. Monitor infrastructure to maintain desired capacity for existing ...

Gensler
La Crosse, Wisconsin

DevOps engineer or in a similar software engineering role. We’re looking for a DevOps engineer who will be an integral part of our digital / web solution team and play a crucial role in establishing and optimizing our development and deployment pipelines, managing our cloud infrastructure, and imple...

JNC Recruitment Limited
Milwaukee, Wisconsin

A variety of soft skills and experience may be required for the following role Please ensure you check the overview below carefully.SRE / System Administrator – London / Hybrid.A leading and rapidly growing Professional Services organisation in London is looking to bring on a SRE / System Administra...

Robert Half
Milwaukee, Wisconsin

Unleash Your Potential as a DevOps Engineer at our Premier Client's Tech Hub!. Embrace the future as a key DevOps Engineer where every day brings thrilling new challenges and opportunities! Immerse yourself in our team culture with daily brainstorming sessions and propel your career forward by p...

Walsh Group
Racine, Wisconsin

Construction Senior Project Manager - Racine, Wisconsin. We are looking for Senior Project Managers who can facilitate the work of others, feel a strong sense of ownership over both the process and the results, and can build a culture of flexible productivity. Lead interdisciplinary teams to deliver...