The Sr. Site Reliability Engineer (SRE) is responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning.
This role will be a member of a team that focuses on DevOps, DevSecOps and SRE for the Dental Software Organization. The role drives continuous improvement in delivery of resilient, scalable, performant, secure, and high-quality cloud-native services.
Collaborating with DevOps, DevSecOps, and development teams the SRE identifies cross-team issues which create risk for operations across the organization and resolving those issues with a mixture of engineering, troubleshooting expertise, and general operational guidance.
You will proactively drive improvement of enterprise cloud capabilities while creating best practices and tools to empower developers to create, deploy, and operationally support services.
As a key contributor in the organization this role is responsible for the working with the Principal SRE and guiding junior team members in DevOps culture, highly scalable architectures, and lean development utilizing agile practices.
Educate yourself and others on anything that helps service teams more quickly and easily build, test, deploy & run their services to be more reliable
Plan, design, deploy, and operate Site Reliability Engineering capabilities for cloud products & services
Recognize and address sub-standard performance based on key performance indicators (KPIs)
Build monitoring that alerts on symptoms rather than outages
Continuously build, automate, and improve upon capabilities that are secure, scalable, performant, and resilient
Work closely with Infrastructure, Network, Security, Architecture, and Development teams to build highly performing, scalable, and secure Azure environments
Define needs by documenting processes; includes research, planning and writing supporting documentation
Additional Functions
In addition to the essential functions listed above, the incumbent may perform the following additional functions.
Participate in regulatory and compliance activities as necessary
Required Qualifications
Bachelor's degree in Computer Science, Management Information Sciences or area of functional responsibility preferred, or equivalent years of industry work experience
5+ years in software or operations engineering
2+ years of DevOps and Site Reliability engineering or similar experience with cloud-native solutions
Proven experience in DevOps culture and site reliability engineering focused on the customer, cross-functional autonomous teams, and continuous improvement
DevOps experience with a cloud-native web application hosted in one of the three major cloud platforms
Familiarity with version control systems e.g., Git, SVN, CVS
Extensive database and operating systems experience
Experience in designing and implementing a continuous integration pipeline (CICD)
May interacts with vendors and service providers to resolve system related problems
Experience in monitoring infrastructure, application uptime, latency, and performance on large distributed systems
Exhibit proficiency at troubleshooting various cloud and system related issues
Demonstrable cross-functional knowledge with systems, storage, networking, security, and databases
Strong verbal and written communication skills with ability to effectively communicate at multiple levels in the organization
Preferred Qualifications
Experience managing infrastructure as code via tools such as Terraform
A passion for automation with a desire to eliminate toil wherever possible
Proven experience with application performance monitoring tools (Dynatrace, AppDynamics)
Microsoft Azure experience
Working Environment
Office environment either in Patterson facility or at home / remote location
Patterson Dental is committed to supporting a robust remote work culture, with well-established virtual collaboration practices and equal opportunities for career advancement and professional growth, regardless of physical location.
Travel to corporate sites is periodically required (Quarterly or so)
Periodic on call rotations and available outside of normal business hours on evenings and weekends during critical production release or issue escalation periods
This role is eligible for hire in any of the following States : AK, AZ, CA, CO, CT, DC, HI, ID, IL, KS, KY, ME, MA, MI, MN, MT, NE, NV, NH, NM, NY, OR, RI, SD, TX, UT, VT, WA, WV, WI
The potential compensation range for this role is below. The final offer amount could exceed this range, based on various factors such as candidate location (geographical labor market), experience, and skills.
$125,000 - $145,000