Site Reliability Engineer

Genuine Parts Company

Birmingham, Alabama

Full-time

The Site Reliability Engineer (SRE) is responsible for improving system reliability and resilience. This role focuses on building automation to reduce manual effort and prevent service-impacting incidents.

The SRE combines software and systems engineering to build and support large-scale, distributed, fault-tolerant systems.

This role ensures that critical platforms are available, reliable, and able to support a fast rate of improvement. It relies on monitoring platforms to maintain a holistic view of system health and performance.

The SRE enhances and supports cloud-based transformations and focuses on pushing capabilities forward, staying ahead of customer needs, and innovating for continuous improvement.

The SRE provides operational support and engineering for multiple large-scale distributed software applications.

JOB DUTIES

Gathers and analyzes metrics from monitoring platforms to assist in performance tuning and fault tolerance.
Partners with development teams to improve services through testing and release procedures.
Participates in system design, platform management and capacity planning.
Balances feature development speed and reliability with service-level objectives.
Works closely with the incident response team to restore service to normal operation.
Debugs issues and applies troubleshooting skills.
Investigates, blocks and rate-limits unwanted traffic.
Utilizes monitoring systems and dashboards for proactive changes and alerting.
Establishes continuous process improvement cycles where the process, performance, and supporting technologies are reviewed and enhanced where applicable.
Performs other duties as assigned.

EDUCATION & EXPERIENCE

Typically requires a bachelor's degree and five (5) to seven (7) years of experience in a technology and/or software engineering role, or an equivalent combination of education and experience.

KNOWLEDGE, SKILLS, ABILITIES

Understanding of Kubernetes, containers, clusters and elastic scalability.
Expertise in SRE principles.
Mindset of continually finding ways to drive scalability, stability and performance.
Cloud Services experience with Google Cloud Platform (GCP).
Experience with API, service-based or microservice-based architecture.
Proficiency in infrastructure, network, database, operating systems or security troubleshooting and remediation.
Architecture-level knowledge of Windows, Linux, and infrastructure systems.
Experience with production deployment, monitoring, and operational support for enterprise-class applications (Dynatrace a plus).
Experience working with Continuous Integration / Continuous Deployment tools.
Experience in performance diagnostics, capacity planning, performance architecture design, performance tuning and performance monitoring.
A strong mix of software engineering and operational support skills.
Knowledge of web technologies (HTTP, proxies, Java, etc.).
Experience with Azure DevOps (ADO), Dynatrace, Prometheus, Terraform and Grafana.

COMPANY INFORMATION: Motion offers an excellent benefits package, which includes options for healthcare coverage, 401(k), tuition reimbursement, and vacation, sick, and holiday pay.
