Search jobs > Atlanta, GA > Senior site reliability

Senior Site-Reliability Engineer

Cox Automotive
Atlanta GA
Full-time

Description

What you’ll be doing :

Be a main technical point of contact for cross-functional teams, stakeholders, and management, providing insights, recommendations, and updates on system health and performance.

Lead, mentor, guide and inspire a team of engineers including the ability to listen and sort through complex details, and the ability to propose ideas and solutions to meet technical challenges.

Design and implement robust, scalable, and highly available systems and architectures, considering factors like fault tolerance, disaster recovery, and efficient resource utilization.

Design and implementation of REST APIs, services, system tasks, and cloud solutions and enhance the performance and reliability of our current solutions

Write readable, maintainable, and efficient code and influence technical solutions while coaching newer or less experienced members on your Scrum team

Implement platform-level components including event architectures, messaging, and caching solutions

Construct and manage services published to both internal and external consumers

Collaborate with team members on best practices, code reviews, internal tools, and process improvements in order to design and implement reliable and scalable software solutions, ensuring operational excellence throughout the software development lifecycle.

Evangelize new ideas within your team as well as across teams

Lead and mentor a team of Site Reliability Engineers, providing technical guidance, coaching, and fostering a culture of continuous improvement.

Develop and maintain automation frameworks, tools, and processes to improve system reliability, deployment speed, and operational efficiency.

Establish and enforce best practices for incident response, including incident triage, root cause analysis, and post-incident reviews.

Drive capacity planning initiatives, analyzing system usage patterns, forecasting resource needs, and optimizing infrastructure to meet growing demands.

Stay abreast of emerging technologies, industry trends, and evolving best practices in DevOps and Site Reliability Engineering and propose innovative solutions to enhance system reliability and performance.

Utilize infrastructure as code practices using tools like Terraform, CloudFormation, or Ansible to define and manage cloud resources on platforms such as Amazon Web Services (AWS).

Implement and maintain configuration management tools like Ansible, Puppet, or Chef to manage system configurations consistently.

Lifelong learning passionate about learning and sharing knowledge with peers.

Automate routine tasks and build automation frameworks using Python, Go, Ruby, or shell scripts to reduce manual toil and improve operational efficiency.

Apply standards and practices for software development and infrastructure management, ensuring compliance with industry regulations and security requirements.

Collaborate with cross-functional teams to drive a cloud-first approach and ensure seamless integration of services and systems.

Participate in agile methodologies, including Kanban, and contribute to continuous improvement initiatives.

What we require from you :

Bachelor’s degree in Computer Science or related field plus 4+ years of relevant work experience; or a Master's degree plus 2+ years of relevant work experience;

or a PhD plus 0-1 years of relevant experience

In lieu of a degree, qualified candidates would require 8+ years of relevant professional experience

Experience in realizing applications from conception and design to implementation and support, including designing, implementing and operating applications with highly available, optimized and scalable architectures

Experience in an Agile work environment with the ability with the confidence in your ability to implement well-thought solutions.

Extensive experience as a Software Developer, with a proven track record of designing and developing robust, scalable, and high-performance software applications.

Proficiency in one or more programming languages, such as Python, Go, Ruby, or shell scripts. Strong knowledge of infrastructure as code principles and experience with tools like Terraform, CloudFormation, or Ansible.

Experience with cloud platforms, particularly Amazon Web Services (AWS), and relevant certifications like AWS Certified DevOps Engineer or AWS Certified Solutions Architect.

Experience with configuration management tools such as Ansible, Puppet, or Chef.

Solid understanding of CI / CD principles and experience with tools like Jenkins, Git, and GitHub

Knowledge of monitoring and logging tools such as CloudWatch, Prometheus, or ELK stack.

Familiarity with build automation tools like Artifactory and artifact management processes.

Experience with version control systems like Git and collaborative development using GitHub or GitHub Enterprise.

Experience tagging resources, building and maintaining new CI / CD Pipelines and migrating online resources to AWS.

Excellent problem-solving and decision-making abilities, with a focus on driving results and delivering high-quality solutions.

Strong communication and collaboration skills, with the ability to effectively interact with diverse stakeholders, including technical and non-technical audiences.

Experience with on-call support and deployment processes.

Drug Testing :

To be employed in this role, you'll need to clear a pre-employment drug test. Cox Automotive does not currently administer a pre-employment drug test for marijuana for this position.

However, we are a drug-free workplace, so the possession, use or being under the influence of drugs illegal under federal or state law during work hours, on company property and / or in company vehicles is prohibited.

About Cox Automotive

At Cox Automotive, people of every background are driven by their passion for mobility, innovation and community. We transform the way the world buys, sells, owns and uses cars, accelerating the industry with global powerhouse brands like Autotrader, Kelley Blue Book, Manheim and more.

What’s more, we do it all with an emphasis on employee growth and happiness. Drive your future forward and join Cox Automotive today!

About Cox

Cox empowers employees to build a better future and has been doing so for over 120 years. With exciting investments and innovations across transportation, communications, cleantech and healthcare, our family of businesses which includes Cox Automotive and Cox Communications is forging a better future for us all.

Ready to make your mark? Join us today!

Benefits of working at Cox may include health care insurance (medical, dental, vision), retirement planning (401(k)), and paid days off (sick leave, parental leave, flexible vacation / wellness days, and / or PTO).

For more details on what benefits you may be offered, visit our benefits page .

Cox is an Equal Employment Opportunity employer - All qualified applicants / employees will receive consideration for employment without regard to that individual’s age, race, color, religion or creed, national origin or ancestry, sex (including pregnancy), sexual orientation, gender, gender identity, physical or mental disability, veteran status, genetic information, ethnicity, citizenship, or any other characteristic protected by law.

Cox provides reasonable accommodations when requested by a qualified applicant or employee with disability, unless such accommodations would cause an undue hardship.

Statement to ALL Third-Party Agencies and Similar Organizations : Cox accepts resumes only from agencies with which we formally engage their services.

Please do not forward resumes to our applicant tracking system, Cox employees, Cox hiring manager, or send to any Cox facility.

Cox is not responsible for any fees or charges associated with unsolicited resumes.

30+ days ago
Related jobs
Promoted
VirtualVocations
Norcross, Georgia

A company is looking for a Site Reliability Engineer in Remote Kentucky. ...

Cox Automotive
Atlanta, Georgia

Stay abreast of emerging technologies, industry trends, and evolving best practices in DevOps and Site Reliability Engineering and propose innovative solutions to enhance system reliability and performance. Lead and mentor a team of Site Reliability Engineers, providing technical guidance, coaching,...

Promoted
VirtualVocations
Norcross, Georgia

...

QuEST Global Services Pte. Ltd
Atlanta, Georgia

A Reliability Engineer with Instrument background, who will act as a reliability engineer and a technical resource in performing criticality and assigning strategies for all equipment in oil & gas plant. As a team of remarkably diverse engineers, we recognize that what we are really engineering ...

Promoted
VirtualVocations
Marietta, Georgia

...

DICE
Atlanta, Georgia

Collaborate with software engineering teams and other SREs to influence design and architecture decisions to improve system reliability and performance. Atlanta Metro area - on-site on Wednesdays only. Drive the adoption of SRE best practices and ensure adherence to reliability and performance stand...

Cprime
Atlanta, Georgia

We are looking for a high-energy, experienced, customer-focused Site Reliability Engineer (SRE) to work in our Managed Services team. This position will report to the head of Managed Services and work with a team of Engineers and Atlassian Administrators to support a variety of customer requests, au...

Edjuster
Atlanta, Georgia

The Senior Site Reliability Engineer will assist with the design, development, and implementation of the cloud architecture in various cloud, hybrid, and on-premise systems. The Site Reliability Engineer will collaborate with both Information Technology and Business Units to ensure open lines of com...

Veradigm®
Atlanta, Georgia
Remote

As a Sr DevOps Engineer / Site Reliability Engineer on the Veradigm Payer Dev Ops team, you’ll work closely with Business and Technical Leaders from across the organization to manage and monitor our Azure-based cloud solutions. As systems go live, the role will transition to more traditional site re...

GEICO
Atlanta, Georgia
Remote

GEICO is seeking an experienced and visionary SRE Senior Manager to join the organization and aid the establishment and growth of the Site Reliability Engineering (SRE) practice for Hybrid Cloud - Infrastructure as a Service (IaaS). As an SRE Leader, you will be responsible for leading and driving d...