Search jobs > Orlando, FL > Site reliability engineer

Site Reliability Engineer (SRE)

EA Team Inc.
Orlando, FL, United States
Full-time

Job Title :

Site Reliability Engineer (SRE) or Production Reliability Engineer (PRE)

Job Description : We are seeking an accomplished and driven SRE / PRE Layer 3 to join our forward-thinking team. As an SRE / PRE Layer 3, you will be a key contributor in architecting, designing, and maintaining highly reliable and scalable systems.

You will collaborate with cross-functional teams to develop advanced automation, implement best practices, and drive the evolution of our infrastructure and reliability initiatives.

Responsibilities : Lead the design, implementation, and management of complex systems architecture that emphasizes reliability, scalability, and performance.

Collaborate closely with engineering teams to set and uphold service-level objectives (SLOs) and work on continuous improvements to achieve these goals.

Mentor and guide junior members of the SRE / PRE team, fostering their technical growth and professional development. Solve intricate technical challenges across the entire technology stack, from hardware and infrastructure to applications and databases.

Develop and implement robust automation solutions for deployment, configuration management, and infrastructure provisioning.

Play a pivotal role in capacity planning, performance tuning, and optimizing systems for seamless scalability. Drive the establishment of comprehensive monitoring, alerting, and logging strategies to ensure prompt identification and resolution of issues.

Participate in on-call rotations and respond promptly to incidents, taking ownership of resolution and post-incident analysis.

Continuously advance best practices and processes, promoting a culture of reliability and operational excellence. Collaborate with stakeholders to ensure alignment between development and operations, contributing to product evolution and enhancements.

Qualifications : Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience). 7+ years of experience in an SRE, PRE, or similar role, demonstrating a proven track record in driving system reliability and performance.

Proficiency in programming languages such as Python, Go, or similar for automation and tool development. Expertise in cloud platforms (e.

g., AWS, GCP, Azure) and container technologies (e.g., Kubernetes, Docker). Deep understanding of networking, operating systems, and distributed systems architecture.

Experience with infrastructure as code tools (e.g., Terraform, Ansible) for provisioning and configuration management. Strong grasp of observability tools and practices (e.

g., Prometheus, Grafana, ELK stack). Exceptional troubleshooting skills and the ability to diagnose complex technical issues.

Outstanding communication skills to collaborate effectively with diverse teams. Proactive mindset and a focus on delivering exceptional customer experiences.

Optional : Relevant certifications such as Certified Kubernetes Administrator, AWS DevOps Professional, or similar. (1.) To ensure customer engagement or satisfaction and referenceability (2.

To plan for Program and Delivery Management and ensure that the agreed deliverables in terms of margin are met. (3.) To anchor process improvementorcompliance (human error reporting) and other organizational initiatives (automation , Lean IT implemetation) (4.

To guide, manage, develop, engage the team therby ensuring employee retention (5.) To ensure upskillor creation of resources through internal academiesor trainings and growth rotation

2 days ago
Related jobs
Promoted
Optomi
Orlando, Florida

Optomi, in partnership with our premier client in the entertainment industry, is seeking an experience Site Reliability Engineer to join their team for a hybrid role! The Site Reliability Engineer will thrive in a dynamic, high-energy, and collaborative environment. Hybrid: 2x a week onsite in Orlan...

EA Team Inc.
Orlando, Florida

Site Reliability Engineer (SRE) or Production Reliability Engineer (PRE). SRE, PRE, or similar role, demonstrating a proven track record in driving system reliability and performance. Job Description:We are seeking an accomplished and driven SRE/PRE Layer 3 to join our forward-thinking team. As an S...

Resource Logistics, Inc.
Orlando, Florida

Site Reliability Engineer (SRE) or Production Reliability Engineer (PRE). SRE, PRE, or similar role, demonstrating a proven track record in driving system reliability and performance. Job Description:We are seeking an accomplished and driven SRE/PRE Layer 3 to join our forward-thinking team. As an S...

DApp360 Workforce LLC
FL, US

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. We are seeking a site reliability engineer for engineering operations. Evangelize and enact best practices as experts to guide high-quality Site R...

FIS
Virtual from Any State, FL , United States of America

Site Reliability Engineer (SRE) will focus on Scalability, High Availability, Performance, Stability and Reliability of Software Applications. SRE will build automations to simplify operations and processes, collaborate with cross-functional teams to create proactive engineering mechanisms and ensur...

DApp360 Workforce LLC
FL, US

As SRE/System engineer you will be responsible for running hybrid infrastructure. Are you passionate about blockchain technology? Here is your chance to work with world class builders and researchers with expertise across several domains: Ethereum Protocol Engineering, Layer-2, Decentralized Finance...

Russell Tobin
Bay Lake, Florida

Accountable for/teaching other engineers how to create breakdowns of tasks. Accountable for/teaching other engineers how to complete tasks on time. ...

IGT
FL Statewide, FL, US

IGT PlayDigital is looking for an experienced yet hands-on DevOps Engineer able to thrive in a fast-growing business where change is constant. ...

Splunk Inc
Florida, United States

Learn more aboutSplunkcareers and how you can become a part of our journey!Role:Splunk is looking for a TechOps Engineer with the ability to provide day-to-day technical expertise for our Splunk Cloud Azure TechOps team and the Splunk organization. As a TechOps Engineer, you will be interfacing with...

Splunk Inc
Florida, United States
Remote

Site Reliability Engineers in this role will be engaging with multiple service owners across the platform to teach and implement modern interpretations ofSRE,observability, Chaos Engineering andDevOps. Splunk's Cloud Services group is looking for a Site ReliabilityEngineer to help lead, design and b...