Search jobs > Greensboro, NC > Director site reliability

Director, Site Reliability Engineering (SRE) - Hybrid (Raleigh or Greensboro)

Arch Capital Group
Greensboro, NC
Full-time

With a company culture rooted in collaboration, expertise and innovation, we aim to promote progress and inspire our clients, employees, investors and communities to achieve their greatest potential.

Our work is the catalyst that helps others achieve their goals. In short, We Enable Possibility .

The Director, Site Reliability Engineering (SRE) is a pivotal role in the technology infrastructure team, responsible for ensuring the highest levels of reliability, scalability, and performance.

This leadership role will set the vision and strategic direction for a skilled SRE team, aligning with the strategic objectives of the IT Infrastructure team, and fostering a culture of continuous improvement and operational excellence.

This role will require a deep understanding of cloud-based infrastructure services and technologies, distributed systems, product delivery platforms, DevOps, automation, monitoring and a proactive approach to preventing and mitigating potential issues.

The incumbent must also foster a culture of innovation and collaboration within a team of highly skilled engineers to meet the organization’s evolving needs and deliver a superior digital experience to our product teams and customers.

This is a Hybrid, Twice-a-week onsite role at our Greensboro and Raleigh offices.

Leadership & Strategy

  • Develop and implement a comprehensive SRE strategy that aligns with the IT Infrastructure team, IT and company objectives.
  • Lead the SRE team, setting clear goals and expectations, and providing mentorship and career development opportunities.
  • Collaborate with cross-functional teams to enhance system reliability and efficiency.

Technical Expertise

  • Oversee systems related to the availability of our infrastructure ecosystem, including cloud services and internal tooling.
  • Ensure the team’s deep understanding and expertise in the system architecture, not limited to Kubernetes and OpenShift, but encompassing the entire product delivery stack.

Team Management

  • Manage the SRE team ensuring effective resource allocation and prioritization of POC’s and initiative prioritization.
  • Drive the adoption of best practices in incident management and post-mortem analysis.

Incident Management

  • Be a leader in the response to high-impact infrastructure incidents, ensuring swift resolution and minimal disruption.
  • Implement proactive monitoring and measures to prevent future incidents and improve system resilience.

Communications

  • Articulate the value and accomplishments of the SRE team to stakeholders at all levels.
  • Foster a transparent communication environment within the team and across the organization.
  • Work closely with shared infrastructure services teams (including other SRE teams) within the corporation to establish a productive and transparent partnership and help establish consistent SRE and Infrastructure practices across the company.

Knowledge & Skills :

  • Proven expertise in large-scale complex system engineering and administration including cloud-based infrastructure in Microsoft Azure.
  • Strong leadership skills with the ability to inspire and motivate a high-performing team.
  • Excellent problem-solving abilities and data-driven approach to decision-making.
  • Technical leadership skills, including collaboration, technical problem-solving, and leading complex, mission critical initiatives.
  • In-depth understanding of Kubernetes concepts, components, and APIs with hands-on experience in orchestration of containerized applications using OpenShift (on-premises or in the cloud) Experience with OpenShift’s added-value features such as advanced CI / CD pipelines for containerized product delivery.
  • Experience with GitHub, GitHub Actions, and / or Argo CD or similar technologies.
  • Strong background in working in an agile service delivery methodology arena focusing on iterative service improvement delivery.

Education & Experience :

  • A bachelor’s degree in Computer Science, Engineering, or related field; a master’s degree is preferred.
  • At least 10 years of experience in IT Infrastructure, system administration, or reliability engineering with a minimum of 5 years in a leadership role.
  • A track record of managing complex infrastructure initiatives and leading incident response efforts.

LI-Hybrid

LI-ZP1

30+ days ago
Related jobs
Arch Capital Group
Greensboro, North Carolina

The Director, Site Reliability Engineering (SRE) is a pivotal role in the technology infrastructure team, responsible for ensuring the highest levels of reliability, scalability, and performance. Work closely with shared infrastructure services teams (including other SRE teams) within the corporatio...

Promoted
Avery Dennison
Greensboro, North Carolina

Avery Dennison Corporation (NYSE: AVY) is a global materials science and digital identification solutions company that provides a wide range of branding and information solutions that optimize labor and supply chain efficiency, reduce waste, advance sustainability, circularity and transparency, and ...

Lincoln Financial Group
Greensboro, North Carolina

A leadership team that prioritizes your health and well-being; offering a remote work environment and flexible work hybrid situations. Pay is based on non-discriminatory factors including but not limited to work experience, education, location, licensure requirements, proficiency and qualifications ...

Truist
Greensboro, North Carolina

All regular teammates (not temporary or contingent workers) working 20 hours or more per week are eligible for benefits, though eligibility for specific benefits may be determined by the division of Truist offering the position. Other duties may be performed, both major and minor, which are not ment...

Apex Systems
NC, United States

If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation in using our website for a search or application, please contact our Employee Services Department at employeeservices@apexsystems. We do not discriminate ...

Avery Dennison
Greensboro, North Carolina
Remote

Working independently and with minimum supervision provides technical services at customer’s location and support to sales, marketing or business development within assigned territory. Promptly reports product operating abnormalities and/or failures and obtains samples for evaluation. Operates in th...

Promoted
Zachary Piper
Fort Liberty, NC

DevOps Engineer, CI/CD, cloud computing, AWS, Azure, Kubernetes, Docker, automation, scripting, configuration management, Git, monitoring, troubleshooting, infrastructure as code, Agile, security, collaboration, scalability, performance optimization, container orchestration, continuous integration, ...

Promoted
InsideHigherEd
Greensboro, North Carolina

Director of Research Operations and Environmental Health and Safety. The Joint School of Nanoscience and Nanoengineering (JSNN) combines the genuine excellence of two great universities, North Carolina A&T State University (NC A&T SU) and the University of North Carolina at Greensboro (UNCG), to bri...

Promoted
VirtualVocations
Greensboro, North Carolina

A company is looking for a Preventative Maintenance Program Manager to oversee the preventative maintenance program for HVAC and electrical systems. ...

Promoted
Stretch Zone - 1100
High Point, North Carolina

Employment Type: General Manager. Lead and work harmoniously with co-workers, clients and the general public. ...