Mgr, Site Reliability Engineering

NetApp
Morrisville, NC, US
$180.2K-$250.3K a year
Full-time

About NetApp

NetApp is the intelligent data infrastructure company, turning a world of disruption into opportunity for every customer.

No matter the data type, workload or environment, we help our customers identify and realize new business possibilities.

And it all starts with our people.

If this sounds like something you want to be part of, NetApp is the place for you. You can help bring new ideas to life, approaching each challenge with fresh eyes.

We embrace diversity and openness because it's in our DNA. Of course, you won't be doing it alone. At NetApp, we're all about asking for help when we need it, collaborating with others, and partnering across the organization - and beyond.

At NetApp, we fully embrace and advance a diverse, inclusive global workforce with a culture of belonging that leverages the backgrounds and perspectives of all employees, customers, partners, and communities to foster a higher performing organization."-George Kurian, CEO

Job Summary

The Site Reliability Engineering (SRE) Manager will lead a dynamic team responsible for ensuring our critical systems' reliability, performance, and efficiency.

This role involves a strategic blend of engineering and operations and requires a strong background in software development, systems engineering, and leadership.

This is a pivotal role in our operations, demanding a dedicated individual who excels in a fast-paced and collaborative environment.

We invite you to apply if you are driven by system reliability and ready to lead a high-performing team.

Job Responsibilities

  • Lead and mentor a team of SREs, fostering a culture of continuous improvement and innovation.
  • Collaborate with product and engineering teams to design and implement scalable solutions.
  • Develop and maintain a reliable monitoring and alerting system to detect and mitigate issues proactively.
  • Drive incident management processes and conduct post-mortem analyses to prevent future outages.
  • Manage priorities, projects, and the overall workflow of the SRE team.
  • Ensure compliance with security best practices and company policies.
  • Stay ahead of industry trends and emerging technologies to continuously improve system reliability and performance.

Job Requirements

  • Minimum of 8 years of experience in SRE, DevOps, or similar roles, with at least 2+ years in a leadership position with direct reports.
  • Experience leading geographically dispersed teams.
  • Proficiency in programming languages such as Python, Go, or Java.
  • Extensive experience with cloud services (AWS, GCP, Azure) and container orchestration tools (Kubernetes, Docker).
  • Solid understanding of CI / CD pipelines and automation tools (Jenkins, Ansible, Terraform).
  • Exceptional knowledge of observability tools and setting up architecture for proactive monitoring of the product.
  • Proven track record of designing and implementing scalable, high-availability systems.
  • Exceptional problem-solving skills and the ability to work under pressure.
  • Excellent communication and team-building skills.

Education

Bachelor’s degree in computer science, Engineering, or a related field; Master’s preferred.

Compensation

The base salary range for this position is $180,200 $250,300 and will be determined by the candidate's location, qualifications, experience, and education.

Final compensation packages are competitive and in line with industry standards, reflecting a variety of factors, and include a comprehensive benefits package.

This may cover Health Insurance, Life Insurance, Retirement or Pension Plans, Paid Time Off (PTO), various Leave options, Performance-Based Incentives, employee stock purchase plan, and / or restricted stocks (RSU’s), with all offerings subject to regional variations and governed by local laws, regulations, and company policies.

Benefits may vary by country and region, and further details will be provided as part of the recruitment process.

Equal Opportunity Employer :

NetApp is firmly committed to Equal Employment Opportunity (EEO) and to compliance with all federal, state and local laws that prohibit employment discrimination based on age, race, color, gender, sexual orientation, gender identity, national origin, religion, disability or genetic information, pregnancy, protected veteran status, and any other protected classification.

Did you know...

Statistics show women apply to jobs only when they're 100% qualified. But no one is 100% qualified. We encourage you to shift the trend and apply anyway! We look forward to hearing from you.

Why NetApp?

We are all about helping customers turn challenges into business opportunity. It starts with bringing new thinking to age-old problems, like how to use data most effectively to run better - but also to innovate.

We tailor our approach to the customer's unique needs with a combination of fresh thinking and proven approaches.

We enable a healthy work-life balance. Our volunteer time off program is best in class, offering employees 40 hours of paid time per year to volunteer with their favorite organizations.

We provide comprehensive medical, dental, wellness, and vision plans for you and your family. We offer educational assistance, legal services, and access to discounts.

Finally, we provide financial savings programs to help you plan for your future.

If you want to help us build knowledge and solve big problems, let's talk.

7 days ago
Related jobs
Promoted
NetApp
Durham, North Carolina

The Site Reliability Engineering (SRE) Manager will lead a dynamic team responsible for ensuring our critical systems' reliability, performance, and efficiency. This role involves a strategic blend of engineering and operations and requires a strong background in software development, systems engine...

Promoted
NetApp
Durham, North Carolina

The Site Reliability Engineering (SRE) Manager will lead a dynamic team responsible for ensuring our critical systems' reliability, performance, and efficiency. This role involves a strategic blend of engineering and operations and requires a strong background in software development, systems engine...

Promoted
VirtualVocations
Durham, North Carolina

A company is looking for a Staff Vice President, Information Technology - Site Reliability Engineering. ...

Arch Capital Group
Raleigh, North Carolina

The Director, Site Reliability Engineering (SRE) is a pivotal role in the technology infrastructure team, responsible for ensuring the highest levels of reliability, scalability, and performance. At least 10 years of experience in IT Infrastructure, system administration, or reliability engineering ...

Promoted
VirtualVocations
Durham, North Carolina

A company is looking for a Site Reliability Engineering (SRE) Lead to deliver mission-critical services that empower end users. ...

First Citizens Bank
Raleigh, North Carolina

Responsibilities Be part of the team that owns the availability, performance and reliability of customer-facing systems Drive adherence to SLOs through monitoring, alerting, and scaling Software Development in an Enterprise Java Environment, including experience with Spring Boot and Python for CICD ...

IDEXX
US, NC, Virtual

Are you interested in working on a fast-paced Agile team, building modern & global LIMS platform? Do you want to work on a product that makes a difference in the day-to-day life of lab operations, veterinarians, and pet owners? Are you a self-starter individual? We are looking for a motivated engine...

First Citizens Bank
Raleigh, North Carolina

As a Site Reliability Engineer you will be responsible for performance, reliability and availability of critical applications for First Citizens Bank. Understanding of Site Reliability Engineering concepts and best practices. Bachelor's Degree and 2 years of experience in Application Engineering OR ...

Pearson
Durham, North Carolina

The Principal Site Reliability Engineering is a pivotal leadership role accountable for guiding Pearson's SRE teams towards increased operational excellence, system reliability, and strategic alignment with organizational objectives. Champion operational excellence by directing initiatives that elev...

Pendo.io
Raleigh, North Carolina

For those interested in a Back End or Site Reliability Engineering (SRE) internship:. Create a real impact on the engineering team(s) by working directly with Product, Design and your team in an Agile environment rapidly developing and releasing Pendo products to our customers. ...