Sr Site Reliability Engineer (Cortex XDR Cloud)

Palo Alto Networks
Santa Clara, CA, US
$124.6K-$201.7K a year
Full-time

Company Description

Our Mission

At Palo Alto Networks, everything starts and ends with our mission:

Being the cybersecurity partner of choice, protecting our digital way of life.

Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we’re looking for innovators who are as committed to shaping the future of cybersecurity as we are.

Our Approach to Work

We lead with flexibility and choice in all of our people programs. We have disrupted the traditional view that all employees have the same needs and wants.

We offer personalization and give our employees the opportunity to choose what works best for them as often as possible, from wellbeing support to growth and development, and beyond!

At Palo Alto Networks, we believe in the power of collaboration and value in-person interactions. This is why our employees generally work from the office three days per week, leaving two days for choice and flexibility to work where you feel most effective.

This setup fosters casual conversations, problem-solving, and trusted relationships. While details may evolve, our goal is to create an environment where innovation thrives, with office-based teams coming together three days a week to collaborate and thrive, together!

Job Description

Your Career

We are looking for a Sr DevOps / SRE to operate, in production, a large-scale GCP cloud environment running our innovative SaaS cybersecurity product, while continuously improving the application deployment, monitoring, operability, and uptime of the service.

The Cortex XDR group specializes in the analysis and visualization of complex cyber data gathered by Palo Alto Networks products.

It combines high-performance algorithms, deep understanding of modern databases, advanced visualization and high-end UI / UX.

Your Impact

  • Work closely and in full coordination with the DevOps and R&D teams to develop new features and maintain high reliability for our SaaS products (XDR, XSIAM, XSOAR and Xpanse)
  • Work with the US and Israeli DevOps teams to provide follow-the-sun operational coverage for our SaaS product in production
  • Build automated tools for cloud operations such as automated remediation of known issues, auto-scaling, etc.
  • Collaborate with the US SRE team to improve the security, cost-efficiency and performance of our SaaS products and internal platform tools
  • Develop and maintain various internal platform applications that enable our engineering teams to securely and effectively develop new product features

Qualifications

Your Experience

  • High proficiency in computer programming
  • High proficiency with Linux
  • High proficiency in developing and orchestrating containerized environments (Kubernetes and Docker)
  • High proficiency in infrastructure management tools like Terraform, Ansible, etc.
  • Ability to grasp new technologies quickly, and to prioritize and multitask across multiple responsibilities
  • Ability to operate independently, make decisions, take action and take responsibility
  • Effective communication and interpersonal skills, with the ability to work with and coordinate across multiple cross-functional and international teams

Desired Experience

  • Proficiency with Google Cloud Platform operations
  • Proficiency in understanding complex Python code bases and writing good Python applications
  • Experience managing multiple Kubernetes clusters
  • Excellent understanding of computer networking, including networking in cloud computing and containerized environments (Kubernetes)
  • Proficiency with Prometheus and other cloud native monitoring solutions
  • Proficiency with CI/CD and configuration management

Additional Information

The Team

Cortex is the industry's only open and integrated, AI-based, continuous security platform. Cortex is a significant evolution of the Application Framework designed to simplify security operations and considerably improve outcomes.

Deployed on a global, scalable public cloud platform, Cortex allows security operations teams to speed the analysis of massive data sets.

You can learn more about Cortex XDR.

Our Commitment

We’re trailblazers that dream big, take risks, and challenge cybersecurity’s status quo. It’s simple: we can’t accomplish our mission without diverse teams innovating, together.

We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at .

Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

All your information will be kept confidential according to EEO guidelines.

The compensation offered for this position will depend on qualifications, experience, and work location. For candidates who receive an offer at the posted level, the starting base salary (for non-sales roles) or base salary + commission target (for sales / commissioned roles) is expected to be between $124,600/yr and $201,650/yr.

The offered compensation may also include restricted stock units and a bonus. A description of our employee benefits may be found.

30+ days ago