Site Reliability Principal Engineer

City National Bank
Los Angeles, United States
$122.5K-$208.7K a year
Permanent
Full-time

Site Reliability Principal Engineer SITE RELIABILITY PRINCIPAL ENGINEER

WHAT IS THE OPPORTUNITY?

As an SRE, you will utilize your software, systems engineering, and operations background to build and run large-scale, fault-tolerant systems.

Your role is to ensure the reliability, scalability and maximum uptime of CNB systems in the Data Center or Cloud Platform.

What you will do

  • Be a technical expert to architect solutions that helps to improve reliability of CNB's software platforms
  • Participate in on-call support and work through all aspects of the Incident Management process, including orchestrating Blameless Post-mortems and encourage the practice within the organization
  • Be a subject matter expert and partner with cross functional teams to develop and maintain technical documentation, network diagrams, runbooks, and procedures
  • Design, build and manage SLIs, SLOs and Error budgets for Availability , Performance / Latency and Throughput for critical services running in production.

Be a proponent of using the SRE core principles in driving product velocity

Create educational documentation on how-to's and and blog about use-cases and architectures that relate to cloud platforms and Observability.

Co-ordinate hackathons and code reviews with goals of continuous improvement in design , build and architectural practices

  • Liaise with the team managing our public cloud environments, including setup, management, and troubleshooting
  • Coach and mentor the junior team members to nurture team productivity and professional development
  • Lead the management , forecasting and budgeting activities to ensure availability of sufficient funding and resourcing has been accounted
  • Provide ongoing and timely feedback to the leadership for improvement of quality practices with client experience being at the paramount of all supporting activities
  • All other appropriate duties as required.

Must-Have*

  • Bachelor's Degree or equivalent
  • Minimum 12 years of experience in an Operational role, DevOps, SRE, or Software Engineering
  • Minimum 8 years of experience doing development in any of Java, NodeJS, .NET Core, Python
  • Minimum 5 years of experience with development or administration on any cloud platforms (Cloud Foundry, Heroku, AWS, Azure, Google Cloud, IBM Cloud, Bluemix, Kubernetes, and others).

The ideal candidate has significant experience with Platform as a Service cloud such as Cloud Foundry)

Minimum 5 years of experience developing applications with an active user base, and deploying to production and going through any change management process (Ideal candidate is able to engage in a detailed discussion about their change management process as well as its happy / pain points)"

Skills and Knowledge

  • Minimum 2 years of Experience with log analytical and management solutions such as Splunk / Elasticsearch and Kibana
  • Minimum 2 years of experience in Monitoring tools such as Datadog, AppDynamics, Dynatrace etc
  • Creativity, energy, and passion for leveraging technology to transform our industry; the belief that automation is the only way
  • A good understanding of modern, cloud centric architectures and DevOps principles
  • Experience with the operational aspects of software systems such as monitoring, centralized logging, and alerting
  • Providing standardized offerings to facilitate and ensure operational health of stacks throughout their lifecycle including metrics collection, aggregation, and visualization, inventory, capacity, and billing / tag management
  • You arepetitive and passionate. You thrive on challenge and have a proven ability to set ambitious but achievable goals and surpass them
  • Demonstrate a team player attitude with a growth mindset to be open to learn and adapt the changing landscape of the industry

Starting base salary : $122,535 - $208,715 per year. Exactpensation may vary based on skills, experience, and location. This job is eligible for bonus and / ormissions.

To be considered for this position you must meet at least these basic qualifications

The preceding job description has been designed to indicate the general nature and level of work performed by employees within this classification.

It is not designed to contain or be interpreted as aprehensive inventory of all duties, responsibilities, and qualifications required of employees assigned to this job.

Benefits and Perks

At City National, we strive to be the best at whatever we do, including the benefits and perks we offer our colleagues. Get an inside look at our Benefits and Perks.

INCLUSION AND EQUAL OPPORTUNITY EMPLOYMENT

City National Bank is an equal opportunity employermitted to diversity and inclusion. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status or any other basis protected by law.'

ABOUT CITY NATIONAL

We start with a basic premise : Business is personal. Since day one we've always gone further than thepetition to help our clients, colleagues andmunity flourish.

City National Bank was founded in 1954 by entrepreneurs for entrepreneurs and that legacy of integrity,munity and unparalleled client relationships continues to drive phenomenal growth today.

City National is a subsidiary of Royal Bank of Canada, one of North America's leading diversified financial servicespanies. Job ID 8753

30+ days ago
Related jobs
Promoted
SpaceX
Hawthorne, California

GNC SITE RELIABILITY ENGINEER (FALCON). SpaceX is looking for a GNC Site Reliability Engineer to operate and scale custom-built mission-critical products for Guidance Navigational and Control (GNC). Bachelor's degree in computer science, information systems/IT, engineering, math, or scientific disci...

Promoted
Robert Half
CA, United States

Currently, I have a client that is in the entertainment industry is looking for a Site Reliability Engineer to join their engineering team. The Site Reliability Engineer position is fully remote. The Site Reliability Engineer should have experience in AWS, Terraform, Kubernetes, and Linux. The tasks...

Promoted
Fox Corporation
Los Angeles, California

Fox is hiring a Staff Site Reliability Engineer to help build and operate infrastructure and platforms to support APIs around our live direct to consumer APIs for major live events such as the Super Bowl, World Cup, and World Series. The staff engineer will serve as an SME for solving thundering her...

Promoted
Circle
Los Angeles, California

As a Senior Site Reliability Engineer at Circle, you will design, build, and maintain Circle’s infrastructure estate to meet the growing worldwide customer base on public cloud providers across multiple regions. Staff Site Reliability Engineer (IV). Senior Site Reliability Engineer (III). Senior Sit...

Promoted
Disney Cruise Line - The Walt Disney Company
Santa Monica, California

Disney Entertainment & ESPN Technology is looking for a Site/System Reliability Engineer to join the Production Platforms team inside the Engineering Services organization. As a Site/System Reliability Engineer, you will play a pivotal role in a highly performant and geographically dispersed tea...

Fox Corporation
Los Angeles, California

Fox is hiring a Principal Site Reliability Engineer - Kubernetes to build and operate infrastructure and platforms to support APIs around our live direct to consumer APIs for major live events such as the Super Bowl, World Cup, and World Series. The principal engineer will additionally work with the...

CoStar Group
CA, Orange County

On-site fitness center and/or reimbursed fitness center membership costs (location dependent), with yoga studio, Pelotons, personal training, group exercise classes, as well as Segways and bikes available for use during the day. ...

Splunk Inc
California, United States

Learn more aboutSplunkcareers and how you can become a part of our journey!Role:Splunk is looking for a TechOps Engineer with the ability to provide day-to-day technical expertise for our Splunk Cloud Azure TechOps team and the Splunk organization. As a TechOps Engineer, you will be interfacing with...

Tencent
Los Angeles, California

Are you passionate about gaming and skilled in managing distributed online systems? Uncapped Games is looking for a Site Reliability Engineer like you! Join us in our quest to revolutionize the Real-Time Strategy (RTS) genre with our groundbreaking new game. ...

City National Bank
Los Angeles, California

SITE RELIABILITY ENGINEER WHAT IS THE OPPORTUNITY? As an SRE, you will utilize your software, systems engineering, and operations background to build and run large-scale, fault-tolerant systems. Your role is to ensure the reliability, scalability and maximum uptime of CNB systems in the Data Center ...