Search jobs > New York, NY > Senior director engineering

Senior Director Operations Engineering , Risk & Problem Management

Equinix
New York
$159K-$271K a year
Permanent
Full-time

Who are we?

Equinix is the world’s digital infrastructure company®, operatingover 250 data centers across the globe.Digital leaders harness Equinix's trusted platform to bring together and interconnect foundational infrastructure at software speed.

Equinix enables organizations to access all the right places, partners and possibilities to scale with agility, speed the launch of digital services, deliver world-class experiences and multiply their value, while supporting their sustainability goals.

Joining our operations team means that you will be at the forefront of all we do, maintaining critical facilities infrastructure as part of a close-knit team delivering best-in-class service to our data center customers.

We embrace diversity in thought and contribution and are committed to providingan equitable work environment that is foundational to our core values as a company and is vital to our success.

Senior Director Operations Engineering , Risk & Problem Management

Equinix is the world’s digital infrastructure company, operating 260+ data centers across the globe and providing interconnections to all the key clouds and networks.

Businesses need one place to simplify and bring together fragmented, complex infrastructure that spans private and public cloud environments.

Our global platform allows customers to place infrastructure wherever they need it and connect it to everything they need to succeed.

We are a fast-growing global company with 21 years of continuous growth. Through our innovative portfolio of high-performance products and services, we have created the largest, most active global ecosystem of 10,000+ companies, including 2,100 networks and 3,000+ cloud and IT service providers in 32 countries spanning six continents and counting!

A leadership role at Equinix means you will drive and collaborate on work that impacts the world. We embrace diversity in thought and contribution and are committed to providing an equitable work environment that is foundational to our core values as a company and is vital to our success.

The Job

The Senior Director of Engineering, Problem & Risk Management, will be responsible for leading the Problem Management and Risk Management function within the Engineering department.

This role is vital in ensuring the operational resilience, stability, reliability and availability of our technology platforms by proactively identifying, analyzing, and mitigating risks, as well as identifying and resolving (recurring) technical issues.

The Senior Director will work closely with cross-functional teams including Incident, Change and Asset Management, (Data Center) Operational Excellence, company-wide Equinix Enterprise Risk, Compliance and Field Operations teams in all three regions, to drive continuous improvement and safeguard the organization’s critical facility assets.

Responsibilities

Leadership & Strategy

Develop and execute the strategic vision for the Problem & Risk Management functions within the Engineering department

Lead and mentor a team of problem managers, risk managers, and engineers, fostering a culture of accountability, innovation, and continuous improvement

Collaborate with executive leadership to ensure alignment of problem and risk management strategies with overall business objectives

Problem Management

Oversee the identification, analysis, and resolution of (recurring) issues across the critical facility assets in all our data centers

Ensure thorough root cause analysis (RCA), utilizing structured methodologies (e.g. 5-Whys, Ishikawa) is conducted for incidents and the identification of systemic issues, and that permanent corrective actions are implemented effectively

Manage the entire lifecycle of problems, from detection and prioritization to documentation and resolution, minimizing impact on business operations

Risk Management

Identify, assess, and mitigate risks related to technology and operations, ensuring the protection of the organization’s assets and operations

Develop and implement risk management frameworks, policies, and procedures in alignment with industry standards

Monitor emerging risks, industry trends, and regulatory changes, adjusting strategies as needed to address potential threats

Physical Audits

Continue and optimize a system of identifying the highest risks data centers where physical audits of critical infrastructure and its operations will have to be performed

Process Improvement

Continuously evaluate and enhance problem and risk management processes to improve efficiency, effectiveness, and responsiveness

Develop and maintain key performance indicators (KPIs), metrics, dashboards, and reports to monitor performance and drive accountability

Lead post-incident reviews and drive cross-functional initiatives to prevent recurrence of issues and mitigate risks proactively

Cross-functional Collaboration

Collaborate closely with Incident-, Change- and Asset management, the overall company risk department, Operational Excellence, the regional Field Ops teams, the Compliance departments and other key stakeholders to ensure problems and risks are managed effectively

Facilitate communication and coordination among teams to minimize downtime, manage risks, and ensure seamless operations

Lead cross-functional task forces during major incidents or risk events, ensuring swift and effective resolution and mitigation

Compliance & Governance

Ensure adherence to regulatory requirements, industry standards, and internal policies related to risk management and problem resolution

Implement governance processes to track and manage risks, incidents, and problems, ensuring compliance with relevant frameworks (e.g. ITIL)

Provide regular reports to senior leadership on the status of problem and risk management activities, including key trends, metrics, and action plans

Stakeholder Communication

Serve as the point of escalation for complex or high-impact problems and risks, ensuring timely resolution and clear communication with stakeholders

Provide executive leadership with regular updates on risk exposure, problem resolution progress, and key initiatives

Build and maintain strong relationships with internal and external stakeholders, ensuring transparency and trust in problem and risk management efforts

Team Leadership

Manages a global team of subject matter experts motivating them around a clear, shared vision

Shape a collaborative, transparent, and entrepreneurial culture, in tandem with the extended global operations leadership and the corporate leadership team

Qualifications

Education

Strong intellect ideally with a bachelor’s or master’s degree or equivalent experience in engineering or related field

Experience

Substantial experience in engineering in data centers or a related field such as mission critical, oil and gas, chemicals, aerospace or transportation with a core focus on problem management and risk management

Demonstrable leadership experience as a manager of leaders (directors), steering similar-sized cross-functional teams (c20) in a complex, fast-paced environment

Proven experience in implementing and managing problem and risk management processes aligned with industry standards (e.g., ITIL)

Skills

Strong technical background with deep knowledge of critical facility infrastructure, data center operations, risk management and operational environments

Exceptional problem-solving and analytical skills, with the ability to assess complex issues and risks and drive effective solutions

Excellent communication and interpersonal skills, with the ability to influence and build relationships at all levels of the organization

Experience with risk assessment methodologies, root cause analysis (RCA), and incident management tools

Familiarity with relevant tools and technologies for monitoring, logging, managing incidents / problems, and assessing risks

Attributes

Consistent record in seeing the bigger long-term picture’ and thinking strategically while achieving short-term results

Tight-knit collaboration skills and confirmed ability to work cross-functionally with diverse business unit leaders

Outstanding ability to influence and cultivate deep relationships with a range of internal stakeholders (especially sales) to drive customer value

Demonstrated ability to communicate effectively and persuasively

Proactive, transparent and inclusive 'in-service' authentic leadership style able to work comfortably in a non-hierarchical, matrixed context comprising a broad range of collaborators

Inspirational leader adept at inspiring behavior change through motivating teams, planning initiatives, designating priorities, and being decisive when faced with ambiguity

Work Environment

Able and willing to travel occasionally (6-10 times / year) to various company locations and partner sites

The Senior Director of Engineering, Problem & Risk Management, will work in a collaborative environment with a mix of on-site and remote work arrangements

Fluency in English is essential

Equinix is an Equal Employment Opportunity and, in the U.S., an Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to unlawful consideration of race, color, religion, creed, national or ethnic origin, ancestry, place of birth, citizenship, sex, pregnancy / childbirth or related medical conditions, sexual orientation, gender identity or expression, marital or domestic partnership status, age, veteran or military status, physical or mental disability, medical condition, genetic information, political / organizational affiliation, status as a victim or family member of a victim of crime or abuse, or any other status protected by applicable law.

Résumé du poste

Le directeur principal de l'ingénierie, gestion des problèmes et des risques, sera chargé de diriger la fonction de gestion des problèmes et des risques au sein du département de l'ingénierie.

Ce rôle est essentiel pour assurer la résilience opérationnelle, la stabilité, la fiabilité et la disponibilité de nos plateformes technologiques en identifiant, analysant et atténuant les risques de manière proactive, ainsi qu'en identifiant et en résolvant les problèmes techniques (récurrents).

Le directeur principal travaillera en étroite collaboration avec des équipes interfonctionnelles, notamment la gestion des incidents, des changements et des actifs, l'excellence opérationnelle (centre de données), le risque d'entreprise d'Equinix à l'échelle de l'entreprise, la conformité et les équipes des opérations sur le terrain dans les trois régions, afin de favoriser l'amélioration continue et de protéger les actifs critiques des installations de l'organisation.

Responsabilités

Leadership et stratégie

Élaborer et mettre en œuvre la vision stratégique des fonctions de gestion des problèmes et des risques au sein du département Ingénierie

Diriger et encadrer une équipe de gestionnaires de problèmes, de gestionnaires de risques et d'ingénieurs, en favorisant une culture de la responsabilité, de l'innovation et de l'amélioration continue

Collaborer avec la direction pour garantir l'alignement des stratégies de gestion des problèmes et des risques sur les objectifs généraux de l'entreprise

Gestion des problèmes

Superviser l'identification, l'analyse et la résolution des problèmes (récurrents) dans les installations critiques de tous nos centres de données

S'assurer qu'une analyse approfondie des causes profondes (RCA), utilisant des méthodologies structurées (par exemple 5-Whys, Ishikawa), est menée pour les incidents et l'identification des problèmes systémiques, et que des actions correctives permanentes sont mises en œuvre efficacement

Gérer l'ensemble du cycle de vie des problèmes, depuis la détection et la hiérarchisation jusqu'à la documentation et la résolution, en minimisant l'impact sur les activités de l'entreprise

Gestion des risques

Identifier, évaluer et atténuer les risques liés à la technologie et aux opérations, en veillant à la protection des actifs et des opérations de l'organisation

Élaborer et mettre en œuvre des cadres, des politiques et des procédures de gestion des risques conformes aux normes du secteur

Surveiller les risques émergents, les tendances du secteur et les changements réglementaires, et adapter les stratégies en fonction des besoins pour faire face aux menaces potentielles

Audits physiques

Poursuivre et optimiser un système d'identification des centres de données présentant les risques les plus élevés, où des audits physiques de l'infrastructure critique et de ses opérations devront être réalisés

Amélioration des processus

Évaluer et améliorer en permanence les processus de gestion des problèmes et des risques afin d'accroître l'efficience, l'efficacité et la réactivité

Élaborer et tenir à jour des indicateurs de performance clés, des mesures, des tableaux de bord et des rapports afin de contrôler les performances et de favoriser la responsabilisation

Diriger les examens post-incidents et mener des initiatives interfonctionnelles afin d'éviter que les problèmes ne se reproduisent et d'atténuer les risques de manière proactive

Collaboration interfonctionnelle

Collaborer étroitement avec la gestion des incidents, des changements et des actifs, le département des risques de l'entreprise, l'excellence opérationnelle, les équipes régionales d'opérations sur le terrain, les départements chargés de la conformité et d'autres parties prenantes clés afin de garantir une gestion efficace des problèmes et des risques

Faciliter la communication et la coordination entre les équipes afin de minimiser les temps d'arrêt, de gérer les risques et d'assurer la continuité des opérations

Diriger des groupes de travail interfonctionnels lors d'incidents majeurs ou d'événements à risque, en veillant à ce que les problèmes soient résolus et atténués rapidement et efficacement

Conformité et gouvernance

Garantir le respect des exigences réglementaires, des normes industrielles et des politiques internes relatives à la gestion des risques et à la résolution des problèmes

Mettre en œuvre des processus de gouvernance pour suivre et gérer les risques, les incidents et les problèmes, en veillant à la conformité avec les cadres pertinents (par exemple, ITIL)

Fournir des rapports réguliers à la direction générale sur l'état des activités de gestion des problèmes et des risques, y compris les principales tendances, les mesures et les plans d'action

Communication avec les parties prenantes

Servir de point d'escalade pour les problèmes et les risques complexes ou à fort impact, en veillant à une résolution rapide et à une communication claire avec les parties prenantes

Fournir à la direction générale des mises à jour régulières sur l'exposition aux risques, l'avancement de la résolution des problèmes et les initiatives clés

Établir et maintenir des relations solides avec les parties prenantes internes et externes, en assurant la transparence et la confiance dans les efforts de gestion des problèmes et des risques

Direction d'équipe

Gérer une équipe mondiale d'experts en la matière en les motivant autour d'une vision claire et partagée

Façonner une culture de collaboration, de transparence et d'entreprise, en tandem avec la direction élargie des opérations mondiales et l'équipe de direction de l'entreprise

Qualifications

Formation

Idéalement, une licence ou un master ou une expérience équivalente en ingénierie ou dans un domaine connexe

Expérience

Expérience substantielle en ingénierie dans les centres de données ou dans un domaine connexe tel que les missions critiques, le pétrole et le gaz, les produits chimiques, l'aérospatiale ou les transports, avec un accent particulier sur la gestion des problèmes et la gestion des risques

Expérience démontrée en matière de leadership en tant que responsable de dirigeants (directeurs), dirigeant des équipes interfonctionnelles de taille similaire (c20) dans un environnement complexe et en évolution rapide

Expérience avérée de la mise en œuvre et de la gestion de processus de gestion des problèmes et des risques alignés sur les normes industrielles (par exemple, ITIL)

Compétences

Solide expérience technique avec une connaissance approfondie de l'infrastructure des installations critiques, des opérations des centres de données, de la gestion des risques et des environnements opérationnels

Compétences exceptionnelles en matière de résolution de problèmes et d'analyse, avec la capacité d'évaluer des problèmes et des risques complexes et de trouver des solutions efficaces

Excellentes compétences en matière de communication et de relations interpersonnelles, avec la capacité d'influencer et d'établir des relations à tous les niveaux de l'organisation

Expérience des méthodologies d'évaluation des risques, de l'analyse des causes profondes (RCA) et des outils de gestion des incidents

Connaissance des outils et technologies pertinents pour la surveillance, l'enregistrement, la gestion des incidents / problèmes et l'évaluation des risques

Attributs

Capacité constante à avoir une vision globale à long terme et à penser de manière stratégique tout en obtenant des résultats à court terme

Aptitude à collaborer étroitement et capacité confirmée à travailler de manière transversale avec divers responsables d'unités opérationnelles

Capacité exceptionnelle à influencer et à cultiver des relations profondes avec une série de parties prenantes internes (en particulier les ventes) afin de créer de la valeur pour le client

Capacité avérée à communiquer de manière efficace et convaincante

Style de leadership authentique, proactif, transparent et inclusif, capable de travailler confortablement dans un contexte matriciel non hiérarchique comprenant un large éventail de collaborateurs

Leader inspirant, capable de susciter des changements de comportement en motivant les équipes, en planifiant des initiatives, en définissant des priorités et en faisant preuve de détermination face à l'ambiguïté

Environnement de travail

Capable et désireux de voyager occasionnellement (6 à 10 fois par an) sur les différents sites de l'entreprise et de ses partenaires

Le directeur principal de l'ingénierie, de la gestion des problèmes et des risques, travaillera dans un environnement collaboratif avec une combinaison de travail sur site et à distance

The United States targeted pay range for this position in the following location is / locations are :

San Francisco, CA / Bay Area : $179,000 to $304,000 per year

California (Non-SF / Bay Area), Connecticut, Maryland, New York, New Jersey, Washington state : $172,000 to $293,000 per year

Colorado, Nevada, Rhode Island : $159,000 to $271,000 per year

Our pay ranges reflect the minimum and maximum target for new hire pay for the full-time position determined by role, level, and location.

Individual pay is based on additional factors including job-related skills, experience, and relevant education and / or training.

This position may be offered in other locations. Your recruiter can share more about the specific pay range for your preferred location during the hiring process.

The targeted pay range listed reflects the base pay only and does not include bonus, equity, or benefits. Employees are eligible for bonus, and equity may be offered depending on the position.

As an employee, you become important to Equinix’s success. Details about our company benefits can be found at the following link :

Equinix is committed to ensuring that our employment process is open to all individuals, including those with a disability.

If you are a qualified candidate and need assistance or an accommodation, please let us know by completing .

Equinix is an Equal Employment Opportunity and, in the U.S., an Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to unlawful consideration of race, color, religion, creed, national or ethnic origin, ancestry, place of birth, citizenship, sex, pregnancy / childbirth or related medical conditions, sexual orientation, gender identity or expression, marital or domestic partnership status, age, veteran or military status, physical or mental disability, medical condition, genetic information, political / organizational affiliation, status as a victim or family member of a victim of crime or abuse, or any other status protected by applicable law.

30+ days ago
Related jobs
Promoted
Capital One
Queens, New York

The second-line Chief Tech Risk Officer (CTRO) and the Technology Risk Management (TRM) organization have broader responsibilities for cybersecurity but also reliability, software quality, resilience, and other technology risks. West Creek 3 (12073), United States of America, Richmond, VirginiaSenio...

Promoted
Charter School Business Management
New York, New York

Senior Director of Accounting Quality and Compliance. The Senior Director of Accounting Quality and Compliance will be responsible for. At least 4 years of project management experience. Project Management Software – Monday, MS Project, Asana, Trello, or similar software. ...

Promoted
Capital One
Queens, New York

About the team: As the Senior Director of Technical Program Management in Capital One’s Enterprise Data and Machine Learning (EDML) division, you’ll spearhead strategic initiatives to enhance data management, drive innovation, and optimize operational efficiencies. Strong technical backgrounds (idea...

Promoted
Broadgate
New York, New York

Stakeholder Engagement: Collaborate effectively with cross-functional teams and senior management to ensure stakeholder buy-in and alignment with vendor management practices. Tool Selection and Management: Research, evaluate, and select tools across the vendor management space that best meet busines...

Promoted
Capital One
Bronx, New York

About the team: As the Senior Director of Technical Program Management in Capital One’s Enterprise Data and Machine Learning (EDML) division, you’ll spearhead strategic initiatives to enhance data management, drive innovation, and optimize operational efficiencies. Strong technical backgrounds (idea...

Promoted
Capital One
Queens, New York

About the team: As the Senior Director of Technical Program Management in Capital One’s Enterprise Data and Machine Learning (EDML) division, you’ll spearhead strategic initiatives to enhance data management, drive innovation, and optimize operational efficiencies. Strong technical backgrounds (idea...

BrightSpring Health Services
New York, New York

Director of Client Services - Account Management. All Account Management oversight in the State of NY for all types of customers to include Skilled Nursing facilities, Senior Living, IDD, Detox, & Group Homes. The Sr Director of Client Services - Account Management cultivates and maintains on-going ...

Promoted
Capital One
Willowbrook, New York

Center 3 (19075), United States of America, McLean, VirginiaSenior Director of Technical Program Management (AXT Program Delivery Office)Are you interested in leading programs that deliver on critical business goals and build large scale products & platforms?At Capital One, we’re a bank, but we don’...

Live Nation Worldwide, Inc.
New York, New York

The Director of Risk Management is a key member of the Corporate Risk Management Team, which supports the global operations of Live Nation. Oversight of company merger, acquisition and divestiture activities as relates to risk management and insurance operations, including integration management int...

PricewaterhouseCoopers Advisory Services LLC
New York, New York
Remote

As part of PwC’s Technology Operations (IT4IT) practice, the Service Management and Operations capability helps our clients transform their business through innovative technology solutions and effective Service Management Operations. As a Senior Manager, you'll work as part of a team of problem solv...