Talent.com
Distinguished Software Engineer, Reliability Infra

LinkedIn
Mountain View, CA, United States
Job type: Full-time

LinkedIn is the world's largest professional network, built to create economic opportunity for every member of the global workforce. Our products help people make powerful connections, discover exciting opportunities, build necessary skills, and gain valuable insights every day. We're also committed to providing transformational opportunities for our own employees by investing in their growth. We aspire to create a culture that's built on trust, care, inclusion, and fun where everyone can succeed.

Job Description

At LinkedIn, our approach to flexible work is centered on trust and optimized for culture, connection, clarity, and the evolving needs of our business. The work location of this role is hybrid, meaning it will be performed both from home and from a LinkedIn office on select days, as determined by the business needs of the team.

This role will be based in Sunnyvale, CA or San Francisco, CA.

Responsibilities

  • Serve as a senior technical leader driving the long-term reliability and observability strategy across LinkedIn's infrastructure
  • Re-architect LinkedIn's backend systems to enable granular failure domains and reduce the blast radius of incidents
  • Design and implement next-generation failure mitigation strategies that avoid full-region or full-datacenter failovers
  • Partner closely with engineers across many disciplines to raise the bar for operational excellence and incident response
  • Define and build frameworks to improve monitoring, alerting, and observability across hundreds of services and systems
  • Define and own the roadmap for bringing observability to critical user journeys in LinkedIn's products, to help capture and improve the experience of LinkedIn's members and customers
  • Spearhead a multi-year initiative to transition LinkedIn's infrastructure to a regionalized model with localized failover, enhancing both scalability and availability
  • Lead technical discussions on the future of Engineering at LinkedIn, including what the function should evolve into over the next 3-5 years
  • Deliver key insights and executive-level reporting across cross-functional engineering teams to enable the right business decisions for improving the quality and reliability of our services and products
  • Act as a force multiplier by mentoring engineers, influencing technical direction across orgs, and contributing deeply to culture, hiring, and technical excellence
  • Lead incident response and post-incident reviews to identify root causes and implement preventive measures
  • Develop and maintain incident management processes and procedures to ensure timely resolution of issues and minimize impact on customers

Qualifications

Basic Qualifications

  • 15+ years of software engineering experience
  • 8+ years focused on infrastructure, reliability engineering, or distributed systems

Preferred Qualifications

  • Hands-on experience with large-scale incident response, root cause analysis, and resiliency engineering
  • Strong communication and cross-functional collaboration skills, with experience influencing across multiple orgs and leadership levels
  • Proven success designing and leading architectural transformations at internet-scale companies
  • Deep knowledge of systems reliability, observability frameworks, and fault-tolerant architecture design
  • Experience with multi-region architecture, capacity planning, and failover strategies in large-scale cloud or hybrid environments
  • Background in CI/CD, platform reliability, and automation of ops-heavy systems
  • Familiarity with modern observability stacks (e.g., OpenTelemetry, Prometheus, Grafana) and service mesh architecture
  • Track record of setting long-term technical strategy and driving systemic improvements in availability and performance
  • Previous experience in a Distinguished Engineer or equivalent role at a high-growth or web-scale technology company

Additional Information

LinkedIn is committed to fair and equitable compensation practices. The pay range for this role is $238,000 to $390,000. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to skill set, depth of experience, certifications, and specific work location.

Equal Opportunity Statement

We seek candidates with a wide range of perspectives and backgrounds and we are proud to be an equal opportunity employer. LinkedIn considers qualified applicants without regard to race, color, religion, creed, gender, national origin, age, disability, veteran status, marital status, pregnancy, sex, gender expression or identity, sexual orientation, citizenship, or any other legally protected class.
