Talent.com
Principal Site Reliability Engineer (Miami)

Kandji • Miami, FL, United States
Job type: Full-time

Principal Site Reliability Engineer

As a Principal Site Reliability Engineer at Kandji, you will play a critical role in ensuring the reliability, scalability, and performance of our platform. In this strategic position, you'll work cross-functionally to build and evolve the systems, tools, and processes that keep our services resilient and performant, especially as we scale to meet the demands of a growing customer base.

You'll bring a deep understanding of distributed systems, incident management, observability, and automation. Your experience with AWS, Kubernetes, and Infrastructure-as-Code (Terraform preferred) will help drive efforts to proactively identify and eliminate reliability risks, reduce toil through automation, and establish engineering best practices across teams.

We're looking for a seasoned engineer with both technical depth and a strategic mindset: someone who can guide long-term reliability efforts, lead postmortems and systemic remediation, and mentor others in SRE principles. This role provides the opportunity to shape the culture and architecture of reliability at Kandji, partnering closely with engineering, infrastructure, and product teams to build systems that are not only functional, but fault-tolerant and maintainable.

How You Will Make a Difference Day to Day:

  • Reliability Strategy & Resilience Engineering: Design and implement fault-tolerant, scalable, and highly available systems across our AWS-hosted platform to ensure reliability under load and failure conditions.
  • Service Ownership & Runbook Maturity: Partner with engineering teams to define and uphold SLIs/SLOs, perform root cause analyses, and drive post-incident reviews with a focus on long-term systemic improvements. Run recurring reliability reviews, and mature incident response practices including alert quality, runbooks, and failure simulations.
  • Automation & Tooling: Build and maintain automation for deployment, incident response, and remediation workflows to reduce manual toil and increase operational efficiency.
  • Secure Systems Design: Hands-on experience implementing DevSecOps practices including secure IaC, policy-as-code, and embedding controls in pipelines or platform abstractions.
  • Observability & Monitoring: Champion the development of comprehensive observability solutions (including metrics, logging, tracing, and alerting) to enable proactive detection and resolution of issues.
  • Infrastructure as Code: Contribute to and improve our Terraform-based infrastructure management, enabling consistent, auditable, and repeatable infrastructure deployments.
  • Capacity Planning, FinOps & Performance: Lead efforts in system tuning, load testing, and capacity forecasting to support our scaling platform and avoid bottlenecks before they occur. Lead efforts to monitor and optimize cloud costs across environments. Design and advocate for architectural trade-offs that balance cost, performance, and reliability.
  • Cross-Functional Reliability Coaching: Embed reliability thinking into engineering and product workflows. Run architecture reviews, failure simulations, and training to elevate operational discipline.
  • Mentorship & Leadership: Mentor engineers across the organization in SRE best practices, incident response, and reliability design patterns, helping build a culture of ownership and operational excellence across the company.

We'd love to hear from you if you have:

  • Experience: 10+ years in Site Reliability Engineering, DevOps, Infrastructure or related roles, with a proven track record of improving system reliability and scaling distributed systems in cloud environments (preferably AWS).
  • Technical Proficiency: Deep expertise in Infrastructure as Code (Terraform strongly preferred), Kubernetes, and container orchestration at scale; strong background in automation, scripting (e.g., Python, Go, or Bash), and CI/CD pipelines.
  • Reliability Engineering Mindset: Experience defining and maintaining SLOs/SLIs, leading incident response and postmortems, and applying SRE principles to reduce toil and improve system reliability. Deep familiarity with chaos engineering, failure mode analysis, and designing systems for graceful degradation under partial failure.
  • Observability & Performance: Strong understanding of modern observability stacks (e.g., Datadog, Prometheus, Grafana, OpenTelemetry) and performance tuning for distributed systems.
  • Security & Compliance Awareness: Solid understanding of security and compliance in cloud environments, with experience implementing secure-by-default infrastructure patterns. Familiar with secure infrastructure design, cloud compliance requirements (SOC2, ISO27001, ISO42001), and embedding DevSecOps into delivery workflows.
  • Problem Solving: Skilled in diagnosing complex, multi-layered production issues and implementing pragmatic, long-term solutions.
  • Influence & Communication: Excellent written and verbal communication skills with the ability to clearly articulate reliability trade-offs and influence engineering teams toward better operational outcomes. Trusted collaborator with product, infra, security, and GTM leaders.
  • Location: Required to work on-site 5x a week in our Miami office (Coral Gables).

Benefits & Perks:

  • Competitive salary
  • 100% individual and dependent medical + dental + vision coverage
  • 401(k) with a 4% company match
  • 20 days PTO
  • Kandji Wellness Week the first week in July
  • Equity for full-time employees
  • Up to 16 weeks of paid leave for new parents
  • Paid Family and Medical Leave
  • Modern Health - Mental Health Benefits - Individual and Dependents
  • Fertility Benefits
  • Working Advantage Employee Discounts
  • Free onsite fitness center
  • Free parking
  • Lunch 5 days/week
  • Exciting opportunities for career growth
  • An outstanding, inclusive culture

We are excited to be serving a significant need for a fast-growing market, and are proud of the high-performing team we have brought together so far. If you're someone who wants to engage in new, exciting projects that will challenge your skills in the best way possible, we would love to connect with you. At Kandji we believe in fostering an inclusive environment in which employees feel encouraged to share their unique perspectives, leverage their strengths, and act authentically. We know that diverse teams are strong teams, and welcome those from all backgrounds and varying experiences.

Kandji is proud to be an equal opportunity employer committed to diversity and inclusion in the workplace. Qualified applicants will be considered for employment without regard to race, color, religion, national origin, age, sex, sexual orientation, gender identity, physical or mental disability, protected veteran or military status or any other status protected by applicable law.
