Talent.com
Senior Site Reliability Engineer, Observability and Monitoring Team

Okta for Developers • Bellevue, WA, US
Job type: Full-time

Overview

We are seeking a Senior Site Reliability Engineer with a strong passion for observability to join Okta. You will shape the strategy and execution of our observability services—logs, metrics, and tracing—across the Observability team and the broader organization. Your Kubernetes expertise will guide the design, implementation, and operation of advanced observability capabilities on our replatformed infrastructure.

What You'll Be Doing

  • Develop deep familiarity with a critical SaaS platform used by millions of customers daily, delivering unparalleled observability insights into its behavior and performance.
  • Engage with stakeholders across the organization to understand component boundaries and dependencies, driving adoption of observability best practices and coaching teammates.
  • Champion the evolution of our SDLC by defining how we ideate, onboard, operate, and scale microservices and features in a secure, performant, always-on manner, with observability as a foundational element.
  • Identify and automate manual processes through code and smart architectures to improve collection, analysis, and actionability of observability data.
  • Support a 24x7 online environment as part of a global on-call rotation, rapidly diagnosing and resolving complex incidents.
  • Advocate for scalable, reliable, and resilient systems with a culture centered on observability.

What You'll Bring To The Role

  • 4+ years of experience as a site reliability or platform engineer with a track record of leading observability initiatives.
  • 2+ years designing, scaling, and operating observability solutions in Kubernetes environments.
  • Experience with large-scale containerized deployments and understanding of observability challenges for microservices and monolithic architectures.
  • Proactive, tenacious mindset with a focus on improving system visibility and reliability.
  • Strong mentoring skills and ability to promote robust observability practices across engineering teams.
  • Solid knowledge of CI/CD, Linux fundamentals, OS hardening, networking, and Internet protocols for resilient systems.
  • Proficiency with operational tooling languages such as Python, Rust, or Go for automating observability tasks and integrations.
  • Excellent stakeholder management, translating complex observability concepts into clear, actionable insights.
  • Expertise with Splunk or similar log management tools and Grafana for dashboards and visualization of metrics.

Compensation and Benefits

The annual base salary range varies by location and is complemented by equity (where applicable), bonus, and benefits including health, dental, vision, 401(k), flexible spending, and paid leave in accordance with plans and policies. Salary ranges shown are examples for candidates in the San Francisco Bay Area and other locations; actual base salary depends on skills, experience, and location. For more details, visit Okta Total Rewards.

What you can look forward to as a full-time Okta employee: Amazing Benefits, Making Social Impact, and Talent Development with a connected community at Okta.

About Okta

Okta is The World's Identity Company. We free everyone to safely use any technology, anywhere, on any device or app. Our platforms provide secure access, authentication, and automation, placing identity at the core of business security and growth.

Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, disability, or veteran status. Reasonable accommodations are available on request during the application process.
