Who We Are :
SiriusXM and its brands (Pandora, SiriusXM Media, AdsWizz, Simplecast, and SiriusXM Connected Vehicle Services) are leading a new era of audio entertainment and services by delivering the most compelling subscription and ad-supported audio entertainment experience for listeners.
Our vision is to shape the future of audio, where everyone can be effortlessly connected to the voices, stories, and music they love wherever they are.
How you'll make an impact :
The Platform Observability team is looking for a Staff Software Engineer to help design and implement observability solutions that enable deep visibility, insights, alerting, and incident response capabilities for the production software that drives SiriusXM and Pandora.
This Staff Engineer will have the opportunity to collaborate with our internal user base of engineers across all of the SiriusXM product development teams.
What you'll do :
- Lead the design, development, and implementation of observability solutions to monitor the health and performance of the services and infrastructure underpinning SiriusXM's production software.
- Architect and implement scalable systems for collecting, processing, and analyzing telemetry data.
- Develop and maintain dashboards, alerts, and visualizations to provide actionable insights to stakeholders.
- Develop and maintain the platform's observability-as-code library.
- Participate in the cross-org Staff+ Engineer group, serving as a technical resource for all teams in Platform Engineering.
- Automate monitoring and incident response processes to enhance efficiency and reliability.
- Provide technical guidance and mentorship to engineers within the Observability team.
- Collaborate with platform and application teams to establish best practices for instrumentation and monitoring of cloud-native applications.
- Continuously evaluate and optimize observability tools and processes to improve performance and reduce operational overhead.
- Implement cost optimization strategies for observability tools, ensuring efficient utilization of resources while maintaining high-quality monitoring capabilities.
- Analyze usage patterns and identify opportunities to streamline data collection, storage, and analysis to reduce costs without sacrificing visibility or reliability.
- Monitor and track observability-related costs, providing regular reports and insights to management.
- Train team members on cost optimization best practices.
- Provide support and expertise to application teams during on-call rotations.
What you'll need :
- 7+ years of experience in a technical role (e.g., SRE, DevOps, Software Engineer).
- Experience with large software development projects in the cloud (preferably AWS).
- Hands-on experience administering or using observability tools (preferably Datadog).
- Strong familiarity with observability patterns and best practices including SLAs, SLOs, and SLIs.
- Exceptional communication and collaboration skills.
- Fluency in at least two programming languages.
- Must have the legal right to work in the U.S.
Extra credit :
- Deep familiarity with using and/or administering Datadog.
- Experience building observability resources using infrastructure as code, preferably AWS CDK and CloudFormation.