Search jobs > Hawthorne, CA > Permanent > Site reliability engineer

Site Reliability Engineer, Data (Application Software)

SpaceX
Hawthorne, CA, United States
$140K-$170K a year
Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not.

Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars.

SITE RELIABILITY ENGINEER, DATA (APPLICATION SOFTWARE)

The application software team is the central nervous system of SpaceX - we create mission critical applications that are used throughout SpaceX to accelerate launch vehicle production and flight as well as systems that allow Starlink to grow into a worldwide fast, reliable Internet service.

Our missions support scientific research, classified national security space, and commercial opportunities. Software engineering and innovation is at the core of these programs.

Our team is currently creating and evolving systems to enable rapid build and reuse of Starship as well as scaling the Starlink network.

We have built systems to support concurrent streams of data from many always-on assets to manage the world's largest satellite constellation and the world's largest rocket.

We work directly with engineers across all programs to enable and accelerate the success of Starlink, Starlink, and Starshield.

Aerospace experience is not required to be successful here - rather we look for smart, motivated, collaborative site reliability engineers who love solving problems and want to make an impact on a super inspiring mission.

You will have full ownership of challenging problems, working with a team of enthusiastic engineers to design and produce solutions that enable SpaceX to move towards our goals at a rapid pace.

The success of the missions at SpaceX depends on the software that you and your team produce.

RESPONSIBILITIES :

  • Upgrade existing distributed systems to become sharded and geo-redundant in multiple data centers
  • Advance existing deployment, monitoring, and alerting infrastructure to support a multi-region environment
  • Manage petabyte scale bare metal compute clusters
  • Closely collaborate with engineers across all programs to create highly operable, scalable, and maintainable products
  • Engage throughout the whole software development lifecycle of services from inception to design, deployment, operation, and iterative refinement
  • Focus on performance bottlenecks and performance improvement techniques

BASIC QUALIFICATIONS :

  • Bachelor's degree in computer science, engineering, math, or scientific discipline; OR 2+ years of professional experience building software with site reliability or DevOps in lieu of a degree
  • Experience with Linux operating systems

PREFERRED SKILLS AND EXPERIENCE :

  • 2+ years of rigorous experience with site reliability or DevOps
  • Experience with Kubernetes and Istio for on-premise deployment
  • Experience within-stream, data processing andanalyticsusing open source platforms such as Apache Kafka, Spark, HBase, HDFS, Flink
  • Experience troubleshooting hardware and network-layer issues
  • Programming experience in Python, C#, Java, Scala,Goor similar languages
  • Good understanding of version control, testing, continuous integration, build, deployment and monitoring

ADDITIONAL REQUIREMENTS :

Willing to work extended hours and weekends when needed

COMPENSATION AND BENEFITS :

Pay Range :

Site Reliability Engineer / Level I : $120,000.00 - $145,000.00 / per year

Site Reliability Engineer / Level II : $140,000.00 - $170,000.00 / per year

Your actual level and base salary will be determined on a case-by-case basis and may vary based on the following considerations : job-related knowledge and skills, education, and experience.

Base salary is just one part of your total rewards package at SpaceX. You may also be eligible for long-term incentives, in the form of company stock, stock options, or long-term cash awards, as well as potential discretionary bonuses and the ability to purchase additional stock at a discount through an Employee Stock Purchase Plan.

You will also receive access to comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, paid parental leave, and various other discounts and perks.

You may also accrue 3 weeks of paid vacation & will be eligible for 10 or more paid holidays per year. Exempt employees are eligible for 5 days of sick leave per year.

ITAR REQUIREMENTS :

  • To conform to U.S. Government export regulations, applicant must be a (i) U.S. citizen or national, (ii) U.S. lawful, permanent resident (aka green card holder), (iii) Refugee under 8 U.S.C.
  • 1157, or (iv) Asylee under 8 U.S.C.
  • 1158, or be eligible to obtain the required authorizations from the U.S. Department of State. Learn more about the ITAR here .

SpaceX is an Equal Opportunity Employer; employment with SpaceX is governed on the basis of merit, competence and qualifications and will not be influenced in any manner by race, color, religion, gender, national origin / ethnicity, veteran status, disability status, age, sexual orientation, gender identity, marital status, mental or physical disability or any other legally protected status.

Applicants wishing to view a copy of SpaceX's Affirmative Action Plan for veterans and individuals with disabilities, or applicants requiring reasonable accommodation to the application / interview process should notify the Human Resources Department at (310) 363-6000.

2 days ago
Related jobs
Promoted
Blue Origin
Los Angeles, California

Power user of IBM Rational DOORS Next Generation, or other engineering data management tools such as Jazz CLM applications. You will support and contribute to the architecting, implementing, and execution of configuration and data management processes for system engineering and system safety product...

Promoted
VirtualVocations
Inglewood, California

Key Responsibilities:Build user experiences for gaining insights from large amounts of metric dataCollaborate with product and quality teams to deliver scalable featuresParticipate in operational duties to ensure service delivery qualityRequired Qualifications:3+ years of experience building single-...

Promoted
DISQO
Los Angeles, California

Experience managing infrastructure and configuration of Data Platforms in the Cloud - Data Lake, Data Warehouse, Data Mart, Graph Database, Time Series Database, Object Store, etc. Strong hands-on experience with application monitoring tools (Coralogix, New Relic, DataDog, Prometheus, Grafana, etc. ...

Promoted
VirtualVocations
Inglewood, California

A company is looking for a Staff Software Engineer, Data Infrastructure. Key Responsibilities:Contribute to developing the team and organization's long term strategyRefine and maintain data infrastructure technologies for real-time analysisOwn the data pipeline for surfacing daily events and tools f...

Promoted
Fox Corporation
Los Angeles, California

Fox is hiring a Senior Site Reliability Engineer to help build infrastructure and platforms to support our live direct to consumer APIs for live events such as the Super Bowl, World Cup, and World Series. The senior engineer will serve as an SME for solving thundering herd problems including partner...

Promoted
aKube, Inc.
Santa Monica, California

Primary/Relevant Work experience: Software Engineering with Big Data Experience. Build components of large-scale data platform for real-time and batch processing, and own features of big data applications to fit evolving business needs. Build next-gen cloud based big data infrastructure for batch an...

Amazon Development Center U.S., Inc.
Santa Monica, California

With Amazon Kinesis Data Streams, customers process Gigabytes per second of real-time user engagement data for gaming and marketing analytics, build real-time IoT sensor data analytics solutions, analyze millions of financial transactions in real time, perform network intrusion detection for securit...

City National Bank
Los Angeles, California

SITE RELIABILITY PRINCIPAL ENGINEER WHAT IS THE OPPORTUNITY? As an SRE, you will utilize your software, systems engineering, and operations background to build and run large-scale, fault-tolerant systems. The ideal candidate has significant experience with Platform as a Service cloud such as Cloud F...

Fox Corporation
Los Angeles, California

Fox is hiring a Senior Site Reliability Engineer to help build infrastructure and platforms to support our live direct to consumer APIs for live events such as the Super Bowl, World Cup, and World Series. The senior engineer will serve as an SME for solving thundering herd problems including partner...

Lockheed Martin
California

Most software development is expected to be completed on-site in San Diego within a secure room. Participate in development and integration of the Control and Management software used for submarine communications. Primary responsibilities include designing, implementing, unit testing, and integratin...