Site Reliability Engineer, Data (Application Software)

SpaceX

Hawthorne, CA

$120K-$145K a year

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not.

Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars.

SITE RELIABILITY ENGINEER, DATA (APPLICATION SOFTWARE)

The application software team is the central nervous system of SpaceX. We create mission-critical applications that are used throughout SpaceX to accelerate launch vehicle production and flight, as well as systems that allow Starlink to grow into a fast, reliable worldwide Internet service.

Our missions support scientific research, classified national security space, and commercial opportunities. Software engineering and innovation are at the core of these programs.

Our team is currently creating and evolving systems to enable rapid build and reuse of Starship as well as scaling the Starlink network.

We have built systems to support concurrent streams of data from many always-on assets to manage the world’s largest satellite constellation and the world’s largest rocket.

We work directly with engineers across all programs to enable and accelerate the success of Starship, Starlink, and Starshield.

Aerospace experience is not required to be successful here - rather we look for smart, motivated, collaborative site reliability engineers who love solving problems and want to make an impact on a super inspiring mission.

You will have full ownership of challenging problems, working with a team of enthusiastic engineers to design and produce solutions that enable SpaceX to move towards our goals at a rapid pace.

The success of the missions at SpaceX depends on the software that you and your team produce.

RESPONSIBILITIES :

Upgrade existing distributed systems to become sharded and geo-redundant in multiple data centers
Advance existing deployment, monitoring, and alerting infrastructure to support a multi-region environment
Manage petabyte scale bare metal compute clusters
Closely collaborate with engineers across all programs to create highly operable, scalable, and maintainable products
Engage throughout the whole software development lifecycle of services from inception to design, deployment, operation, and iterative refinement
Focus on performance bottlenecks and performance improvement techniques

BASIC QUALIFICATIONS :

Bachelor's degree in computer science, engineering, math, or scientific discipline; OR 2+ years of professional experience building software with site reliability or DevOps in lieu of a degree
Experience with Linux operating systems

PREFERRED SKILLS AND EXPERIENCE :

2+ years of rigorous experience with site reliability or DevOps
Experience with Kubernetes and Istio for on-premise deployment
Experience with in-stream data processing and analytics using open source platforms such as Apache Kafka, Spark, HBase, HDFS, and Flink
Experience troubleshooting hardware and network-layer issues
Programming experience in Python, C#, Java, Scala, Go or similar languages
Good understanding of version control, testing, continuous integration, build, deployment and monitoring

ADDITIONAL REQUIREMENTS :

Willing to work extended hours and weekends when needed

COMPENSATION AND BENEFITS :

Pay Range :

Site Reliability Engineer / Level I : $120,000.00 - $145,000.00 per year

Site Reliability Engineer / Level II : $140,000.00 - $170,000.00 per year

Your actual level and base salary will be determined on a case-by-case basis and may vary based on the following considerations : job-related knowledge and skills, education, and experience.

Base salary is just one part of your total rewards package at SpaceX. You may also be eligible for long-term incentives, in the form of company stock, stock options, or long-term cash awards, as well as potential discretionary bonuses and the ability to purchase additional stock at a discount through an Employee Stock Purchase Plan.

You will also receive access to comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, paid parental leave, and various other discounts and perks.

You may also accrue 3 weeks of paid vacation & will be eligible for 10 or more paid holidays per year. Exempt employees are eligible for 5 days of sick leave per year.

ITAR REQUIREMENTS :

To conform to U.S. Government export regulations, applicant must be a (i) U.S. citizen or national, (ii) U.S. lawful, permanent resident (aka green card holder), (iii) Refugee under 8 U.S.C. 1157, or (iv) Asylee under 8 U.S.C. 1158, or be eligible to obtain the required authorizations from the U.S. Department of State.

Learn more about the ITAR.

SpaceX is an Equal Opportunity Employer; employment with SpaceX is governed on the basis of merit, competence and qualifications and will not be influenced in any manner by race, color, religion, gender, national origin / ethnicity, veteran status, disability status, age, sexual orientation, gender identity, marital status, mental or physical disability or any other legally protected status.

Applicants wishing to view a copy of SpaceX’s Affirmative Action Plan for veterans and individuals with disabilities, or applicants requiring reasonable accommodation to the application / interview process should notify the Human Resources Department at (310) 363-6000.

30+ days ago
