Sr. Site Reliability Engineer; Software
Listed on 2026-06-30
-
IT/Tech
Unix/Linux
Sr. Site Reliability Engineer (Application Software)
Location:
Hawthorne, CA
SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars.
OverviewThe application software team is the central nervous system of Space
X. We build mission‑critical platforms that accelerate vehicle software delivery, testing, and operations for every Falcon 9, Starship, and Dragon mission, while powering Starlink’s global growth.
This position will have a meaningful impact on Starship by significantly reducing safety‑critical build and test times for vehicle software. We are looking for a Site Reliability Engineer who brings a strong SRE mindset, cares deeply about safety, quality, and attention to detail, and possesses the ability to understand the big picture before writing code. The ideal candidate fully understands what they are building, enjoys hard problem solving, thinks strategically, and is decisive, organized, and self‑critical.
Responsibilities- Deploy, upgrade, operate, maintain, and scale our suite of mission‑critical products and services.
- Manage our underlying infrastructure as code and use modern observability tools to provide a complete picture of application health.
- Closely collaborate with software engineers to design and build highly operable, maintainable, and testable systems.
- Engage in and improve the entire software development lifecycle — from inception and design through deployment, operation, and continuous refinement.
- Practice sustainable incident response and blameless post‑mortems.
- Provide high‑quality end‑user support to vehicle software engineers.
- Participate in the team’s on‑call rotation.
- Identify and eliminate performance bottlenecks using measurement and creative engineering.
- Bachelor’s degree in computer science, information systems, or an engineering discipline; OR 7+ years of professional experience in SRE or Dev Ops in lieu of a degree.
- 3+ years of experience with Python and Python‑based development frameworks.
- Experience with Linux operating systems.
- Experience with build systems (Bazel, Buck, Make, etc.).
- Experience with container and virtualization technologies (Docker, Kubernetes, vSphere, QEMU, KVM, etc.).
- Experience with databases and data modeling (Postgres, MySQL, Click House, etc.).
- Experience with infrastructure as code (IaC) tools for managing fleets of servers.
- Experience with Terraform, Ansible, Puppet, or similar automation frameworks.
- Knowledge of the technologies that predate and underpin modern cloud infrastructure, with the ability to translate high‑level developer experiences into specific implementations from first principles.
- Ability to work with mission‑critical and sensitive systems with appropriate urgency and care.
- Ability to communicate effectively with customers, peers, and management in both formal and informal settings.
- Must be able to work extended hours and weekends as needed.
Pay Range:
Sr. Site Reliability Engineer: $ – $ per year.
Your actual level and base salary will be determined on a case‑by‑case basis and may vary based on the following considerations: job‑related knowledge and skills, education, and experience.
Base salary is just one part of your total rewards package may also be eligible for long‑term incentives, in the form of company stock, stock options, or long‑term cash awards, as well as potential discretionary bonuses and the ability to purchase additional stock at a discount through an Employee Stock Purchase Plan. You will also receive access to comprehensive medical, vision, and dental coverage, a 401(k) retirement plan, short‑ and long‑term disability insurance, life insurance, paid parental leave, and various other discounts and perks.
You may also accrue 3 weeks of paid vacation and will be eligible for 10 or more paid holidays per year. Employees accrue paid sick leave pursuant to Company policy which satisfies or exceeds the…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).