Site Reliability Engineer, Discovery
Listed on 2026-01-28
Engineering
Systems Engineer, Software Engineer
IT/Tech
Systems Engineer
Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century’s most innovative companies to the defense industry, Anduril is changing how military systems are designed, built, and sold. Anduril’s family of systems is powered by Lattice OS, an AI‑powered operating system that turns thousands of data streams into a real‑time, 3D command and control center.
As the world enters an era of strategic competition, Anduril is committed to bringing cutting‑edge autonomy, AI, computer vision, sensor fusion, and networking technology to the military in months, not years.
The Discovery team at Anduril is at the forefront of incubating and maturing high‑potential, software‑defined, AI‑native offerings that meet the toughest, newest challenges across hardware, software, space, and cyber domains. We’re the architects of mission autonomy and mesh networking, delivering scalable hardware solutions that meet some of the most urgent national security needs. By working hand‑in‑hand with elite teams in Perception, AI, Motion Planning, Hardware, Test Engineering, Space, Networking, and Vehicle hardware, we craft cutting‑edge, end‑to‑end systems that redefine mission success.
ABOUT THE JOB
As a site reliability engineer in Discovery, you will solve a wide variety of problems involving networking, autonomy, systems integration, robotics, and more, while making pragmatic engineering tradeoffs along the way. Your efforts will ensure that Anduril products seamlessly work together to achieve critical outcomes.
You’ll work closely with software, data, and operations engineers to get systems into the hands of customers, deployed and supported on real infrastructure, with real users.
Above all, Site Reliability Engineers must be driven by a “Whatever It Takes” mindset: executing in an expedient, scalable, and pragmatic way while keeping the mission top of mind and making sound decisions to deliver successful outcomes on time and with high quality.
WHAT YOU’LL DO
- Strengthen Anduril’s operational capabilities by improving our core product offering through root cause analysis and by building tooling capable of managing large‑scale deployments
- Drive continuous organizational improvement by leading post‑mortem events involving diverse stakeholders
- Quickly diagnose and resolve system issues across cloud, robotics, and mesh networking architectures
- Lead the organization in building scalable, sustainable mechanisms to continue delivering to customers at the pace the business is scaling
- Design, develop, and deliver solutions using modern technologies that ensure scalable and fault‑tolerant delivery of systems to the warfighter
- Build strong relationships with internal and external customers to identify technical solutions to their problems
QUALIFICATIONS
- Currently possesses and is able to maintain an active U.S. Top Secret security clearance
- STEM degree or equivalent technical experience
- Technical expertise and demonstrated performance in one or more of the following areas: networking, cloud technologies, application development, hardware design, and/or cybersecurity
- Minimum of 5 years of operations and engineering experience
- Proficiency with IaC tools (Terraform, Ansible)
- Experience with cloud platforms (Azure, AWS, GCP)
- Proficiency in containerization (Docker) and container orchestration (Kubernetes)
- Ability to quickly understand and navigate complex systems and established code bases
- A desire to work on critical software that has a real‑world impact
- Ability to drive consensus across internal and external stakeholders
- Experience developing and delivering solutions to evolving problems in complex environments
- Experience in the technical, programmatic, and operational challenges of developing and deploying autonomous weapon systems across command echelons
- Experience delivering and maintaining systems that run on air‑gapped and security‑hardened networks
- Experience building scalable solutions along with plans for implementation. Not just the end state, but what…