Lead DevOps Engineer
Listed on 2026-05-15
Software Development
About Radiant
Radiant is an El Segundo, CA-based startup building the world’s first mass-produced, portable nuclear microreactors. The company’s first reactor, Kaleidos, is a 1-megawatt, fail-safe microreactor that can be transported anywhere power is needed and run for up to 5 years without refueling. Portable nuclear power with rapid-deploy capability can replace similar-sized diesel generators and provide critical asset support for hospitals, data centers, remote sites, and military bases.
Radiant’s unique, practical approach to nuclear development leverages modern software engineering to rapidly deliver safe, factory-built microreactors that use existing, well-qualified materials. Founded in 2020, Radiant is on track to test its first reactor at the Idaho National Laboratory this summer, with initial customer deliveries beginning in 2028.
Radiant is seeking a driven Technical Lead DevOps Engineer to own software infrastructure, deployment, and automation projects. In this role, you will collaborate across the entire org and work closely with the software team to design scalable, secure, and resilient DevOps practices, tools, and systems. As a technical lead, you will mentor engineers, define team scope, and shape individual responsibilities to build and grow a high-performing DevOps function.
The infrastructure you manage, pipelines you build, and analysis tools you create and maintain will be used to design, run, and analyze the first new reactor design in 50 years. You will have the opportunity to build a strong foundation for production.
Lead a team of DevOps engineers, establishing responsibilities and project scope and providing technical mentorship.
Architect, implement, and maintain infrastructure across AWS and on-premises Linux environments, ensuring high availability, security, and performance for mission-critical systems.
Own and optimize Kubernetes and Docker container orchestration to run simulations, internal tools, and engineering applications at scale.
Design, build, and optimize CI/CD pipelines using Git, Argo, and other automation tools to enable rapid, reliable software delivery across engineering teams.
Architect internal tools supporting build systems, testing frameworks, deployment automation, and developer environments.
Develop computational analysis tools for the digital twin platform, implementing automated profiling and monitoring to optimize HPC workloads for reactor modeling.
Implement and manage infrastructure-as-code practices using Terraform or similar tools to ensure reproducible, version-controlled infrastructure deployments.
Establish and maintain observability, monitoring, and logging infrastructure to ensure system reliability and enable rapid incident response.
Design and maintain high-performance networking infrastructure for distributed simulation systems, optimizing data transfer between HPC clusters.
Work across engineering teams to understand infrastructure requirements and eliminate bottlenecks that impact development velocity.
Qualifications and Skills:
Bachelor's degree in computer science, engineering, or a related technical field.
8+ years of professional experience in DevOps, Site Reliability, Infrastructure, or Platform Engineering.
2+ years in a leadership role, in either a people-management or technical-lead capacity, mentoring engineers and establishing their scope and responsibilities.
Expert-level proficiency in programming languages such as Python, Golang, Rust, C#, or C/C++.
Strong coding skills, with experience developing automation and monitoring tools, internal applications, and infrastructure software.
Expert-level proficiency with Kubernetes and Docker for orchestration and deployment.
Experience with Git and CI/CD tools (Argo, GitLab CI, Jenkins, GitHub Actions), including designing automated build, test, and deployment pipelines.
Qualifications and Skills:
4+ years in a leadership role, in either a people-management or technical-lead capacity, mentoring engineers and establishing their scope and responsibilities.
Experience deploying and maintaining cloud-native applications in production, including hands-on work with…