DevOps Engineer
Listed on 2026-03-01
IT/Tech
Cloud Computing, Systems Engineer, IT Support, Cybersecurity
Overview
Company Description
Join us and make YOUR mark on the World! Lawrence Livermore National Laboratory (LLNL) has turned bold ideas into world-changing impact, advancing science and technology to strengthen U.S. security and promote global stability. Our mission spans four critical national security areas: nuclear deterrence, threat preparedness, energy security, and multi-domain defense, empowering teams to take on the toughest challenges of today and tomorrow.
With a culture built on innovation and operational excellence, LLNL is a place where your expertise can make a real impact.
Job Description
We have an opening for a Development Operations (DevOps) Engineer. You will develop and support a robust, scalable, and operational infrastructure at the intersection of High-Performance Computing (HPC), on-premises cloud-native technologies, and AI/ML software stacks to support, develop, and deploy collaboration tools and services for users of LLNL’s high-performance computers. You will work independently, applying software engineering and DevOps skills on a variety of hardware platforms to enable state-of-the-art collaboration and productivity tools for developers and scientists located worldwide.
This position is in the Livermore Computing Division within the Computing Directorate. The role may be filled at the SES.1 or SES.2 level depending on qualifications, with additional responsibilities at the higher level.
- Build, deploy, support, and enhance LLNL containerized applications and software stacks deployed in LC OpenShift/Kubernetes clusters.
- Identify issues and propose solutions to technical problems across projects to improve DevOps design and implementation practices.
- Perform software engineering using established development practices, tools, and processes for robust software quality, including testing, configuration management, change management, and documentation.
- Collaborate with other technical teams to ensure solutions are secure and integrated with other services.
- Engage directly with HPC customers to provide timely, customer-focused support and guidance.
- Assist with managing OpenShift/Kubernetes container orchestration infrastructure in Linux to meet complex operational, development, and security requirements.
- Investigate and deploy infrastructure monitoring, alerting, and logging tools for the DevOps infrastructure.
- Support and improve automated deployments of infrastructure services and applications with high availability and zero-downtime update principles.
- Work with users and LC/LLNL security to evaluate on-premises and third-party cloud-based AI offerings for suitability and security alignment.
- Perform other duties as assigned.
Additional responsibilities at the SES.2 level
- Implement automation tools to deploy, troubleshoot, and maintain cluster environments within container orchestration environments.
- Design, implement, and manage build and release pipelines.
- Extend Kubernetes to simplify researchers’ usage and operations.
- Provide solutions to moderately complex problems involving identifiable factors.
- Ability to obtain and maintain a U.S. DOE Q-level security clearance which requires U.S. Citizenship.
- Bachelor’s degree in Computer Science, Computer Engineering, or a related field, or an equivalent combination of education and related experience.
- Familiarity with deploying web applications and/or microservices in a containerized environment (e.g., Docker, Podman, Kubernetes, OpenShift).
- Experience with Python, Bash, JavaScript, or similar scripting languages.
- Familiarity with software testing and implementing Continuous Integration pipelines.
- Strong communication skills to collaborate in a multi-disciplinary team environment and work independently as needed.
- Experience creating CI pipelines that automate builds, tests, workflows, tasks, or other processes.
- Fundamental knowledge of the Git version control system (push/pull, rebase, cherry-pick, branching).
- Experience integrating SSL/TLS certificates within applications and services according to security policy.
- Ability to apply innovative approaches and new technologies to defined tasks and projects.
- Ability to set priorities and…