Senior Kubernetes Platforms System Engineer
Listed on 2025-12-22
-
IT/Tech
Systems Engineer, Cloud Computing
Founded in 1999 in the beautiful Smoky Mountains of East Tennessee, Cadre5 provides innovative technical solutions to our customers locally and nationally.
Cadre5 Lab Partners, in partnership with the National Center for Computational Sciences (NCCS) at Oak Ridge National Laboratory (ORNL), is recruiting a Senior Kubernetes Platforms System Engineer. You will work in the Infrastructure team within the HPC Infrastructure and Networking group supporting activities of the supercomputer center.
The interview process begins with a Microsoft Teams meeting that requires the video to be turned on.
This is a full‑time, permanent position that can telecommute. Occasional travel to the Oak Ridge facility may be required.
Why Cadre5- Working with highly talented team members
- 3 weeks vacation
- Excellent medical insurance, including employer‑paid benefits
- Work with the team to define and implement best practices and standards within the organization
- Maintain a reliable, available, and fast Kubernetes platform
- Architect solutions that improve the reliability, scalability, performance, and efficiency of our services
- Investigate and resolve service issues from bare metal up to application code
- Coordinate with vendors to resolve hardware and software problems
- Participate in a 24/7 on‑call rotation for support and off‑hours maintenance windows
- Assist users with Kubernetes usage
- Bachelor’s degree in a scientific field and a minimum of 8 years of relevant experience (an equivalent combination of education and experience will be considered)
- Excellent interpersonal and communication skills and ability to work as part of a team
- Experience with Kubernetes and Red Hat Open Shift (including Open Shift Data Foundations, Advanced Cluster Management for Kubernetes, and Advanced Cluster Security for Kubernetes)
- Experience managing image registries such as Quay or Harbor
- Solid understanding of networked computing environment concepts
- Strong working knowledge of Unix system fundamentals and common network protocols
- Ability to develop and maintain programs and scripts that aid in operation and automation using shell, Python, and Go
- Ability to identify requirements and to define, plan, and implement solutions
- Experience using monitoring tools such as Prometheus, Nagios, and Grafana to create dashboards
- Experience designing and implementing highly available systems and services
- Experience with Infrastructure‑as‑Code tooling such as Terraform, Helm, and Puppet
- Experience with CI/CD tooling and Git Ops
- Experience with code review and familiarity with tools like git, Git Hub, and Git Lab
- Experience implementing system‑level security technologies such as SELinux and following security best practices
- The ability to obtain and maintain a Department of Energy "Q" clearance is required. This requires U.S. citizenship
Cadre5 offers excellent pay and benefits, including full medical, dental, and vision coverage, 401K match, 15 days PTO, and 10 holidays.
Cadre5 is an equal‑opportunity employer. All qualified applicants, including individuals with disabilities and protected veterans, are encouraged to apply. Cadre5 is an E‑Verify Employer.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).