Senior HPC Linux Systems Engineer
Listed on 2025-12-21
IT/Tech
Systems Engineer
Founded in 1999 in the beautiful Smoky Mountains of East Tennessee, Cadre5 provides innovative technical solutions to our customers locally and nationally. Our Cadre5 Lab Partners division has partnered with the High‑Performance Computing Systems Section within the National Center for Computational Sciences (NCCS) at Oak Ridge National Laboratory (ORNL) to recruit a qualified Senior HPC Linux Systems Engineer.
NCCS provides state‑of‑the‑art computational and data science infrastructure, coupled with dedicated technical and scientific professionals who tackle large‑scale problems across a broad range of scientific domains to accelerate scientific discovery and engineering advances. NCCS hosts the Oak Ridge Leadership Computing Facility (OLCF), one of the Department of Energy’s (DOE) National User Facilities, which operates Frontier, the nation’s first exascale supercomputer.
ORNL delivers scientific discoveries and technical breakthroughs needed to realize solutions in energy and national security and provides economic benefit to the nation. This premier research institution located near Knoxville in Oak Ridge, TN, addresses national needs through impactful research and world‑leading research centers.
This is a full‑time, permanent position that allows telecommuting. Occasional travel to the Oak Ridge facility may be required.
Why Cadre5?
- Working with highly talented team members
- 3 weeks’ vacation
- Excellent medical insurance, including employer‑paid benefits
- Lead the architecture and deployment of HPC‑scale services
- Create and maintain internal documentation of system architectures, configurations and procedures
- Serve as the highest tier of support for complex issues, providing quick and efficient resolution
- Design, deploy and manage resources in the NCCS VMware environment
- Identify potential automation targets and lead efforts to automate processes
- Define policies and procedures for automation and configuration management for the team and organization as a whole
- Design and administer RSA SecurID and PingFederate servers
- Deploy, configure and support identity and access management services such as single sign-on (SSO), OAuth, two-factor authentication, zero trust, etc.
- Lead infrastructure projects through all phases from planning to design, implementation and support
- Mentor and train junior staff, creating training documentation, holding knowledge sharing sessions, and fostering skill growth throughout the team
- Propose and implement improvements to existing infrastructure systems as well as new systems, processes and procedures
- Bachelor’s degree in computer science or closely related field and a minimum of 8 years of experience in Linux systems administration, or a Master’s Degree and a minimum of 4 years of experience in Linux systems administration. An equivalent combination of education and experience will be considered.
- The ability to obtain and maintain a Department of Energy "Q" clearance is required. This requires US Citizenship.
- Excellent interpersonal/communication skills and the ability to work within a team
- Strong experience in identity management, supporting SSO, OAuth, and two-factor authentication, primarily in PingFederate and RSA SecurID. Entra experience is a bonus.
- Strong working knowledge of Linux system fundamentals and common network protocols
- Programming and scripting skills in common languages such as Python and Bash
- Understanding of version control and code review tools like GitHub and GitLab
- Experience implementing and supporting highly available systems and services
- Experience with configuration management tools such as Puppet or Ansible
- Experience deploying and maintaining virtual environments using VMware
- Experience deploying, maintaining and troubleshooting a variety of infrastructure services such as OpenLDAP, DNS, DHCP, etc.
- Ability to plan, prioritize and complete assigned projects with minimal supervision
- Excellent pay and benefits, including full medical, dental, and vision coverage coupled with 401K match, 15 days PTO, and 10 holidays.
Cadre5 is an equal opportunity employer.