IT051 High Performance Computing (HPC) and Storage System Administrator
Listed on 2026-06-16
IT/Tech
Systems Administrator, Unix/Linux
At ADNET Systems, we know that our people are our greatest strength. As a services‑only company, we are committed to fostering an environment where talent thrives, careers grow, and contributions are valued. We believe in promoting from within, providing opportunities for professional development, and offering competitive compensation to ensure long‑term security.
Our entrepreneurial culture encourages autonomy, accountability, and innovation—giving you the freedom to make a real impact. If you’re looking for a dynamic workplace that values your expertise and invests in your future, explore our career opportunities and take the next step with ADNET Systems.
Benefits:
- Military and Family Emergency Leave
- Paid Holidays
- Performance Bonuses
- Medical and Dental Plans
- Flexible Spending Accounts
- Direct Deposit Payroll
- 401(k) Plan with Company Matching and Immediate Vesting
- Short‑ and Long‑Term Disability Insurance
- Life and Accident Insurance
- Tuition Reimbursement
- In‑house Seminars/Workshops/Classes
Position: IT051 High Performance Computing (HPC) and Storage System Administrator
Location: Greenbelt, MD
Job ID: IT051
# of Openings: 1
This position is for a High Performance Computing and Storage System Administrator to support the operations of the Integrated Modeling Computing Center (IMCC), formerly known as the NASA Center for Climate Simulation (NCCS). The IMCC will directly support the Integrated Modeling Virtual Institute (IMVI) to meet NASA's Earth science modeling needs. The core duties, responsibilities, and technical skills are described below.
Ideal candidates have excellent communication and problem‑solving skills and the ability to work efficiently within a high‑performing team environment.
Duties & Responsibilities:
- Full Operational Management: Perform day‑to‑day operations and management of large‑scale supercomputing clusters to meet availability and performance requirements, including, but not limited to, integration, provisioning, software stack deployment, updates, hardware and software maintenance, and decommissioning.
- High‑Performance Storage Administration: Deploy, tune, configure, maintain, and operate massive parallel file systems.
- Workload and Schedule Management: Manage, configure, optimize, and troubleshoot cluster management and job scheduling software.
- Security, Patches, and Compliance: Proactively implement security updates, coordinate systematic operating system kernel patches, and mitigate vulnerabilities across computing and storage environments without compromising system stability.
- Preventative and Corrective Maintenance: Coordinate vendor‑supported maintenance schedules, conduct hardware and software diagnostics, and participate in rapid‑response resolution during service degradations or system blackouts.
- User Support: Provide specialized, tiered technical assistance ranging from software provisioning and workflow optimization to advanced, expert‑level troubleshooting for complex research challenges.
- GPU System Administration: Provision, configure, and maintain GPU‑accelerated computing systems, including driver management, library configuration, and performance optimization for workload acceleration.
Qualifications:
- Expert Linux System Administration: Advanced, production‑level expertise in enterprise Linux distributions (RHEL, Rocky Linux, AlmaLinux, or Ubuntu Server), including expert‑level command‑line proficiency, kernel tuning, and automation scripting (Bash, Python).
- Parallel File Systems Architecture: Hands‑on experience in the design, deployment, scaling, and/or optimization of high‑performance file systems. Experience in deploying, configuring, and operating IBM Spectrum Scale and/or Lustre.
- Scheduling Proficiency: Working familiarity with HPC resource management, including experience with Slurm.
- Systems Security Alignment: Solid foundation in core security practices, including firewalls, identity management (LDAP/Active Directory), access control lists (ACLs), SSH hardening, and continuous patch management cycles.
- Agile Methodologies: Experience operating…