HPC Systems Engineer
Listed on 2026-06-02
-
IT/Tech
Systems Engineer, Systems Administrator
Job Responsibilities
- Installs, configures, and maintains large computer clusters, servers, and software.
- Day‑to‑day operations of the systems, including systems administration, monitoring, and storage performance up to and including network components.
- Manages the system’s network switch, parallel file system, and HPC software stack and tools.
- Configuration of the scheduling and queuing system.
- Diagnoses and resolves system operational problems quickly and effectively.
- Co‑ordinates with vendors to resolve hardware and software problems.
- Assists users with access and other help‑desk ticket requests or issues.
- Builds and deploys open source software and software from vendors/partners.
- Provides reliable and efficient backups and restores for all managed systems.
- Maintains and monitors the security of the HPC systems and servers.
- Documents system administration procedures for routine and complex tasks.
- Linux build automation in a large, distributed computing environment with Puppet, Ansible, Git, and Docker; scripting with Python, Shell, and Perl.
- Implementing automation and monitoring using shell scripting.
- Install, configure, and maintain job‑management tools (SLURM, Moab, TORQUE, PBS).
- Operating‑system deployment with XCAT and ROCKS.
- Configure, administer, and support network storage subsystems (IBM, Net App, Data Direct Network, LSI).
- Distributed file systems (GPFS, Lustre, Gluster).
- Configure, install, tune, and maintain scientific application software, performance‑monitoring, and optimization tools.
- Bachelor’s degree in Computer Science, Electronics Engineering, or a closely related field.
- Five years of progressive experience in scientific computing.
- Five years of experience in Linux build automation with Puppet, Ansible, Git, and Docker; scripting with Python, Shell, and Perl.
- Experience implementing automation and monitoring via shell scripting.
- Experience installing, configuring, and maintaining job‑management tools (SLURM, Moab, TORQUE, PBS).
- Experience deploying operating systems with XCAT and ROCKS.
- Experience configuring, administering, and supporting network storage subsystems (IBM, Net App, Data Direct Network, LSI).
- Experience with distributed file systems (GPFS, Lustre, Gluster).
- Experience configuring, installing, tuning, and maintaining scientific application software and performance‑monitoring and optimization tools.
Benefit Eligibility:
Yes. The University of Chicago offers a wide range of benefits programs and resources for eligible employees, including health, retirement, and paid time off.
Salary: $95,930.00 – $.
Employment Details- Job Type: Salary.
- Scheduled Weekly
Hours:
37.5. - Union Status:
Non‑Union. - Job is Exempt:
Yes. - Drug Test
Required:
No. - Health Screen
Required:
No. - Motor Vehicle Record Inquiry
Required:
No. - Background Check:
All offers of employment are contingent upon a background check that includes a review of conviction history.
The University of Chicago is an equal‑opportunity employer and does not discriminate on the basis of race, color, religion, sex, sexual orientation, gender, gender identity, or expression, national or ethnic origin, shared ancestry, age, status as an individual with a disability, military or veteran status, genetic information, or other protected classes under the law. For additional information please see the University's Notice of Non‑discrimination.
Job seekers in need of a reasonable accommodation to complete the application process may contact 773‑702‑5800 or submit a request via Applicant Inquiry Form.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).