HPC System Engineer
Job in
Plano, Collin County, Texas, 75094, USA
Listed on 2026-06-02
Listing for:
3B Staffing
Full Time position
Job specializations:
- IT/Tech: Systems Engineer, IT Support, Cloud Computing, Cybersecurity
Job Description & How to Apply Below
Contract: 3 to 6 Months + Possibility to extend
Location:
Remote, USA
Required Experience:
The HPC Systems Engineer supports storage, networking, GPU systems, and compute environments, ensuring system performance, availability, and reliability while troubleshooting issues and supporting users.
Storage Administration (NetApp)
- Administer NetApp storage systems (volumes, aggregates, qtrees, snapshots)
- Manage replication technologies (SnapMirror, SnapVault)
- Monitor storage performance (I/O, latency, capacity) and report on trends
- Troubleshoot storage issues impacting HPC workloads
- Maintain backup, recovery, and data protection policies
Network Administration (Arista)
- Configure and maintain Arista switches within HPC environments
- Manage VLANs, ACLs, and link aggregation
- Support network documentation, topology diagrams, and change management
NVIDIA DGX & GPU Systems
- Support NVIDIA DGX systems including health checks, driver updates, and OS maintenance
- Monitor GPU utilization, thermal performance, and interconnects (DCGM, nvidia-smi)
- Troubleshoot and escalate hardware or performance issues
HPC Operations
- Perform system health checks, patching, and firmware updates on HPE servers
- Support HPC schedulers such as Slurm or PBS (queue monitoring, job troubleshooting)
- Maintain documentation, runbooks, and operational logs
Requirements:
- 3-5 years of Linux systems administration or HPC infrastructure experience
- Experience supporting GPU-based systems (NVIDIA preferred)
- Strong command-line troubleshooting across distributed systems
- Solid communication and documentation skills
- Preferred: Advanced Linux experience (7-10+ years)
- Preferred: Experience with Slurm or similar schedulers
- Preferred: Exposure to HPCM or parallel file systems