Location and Salary
Montreal Rd, Ottawa - Hybrid: 1-2 days onsite per week.
$40 - $60 per hour (based on experience).
Tasks
HPC Administrator Tasks
- Maintain an HPC cluster (hardware, image management, local networking, scheduler, backups).
- Troubleshoot the environment when an incident occurs to ensure a quick return to normal operations.
- Meet with scientists and evaluate their requirements for HPC support.
- Develop a task plan to meet scientists' needs and consult the technical authority for approval.
- Application builds and installs, runtime troubleshooting (GNU, Intel, Fortran, Nvidia).
- Support for open‑source and commercial off‑the‑shelf (COTS) software, including Python and Anaconda installs.
- Bash scripts, build/make tools, EasyBuild, and Spack.
- MPI implementations (MPICH, OpenMPI, Intel MPI, HPMPI).
- Assist with in‑house developed applications (compilation and runtime).
- Operating system management (patching schedule, reliability for Linux distributions).
- Accounts (creation, deletion).
- Configuration management via Git, MS DevOps, and Ansible playbooks.
- RPM/DEB packages.
- Environment modules.
- ThinLinc troubleshooting.
- Troubleshooting jobs on schedulers (PBS Pro/Torque, SLURM, SGE).
- Ensure reliable CUDA installs, troubleshoot GPU failures and other CUDA software/driver issues.
- Hardware support (memory upgrades, storage arrays, power and network cabling, iLO).
- Document each process for every task to ensure enterprise knowledge continuity.
The Bidder must demonstrate that the proposed resource has five (5) years of experience within the last ten (10) years in administering HPC (High Performance Computing) systems and performing HPC analyst tasks, as per Statement of Work Section 3.1 Tasks – HPC Administrator Tasks.
To substantiate this experience, the Bidder must provide a project description with the following details for each relevant project:
The Bidder must provide references for two (2) distinct HPC system administration or HPC analyst projects on which the proposed resource worked for more than twelve (12) months. Each reference must have supervised the proposed resource on the project.
At contract award, the Bidder must demonstrate that the organization holds a valid Designated…