HPC Vendor Service Manager
Listed on 2026-06-23
IT/Tech
Hardware Engineer
Job Type: Full-time
Location: Prince George
Department: Operations
Reporting to: Technical Operations Manager
Work Location Type: Onsite
Podtech Data Centers Inc., the employing entity, is a proud member of the IREN Group, and we are currently looking to hire!
IREN is a leading next‑generation data center business powering the future with 100% renewable energy. We build, own and operate our data centers and take pride in being at the forefront of sustainable solutions for the ever‑evolving applications of high‑performance compute. We believe that human progress is invaluable, but it should be done in the right way – responsibly, sustainably and having a positive impact on the communities we operate in.
We have grown substantially since 2019, from our inception in Australia to now having several facilities across North America and being listed on NASDAQ… and we are just getting started! By joining us, you will be contributing to the future of sustainable high‑performance compute and the local communities we strive to have a positive impact on.
Job Summary
The HPC Vendor Service Manager is responsible for overseeing all Original Equipment Manufacturer (OEM) and third‑party vendor activities associated with the maintenance, repair, replacement, and support of High‑Performance Computing (HPC) infrastructure within a mission‑critical data center environment. This role serves as the primary interface between the data center operations team and hardware vendors, ensuring timely execution of break‑fix activities, warranty repairs, RMA management, parts logistics, and technical escalations.
The HPC Vendor Service Manager is accountable for vendor performance, service‑level compliance, operational coordination, and maintaining the highest levels of system availability across HPC clusters and supporting infrastructure. The position works closely with Technical Operations, Technology, Asset Management, Logistics, Security, and Customer teams to ensure vendor activities are executed safely, efficiently, and in accordance with site operational standards.
- Act as the primary point of contact for all OEM and third‑party service providers supporting HPC infrastructure.
- Manage daily activities of vendor technicians performing maintenance, diagnostics, hardware replacements, and warranty repairs.
- Ensure all vendor personnel comply with site safety, security, and operational requirements.
- Coordinate vendor access, scheduling, and work execution activities.
- Develop and maintain strong relationships with vendor account teams and field service organizations.
- Conduct regular vendor performance reviews and service‑level assessments.
- Oversee all hardware repair activities affecting HPC servers, storage systems, networking equipment, and supporting technologies.
- Prioritize repair activities based on operational risk and customer impact.
- Coordinate maintenance windows and repair schedules with operations teams.
- Ensure timely resolution of hardware failures and service interruptions.
- Monitor repair progress and escalate delays as required.
- Verify quality of completed repairs and restoration of service.
- Manage the complete Return Material Authorization (RMA) lifecycle.
- Coordinate diagnosis, part replacement, return shipments, and warranty claims.
- Track all RMAs from initiation through closure.
- Maintain visibility of open cases, parts status, and expected delivery timelines.
- Escalate delayed shipments, parts shortages, or vendor response issues.
- Ensure accurate documentation of all hardware replacements and warranty transactions.
- Oversee inventory of critical spare parts and replacement components.
- Coordinate inbound and outbound shipments with logistics providers and vendors.
- Monitor spare parts consumption and recommend inventory adjustments.
- Ensure proper handling, storage, and tracking of replacement hardware.
- Establish and monitor key vendor performance indicators.
- Track response times, repair times, first‑time fix rates, and SLA compliance.
- Identify recurring vendor performance issues and…