×
Register Here to Apply for Jobs or Post Jobs. X

VP, AI Infrastructure - Highrise.ai

Job in San Francisco, San Francisco County, California, 94199, USA
Listing for: Hut 8
Full Time position
Listed on 2026-02-19
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly USD 100000.00 125000.00 YEAR
Job Description & How to Apply Below

About Hut 8

Imagine the ultimate destination for those who want to work at the cutting edge of technology, energy, and infrastructure. Hut 8 is on a mission to build and operate some of the world’s largest data centers for next‑generation computing workloads, including AI, Colocation, Cloud, and Bitcoin Mining. We are proud to offer interesting and challenging opportunities for individuals who want to build teams, solve problems, and make an impact from day one.

If you’re an ambitious individual looking for a career that is as rewarding as it is challenging, you’ve come to the right place.

About Highrise.ai

, we recognize that the shift from traditional software to AI represents one of the most significant technological transformations of our era. Our mission is to accelerate this shift across industries by providing the cutting‑edge infrastructure needed to build, scale, and deploy AI at an unprecedented level.

Our products empower the development and operation of the world’s most advanced large language models (LLMs), generative models, and computer vision models. By offering highly optimized GPU infrastructure, we enable organizations to unlock the full potential of AI, pushing the boundaries of what’s possible in fields such as natural language processing, image recognition, and beyond.

We are committed to driving innovation and efficiency in AI infrastructure, making it easier for companies to transition from concept to production at scale, with the reliability and performance necessary to stay ahead in a rapidly evolving technological landscape.

About

The Role

Hut 8 is scaling our GPU platform that integrates power, colocation, and compute into a single, operationally owned stack. Customers don’t ask whether we have power or GPUs — they ask who actually runs the operation.

This role exists to be that answer. Hut 8 leadership owns strategy, capital, and customer relationships. You own execution. You are accountable for taking raw, power‑backed infrastructure and turning it into production‑ready, SLA‑backed GPU capacity — s is not a lab role or a traditional enterprise ops role. It is enterprise‑grade operational discipline applied to hyperscale infrastructure, built on direct control of power, facilities, and compute.

Some of the key responsibilities you should expect are the following:

  • You will run the full operational layer between Hut 8’s facilities and the customer:
    • Deployment, commissioning, and production sign‑off
    • Performance, uptime, and SLA ownership
    • Incident response, escalation, and root cause resolution
    • OEM, vendor, and hardware lifecycle management
    • Scaling operations from ~1,100 GPUs to 20,000+ GPUs
  • You are accountable for end‑to‑end managed GPU operations, including:
    • RDMA networking (Infini Band and/or RoCE)
    • Multi‑tenant and single‑tenant production environments
    • 99.9%+ availability targets
    • Repeatable, auditable commissioning processes
    • Enterprise readiness layered onto hyperscale infrastructure
  • You coordinate tightly with Hut 8 teams on:
    • Power, cooling, rack density, and facility readiness
    • Deployment sequencing and capacity expansion
    • Operating constraints driven by energy, thermal, and site realities
  • You build and lead the operational organization:
    • Field operations, deployment, and infra ops teams
    • Hiring, structure, and escalation paths as scale ramps
    • Clear ownership across sites and customers
  • You run a disciplined, hands‑on operating rhythm:
    • Daily stand‑ups on deployment progress, risks, and blockers
    • Direct oversight of GPU failures, RDMA performance, and thermal issues
    • Production sign‑off for each deployment tranche
    • Weekly capacity and readiness planning with facilities
    • Monthly OEM and vendor performance reviews
    • Quarterly planning for expansion, refresh cycles, and new platforms
About You

You are a senior infrastructure operator who has scaled real systems, not just designed them.

  • 10+ years in large‑scale infrastructure or hyperscale data center operations
  • 5+ years operating GPU‑accelerated and/or HPC environments
  • Direct experience deploying and operating 10,000+ GPUs in managed, production settings
  • Deep expertise in RDMA networking (Infini Band and/or RoCE)
  • Proven ownership of 99.9%+ uptime and customer‑facing…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary