Head of Infrastructure Operations
Listed on 2026-06-11
-
IT/Tech
Systems Engineer
Location: St. Louis
Nscale is the GPU cloud engineered for AI. We provide cost-effective, high-performance infrastructure for AI start-ups and large enterprise customers. Nscale enables AI-focused companies to achieve superior results by reducing the complexity of AI development. Our GPU cloud bolsters technical capabilities and directly supports strategic business outcomes, including cost management, rapid innovation, and environmental responsibility.
We thrive on a culture of relentless innovation, ownership, and accountability, where every team member takes pride in their work and drives it with excellence and urgency. As an Nscaler, you’ll build trust through openness and transparency, where everyone is inspired to do their best work. If you join our team, you’ll be contributing to building the technology that powers the future.
Role Overview
We are seeking a Head of Infrastructure Operations to lead the end-to-end operational management of Nscale's data centre portfolio across a defined region (EMEA, APAC, Americas). You'll be responsible for ensuring operational excellence, safety, compliance, and reliability across all physical infrastructure, while driving continuous improvement and scaling operations to support rapid business growth. Regular travel to the DCs is essential to the success of this position.
This is a high-impact leadership role where you'll own the strategic direction of data centre operations, manage cross-functional teams, and serve as a critical partner to senior leadership in delivering world-class infrastructure that powers our AI cloud platform.
What You'll Do
Operational Leadership & Strategy
- Own the strategic vision and execution of data centre infrastructure operations across the region, ensuring alignment with Nscale's business objectives and growth plans.
- Establish and maintain operational standards, processes, and procedures that drive efficiency, safety, and reliability across all sites.
- Lead the development and implementation of operational roadmaps that support capacity planning, infrastructure scaling, and service delivery milestones.
- Drive continuous improvement initiatives to optimize costs, reduce downtime, and enhance operational maturity.
Team Management & Development
- Build, mentor, and lead high-performing teams across multiple data centre sites, specifically operations staff.
- Establish clear accountability structures, performance metrics, and development pathways for direct reports and broader teams.
- Foster a culture of ownership, safety, and excellence where team members are empowered to make decisions and drive impact.
- Conduct regular performance reviews, provide constructive feedback, and support career progression.
Physical Infrastructure & Facilities Management
- Oversee Data centre Leads in their execution of day to day Infrastructure Operational procedures, from routine inspections to the handling of ITSM tickets ensuring all SLAs are met.
- Support the Data centre provider (Nscale or Colo) to ensure optimum performance of the facility, including physical infrastructure, power distribution, cooling systems, security, and environmental controls.
- Maintain accurate asset inventory for all AI Infrastructure and supporting hardware and tooling.
- Support the physical security programme, maintaining audit trails, incident documentation and physical security protocols across all sites.
- Coordinate with the wider Nscale teams to ensure infrastructure layouts, rack elevations, and reference architectures are implemented correctly and optimised for efficient operations
Reliability, Safety & Compliance
- Establish and maintain SLOs/SLIs for data centre availability, performance, and incident response.
- Lead incident response and root‑cause analysis for operational failures; own remediation and prevention strategies.
- Ensure full compliance with health and safety regulations, environmental standards, and industry best practices.
- Support ongoing certifications and audits (ISO 27001, ISO 22237, SOC 2, Cyber Essentials Plus, ISO 22301).
- Maintain comprehensive documentation for compliance, audit readiness, and regulatory requirements.
- Manage relationships with critical vendors, contractors, and service…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).