Senior/Technical Program Manager - Datacenter Capacity Delivery; E2E
Listed on 2026-06-08
-
IT/Tech
Systems Engineer, Cloud Computing
Senior / Staff Technical Program Manager - Datacenter Capacity Delivery (E2E)
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry‑leading training and inference speeds and empowers machine learning users to run large‑scale ML applications without the hassle of managing hundreds of GPUs or TPUs.
Thanks to the groundbreaking wafer‑scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU‑based hyperscale cloud inference services. This magnitude of speed transforms the user experience of AI applications, unlocking real‑time iteration and increasing intelligence via additional agentic computation.
Role OverviewThe Role
The DC Delivery E2E TPM is the single‑threaded owner for delivering data center capacity from forecast → site strategy → design → construction → infrastructure readiness → go‑live. You will operate as the SSOT (Single Source of Truth) for delivery milestones, risks, and capacity outcomes while orchestrating cross‑functional execution across internal teams and external partners. This is a frontier‑scale role where ambiguity is high, timelines are compressed, and stakes are critical to company growth.
Responsibilities- Own delivery of AI‑optimized data center capacity (colo, build‑to‑suit, retrofits, and owned facilities) from pre‑contract planning through operational readiness.
- Deliver MW‑scale infrastructure aligned to aggressive GPU/AI system deployment targets.
- Drive clarity from ambiguity—translate high‑level demand signals into executable delivery programs.
- Decompose complex build programs into work streams with clear owners, milestones, and deliverables.
- Build integrated plans spanning real estate, power/energy, design, procurement, construction, and deployment.
- Establish critical path visibility and aggressively manage schedule compression.
- Supply chain & long‑lead equipment procurement.
- Security, compliance, and operations readiness.
- Act as the primary interface with colocation providers, EPCs, utilities, and key vendors.
- Identify and drive resolution of critical risks, constraints, and blockers across power, equipment, permitting, and supply chain.
- Own and maintain program budgets, Cap Ex forecasts, and capital allocation narratives.
- Provide structured updates and escalation paths to executive leadership.
- Partner with capacity planning, AI infrastructure, and finance teams to translate model demand into site‑level capacity strategies.
- Align build plans with power availability, network topology, and hardware rollout schedules.
- Continuously optimize for time‑to‑capacity and cost‑per‑MW / cost‑per‑GPU deployed.
- Drive E2E improvements in delivery through standardization of build and commissioning processes, implementation of program tooling and dashboards, post‑mortems and lessons learned loops, and establishing scalable mechanisms to support rapid global expansion.
- Serve as SSOT for program health, milestones, and risks; deliver concise, high‑signal updates to senior executives.
- Operate effectively across distributed teams with up to 50% travel.
- 12‑15+ years in mission‑critical facilities or data center operations.
- Experience managing multi‑site, vendor‑heavy environments.
- Strong expertise in electrical and mechanical systems.
- Proven track record in improving uptime and performance.
- Experience at hyperscalers (Google, Meta, Microsoft, AWS) or neo‑cloud / AI infra companies.
- Familiarity with high‑density AI workloads (liquid cooling, >30kW racks, GPU clusters).
- Utility engagement and power delivery constraints.
- Long‑lead supply chain planning (transformers, switch gear, chillers).
- Commissioning and data center handover processes.
- Ability to operate in high‑growth, ambiguous environments with limited structure.
- Strong executive presence and ability to influence without authority.
- Extreme ownership mindset – treat capacity delivery as a…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).