Director of Engineering, Inference Services
Listed on 2026-01-01
-
Engineering
Systems Engineer, AI Engineer -
IT/Tech
Systems Engineer, AI Engineer
Director of Engineering, Inference Services
Core Weave is The Essential Cloud for AI™. Built for pioneers by pioneers, Core Weave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, Core Weave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, Core Weave became a publicly traded company (Nasdaq: CRWV) in March 2025.
Learn more at
Core Weave is looking for a Director of Engineering to own and scale our next-generation Inference Platform. In this highly technical, strategic role you will lead a world‑class engineering organization to design, build, and operate the fastest, most cost‑efficient, and most reliable GPU inference services in the industry. Your charter spans everything from model‑serving runtimes (e.g., Triton, vLLM, Tensor
RT‑LLM) and autoscaling micro‑batch schedulers to developer‑friendly SDKs and airtight, multi‑tenant security – all delivered on Core Weave’s unique accelerated‑compute infrastructure.
- Vision & Roadmap – Define and continuously refine the end‑to‑end Inference Platform roadmap, prioritizing low‑latency, high‑throughput model serving and world‑class developer UX. Set technical standards for runtime selection, GPU/CPU heterogeneity, quantization, and model‑optimization techniques.
- Platform Architecture – Design and implement a global, Kubernetes‑native inference control plane that delivers
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).