
Manager, Datacenter Network Engineering

Job in Seattle, King County, Washington, 98127, USA
Listing for: Runpod
Full Time position
Listed on 2026-02-15
Job specializations:
  • IT/Tech
    Systems Engineer, Network Engineer
Salary/Wage Range or Industry Benchmark: 80,000 - 100,000 USD yearly
Job Description & How to Apply Below

Runpod is pioneering the future of AI and machine learning, offering cutting‑edge cloud infrastructure for full‑stack AI applications. Founded in 2022, we are a rapidly growing, well‑funded, remote‑first company with a global team across the US, Canada, and Europe. Our mission is to create a foundational platform that enables developers and companies to build, deploy, and scale custom AI systems with speed and flexibility.

We are looking for an Engineering Manager, Datacenter Network Engineering to lead the team responsible for designing, deploying, and operating Runpod's global datacenter and backbone network. This role manages engineers working on L2/L3 fabrics, high‑performance GPU networking, and global WAN connectivity that underpin our AI platform.

You will lead execution across multiple regions and vendors while setting technical direction for a network architecture that supports massive east‑west traffic, low‑latency GPU collectives, and secure multi‑tenant isolation. The role is hands‑on at the architectural level but centers on team leadership, operational excellence, and scalability.

Responsibilities
  • Lead the Datacenter Networking Team: Manage and grow a team of network engineers responsible for datacenter fabrics, interconnects, and global WAN connectivity. Provide mentorship, technical guidance, and clear ownership boundaries.
  • Own Datacenter Network Architecture: Define and evolve network designs for GPU‑heavy clusters, including spine‑leaf topologies, ECMP routing, and high‑bandwidth east‑west traffic patterns.
  • High‑Performance GPU Networking: Oversee design and operation of InfiniBand and RoCE‑based fabrics supporting distributed training and inference workloads. Ensure performance, loss characteristics, and congestion control meet AI workload requirements.
  • Encapsulation & Overlay Protocols: Guide implementation and operations of encapsulation technologies such as VXLAN, EVPN, Geneve, or similar, enabling scalable multi‑tenant isolation and flexible network provisioning.
  • Global WAN & Backbone Connectivity: Lead strategy and execution for global WAN connectivity, including private backbone links, IX connectivity, and hybrid connectivity with cloud providers and partners.
  • Reliability & Operations: Establish operational best practices for monitoring, capacity planning, change management, incident response, and post‑mortems across the network stack.
  • Cross‑Functional Collaboration: Partner closely with Infrastructure, SRE, Hardware, and Product Engineering teams to ensure network capabilities align with platform and customer requirements.
  • Vendor & Partner Management: Work with hardware vendors, colocation providers, and transit partners on network design, procurement, deployment timelines, and escalations.
  • Security & Segmentation: Ensure network designs support secure isolation, DDoS resilience, and compliance requirements without compromising performance.
Requirements
  • Engineering Leadership Experience: 3+ years managing network or infrastructure engineering teams, with experience scaling teams and systems in production environments.
  • Datacenter Networking Expertise: 8+ years designing and operating large‑scale datacenter networks, including spine‑leaf architectures, BGP‑based routing, and high‑throughput fabrics.
  • Encapsulation & Overlays: Strong hands‑on experience with VXLAN/EVPN or equivalent encapsulation protocols, including control‑plane and data‑plane considerations.
  • High‑Performance Networking: Proven experience with InfiniBand and/or RoCE, including congestion management, lossless Ethernet concepts, and performance tuning for GPU workloads.
  • Global WAN Experience: Deep familiarity with global WAN technologies, including private backbone design, inter‑region connectivity, routing policy, and traffic engineering.
  • Linux & Network OS Fluency: Comfortable working with Linux‑based systems, network operating systems, and automation tooling.
  • Operational Excellence: Strong background in network observability, incident management, capacity forecasting, and change control.
  • Communication & Leadership: Clear written and verbal communication skills, with the ability to align stakeholders and lead teams…