Senior AI Infrastructure Architect & HPC Networking Lead
Listed on 2026-06-04
IT/Tech
Systems Engineer, Cloud Computing: Infrastructure & Operations
About the role
Computacenter is looking for an AI Principal Consultant to join its professional services team. We are looking for someone who can work on a dynamic, customer-focused team and who has excellent interpersonal skills.
This role interacts with customers, partners and internal teams to analyse, define and implement large-scale networking projects. The scope of these efforts includes a combination of datacenter-scale networking, system design and automation.
Annual compensation: $175K - $220K USD
What you'll be doing
- Partner with business leaders to deliver services that support company objectives and that are consistent with Winning Together values.
- Design, architect, and implement distributed InfiniBand networks for high-performance computing (HPC) and datacenter environments.
- Providing Ethernet and routing expertise to customers during project delivery to design, architect and test Ethernet networking solutions.
- Designing, implementing, and optimizing high-performance fabric architectures for our data center and infrastructure projects.
- Support operational and reliability aspects of large-scale AI clusters, focusing on performance at scale, training stability, real-time monitoring, logging, and alerting.
- Your primary focus will be on understanding the AI workload and how it interacts with other parts of the system, such as networking, storage, deep learning frameworks and data cleaning tools.
- Work on multi-functional teams to provide Ethernet network expertise to server infrastructure builds, accelerated computing workloads and GPU-enabled AI applications.
- Implementing tasks related to network configuration and validation for datacenters.
- Create methods of procedure and deployment documents.
- Embrace and support Computacenter’s mission and core values.
- Bachelor's degree in Information Technology, Engineering, or related field (or equivalent experience)
- Strong understanding of NVIDIA technologies including DGX Cloud, NVIDIA AI Enterprise software, Base Command Manager, NeMo and NVIDIA Inference Microservices.
- Deep understanding of Kubernetes‑based GPU scheduling, GPU virtualization concepts (fractional GPUs, MIG awareness), and policy‑driven resource allocation in multi‑tenant clusters.
- Experience optimizing cluster‑level GPU utilization, workload throughput, and job latency using Run:AI in conjunction with NVIDIA GPU platforms.
- Strong hands‑on routing experience, including BGP, VXLAN and EVPN.
- Knowledge of cluster management technologies and BCM (Base Command Manager).
- Legally eligible to work in the United States
- Experience with IT service delivery lifecycle and methodologies.
- Demonstrated experience designing, deploying, or operating Run:AI-based GPU orchestration platforms in production environments.
- Ability to design in-depth, complex technical solutions.
- In-depth knowledge of IT Infrastructure technology and its business application.
- Excellent communication and presentation skills, with the ability to present to large internal and external audiences and at board level.
There’s so much more to enjoy about being at Computacenter than just having a rewarding career. In addition to offering competitive compensation plans and long-term career opportunities, we provide an attractive mix of benefit plans to contribute to your good health, future financial security, and peace of mind.