
Manager, Distinguished Engineer - DGX Systems Software

Job in Santa Clara, Santa Clara County, California, 95053, USA
Listing for: NVIDIA Corporation
Full Time position
Listed on 2026-06-05
Job specializations:
  • Software Development
    AI Engineer
Salary/Wage Range or Industry Benchmark: 60000 - 80000 USD Yearly
Job Description & How to Apply Below
Locations: US, CA, Santa Clara; US, Remote
Time type: Full time
Posted on: Posted Yesterday
Job requisition: JR2016463

NVIDIA DGX systems are the foundation of the world’s most advanced AI infrastructure—purpose-built servers, workstations, and personal AI computers that bring together GPUs, CPUs, NVLink, NVIDIA Networking, and a fully optimized AI software stack.

We are seeking an engineering leader responsible for end-to-end delivery of every DGX compute system—from firmware through the AI stack to customer deployment. You will ensure each DGX product ships as a production-ready system where firmware, OS, drivers, CUDA, networking, and AI applications work together seamlessly, while driving architecture and roadmap for next-generation platforms.
What you’ll be doing:
* End-to-End Stack Readiness:
Ensure every DGX platform is ready for the full NVIDIA software stack—firmware, DGX OS, GPU drivers, CUDA toolkit, DCGM, DOCA/OFED, and management tools—as a validated, production-quality product. Own the GA SW/FW release process delivering firmware bundles, BaseOS ISOs, and release notes to OEM/OSV partners. Ensure platforms support AI agents like Nemo Claw, Hermes agents, NIM microservices, and workloads customers expect out of the box.
* Platform Firmware Development:
Lead development of the manageability firmware stack (BMC, BIOS) for all DGX platforms. Ensure firmware from partner teams (GPU, CPU, networking) integrates correctly at system level. Manage 3rd-party vendors and drive platform requirements (NVPOR) across all firmware areas.
* Validation Strategy:
Define the validation strategy that proves each DGX platform is production-ready: end-to-end system validation including firmware regression, NVQual certification, DL workload performance, OS/CUDA stack testing, multi-user scenarios, power/thermal validation, and field upgrade reliability. Establish quality gates and zero ship-stopper discipline.
* Platform Bring-Up & Architecture:
Drive platform bring-up for each new DGX system—coordinating first boot across new silicon (CPU, GPU), board design, and firmware teams. Own architectural strategy for next-generation platforms including firmware update mechanisms, system security posture, and AI application readiness.
* Customer Deployment & Enablement:
Ensure firmware release flows meet CSP and enterprise deployment requirements. Represent DGX platform readiness in executive reviews and strategic planning with VP/SVP leadership. Engage with industry standards bodies (DMTF Redfish, OCP).
* Product Delivery Lifecycle:
Own the complete DGX delivery lifecycle—system architecture, firmware development, integration, full-stack validation, GA release, and customer deployment—for every DGX product.
* Cross-Org Alignment:
Serve as single point of accountability for DGX platform readiness across NVIDIA—aligning GPU, CPU, networking, security, OS, and AI software teams to deliver on schedule.
* Quality & Vendor Management:
Own RCCA processes for field issues. Manage external vendor partnerships (AMI for SBIOS, BMC contributors) with clear quality gates and program tracking.
* Team Leadership:
Build and lead a world-class engineering organization. Mentor and develop leaders. Foster a culture of technical excellence, intellectual honesty, and customer obsession.
What we need to see:
* BS or MS in Computer Science, Electrical Engineering, or a related field, or equivalent experience.
* 12+ years overall in systems firmware/software engineering, with 5+ years in engineering leadership.
* Deep expertise in the server system stack, including SBIOS, BMC, OS, and applications, and in system-level integration of complex multi-component products.
* Proven track record delivering multi-generation server or data center platforms from architecture through customer deployment.
* Experience managing engineering organizations across multiple geographies in a matrix environment.
* Strong understanding of server hardware: CPU, GPU, interconnect, memory, PCIe, power delivery.
* Experience owning end-to-end product quality—from…