Senior Systems Software Engineer, Windows and Linux Enablement - DGX Station
Listed on 2026-06-12
-
Software Development
Software Engineer, DevOps, Python, Unix/Linux
DGX Station is NVIDIA’s next‑generation personal AI supercomputer—a deskside workstation built on the NVIDIA Grace Blackwell GB300 Superchip with massive coherent CPU+GPU memory, designed to bring data‑center‑class AI capabilities directly to the desks of researchers, developers, and AI engineers.
As NVIDIA brings DGX Station to a broad set of customers, we need an engineer who can own full‑stack OS enablement—from firmware and drivers through OS integration to ensuring AI applications run seamlessly on day one, with a primary focus on Windows and strong coverage of Linux.
This is a hands‑on, technically deep role where you will be the go‑to engineer for making DGX Station a first‑class Windows platform while also driving its Linux bring‑up and validation. You will work across NVIDIA’s GPU driver, CUDA, firmware, BMC, and AI software teams, collaborate closely with Microsoft and ODM/OEM partners, and ensure that developers and enterprise customers have a polished, production‑ready experience on DGX Station across both operating systems.
WhatYou’ll Be Doing
- Windows Platform Ownership (primary):
Own end‑to‑end Windows enablement for DGX Station—driving the platform from initial bring‑up on Windows through WHQL certification to customer‑ready shipping quality. You are the single point of accountability for “DGX Station works on Windows.” - Linux Bring‑up & Enablement:
Drive Linux bring‑up and continuous enablement for DGX Station on DGX OS / Ubuntu, including kernel module integration, device tree and ACPI configuration, systemd services, initramfs, and dkms packaging. Partner with the DGX OS and kernel teams to land platform support upstream and in NVIDIA’s distribution. - Firmware & Driver Enablement:
Enable and validate BIOS/UEFI, BMC, and system‑level firmware for Windows and Linux on the Grace (Arm) + Blackwell GB300 architecture. Work with firmware teams to ensure ACPI tables, SMBIOS, Secure Boot, measured boot, power management, and hardware abstraction layers are correct on both OSes. - GPU Driver Integration:
Coordinate GPU driver, display driver, and compute driver bring‑up and validation on Windows (WDDM, MCDM) and Linux (open‑gpu‑kernel‑modules, DRM/KMS). Work with the NVIDIA driver team and Microsoft to resolve compatibility issues, achieve WHQL certification, and ensure driver stability across Windows Update and Linux kernel revisions. - CUDA & AI Stack Readiness:
Ensure the CUDA toolkit, cuDNN, Tensor
RT, NCCL, and NVIDIA’s AI SDK stack are fully functional on DGX Station on both Windows and Linux. Validate AI/DL workload performance—training, fine‑tuning, and inference—and work with the CUDA team to resolve gaps on the Arm + GB300 platform. - Application Validation:
Validate that NVIDIA AI applications—NIM microservices, Nemo Claw, AI Workbench, and developer tools—run correctly on DGX Station across Windows and Linux. Define and implement test plans covering single‑user and multi‑user scenarios, container runtimes, application installation flows, and developer workflows. - System Validation & Quality:
Drive the overall test strategy for DGX Station on Windows and Linux: functional testing, stress testing, power/thermal validation, sleep/resume and S‑state cycles, Windows Update and Linux kernel‑upgrade compatibility, and long‑duration reliability. Own bug triage and resolution across firmware, BMC, driver, and OS layers. - Partner Engagement:
Be the primary technical interface with Microsoft (Windows on Arm, WHQL, driver signing) and ODM/OEM partners shipping DGX Station. Coordinate schedules, resolve cross‑company technical blockers, and represent NVIDIA’s platform requirements on both OSes. - Performance Optimization:
Profile and optimize system performance—boot time, GPU compute throughput, NVLink‑C2C and memory bandwidth utilization, power efficiency, and thermal behavior. Identify bottlenecks across the stack on Windows and Linux and drive fixes with the appropriate teams. - Documentation & Enablement:
Create and maintain platform documentation for DGX Station on Windows and Linux: bring‑up guides, known issues, driver compatibility matrices, recovery and re‑imaging procedures, and developer setup…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).