Senior Firmware Engineer – CSP Engagements
Listed on 2026-05-30
-
Software Development
AI Engineer, Software Engineer, Embedded Software Engineer, DevOps
NVIDIA is seeking a Senior Firmware Engineer to join our CSP Engagements team, focusing on system software for Datacenter products such as GB200. This role combines deep technical expertise in embedded firmware development with customer‑facing responsibilities to enable cloud service providers with next‑generation computing platforms. You will work at the intersection of hardware and software, driving technical solutions from concept through deployment.
Whatyou will be doing:
- Design and develop firmware solutions for manageability and observability of data center servers.
- Actively participate in hardware bring‑up activities, OOB firmware development, protocol stacks (Redfish, PLDM, MCTP, NSM) and hardware‑software co‑design for Cloud Service Provider deployments.
- Debug and troubleshoot NVIDIA GPU firmware issues, power management, performance, and thermal control problems for data center deployments, providing active support to CSPs.
- Partner directly with CSPs to deliver technical solutions, co‑develop & co‑debug features and optimizations, and provide support during new product introductions.
- Perform advanced system debugging, root cause analysis, and performance optimization for large‑scale data center environments.
- Collaborate with AE, FAE, and Solution Architect teams to deliver integrated customer solutions and technical documentation.
- Deep expertise in data center server architectures, HPC systems, and hardware‑software co‑design.
- Deep expertise in embedded firmware, server management controllers, and hardware bring‑up with proven track record of shipping production BMC solutions.
- Strong knowledge of DMTF protocols (Redfish, IPMI, PLDM, MCTP, SPDM), telemetry frameworks, and out‑of‑band management architectures.
- Expert‑level skills in C/C++ in resource‑constrained embedded environments, RTOS, device drivers, and low‑level protocols (I2C, SPI, UART, PCIe, MCTP).
- Experience with RAS including error handling, error injection, fault isolation, and system health monitoring.
- BS or MS in Computer Engineering, Computer Science, or related field (or equivalent experience).
- 8–12 years of system software development experience.
- Knowledge of cloud and cluster level deployment and management systems.
- Experience with GPU computing (CUDA), deep learning workloads.
- Knowledge of Memory fabric and CXL architectures.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5. You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until June 1, 2026.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).