More jobs:
Senior Software QA Test Development Engineer - Diagnostics
Job in
Santa Clara, Santa Clara County, California, 95053, USA
Listed on 2026-05-30
Listing for:
NVIDIA Gruppe
Full Time
position Listed on 2026-05-30
Job specializations:
-
Software Development
DevOps, Software Engineer, Cloud Engineer - Software
Job Description & How to Apply Below
What you’ll be doing:
- Responsible for the development and execution of NVIDIA HGX/DGX/MGX platform test plan on servers, OS, firmware and CUDA software stack from design doc.
- Installing and testing various systems OS, server firmware and software stack.
- Drive support for root cause analysis on reliability and validation test failures to identify root cause(s) and achieve mitigation.
- Build, develop and debug server and OS level automation front‑end and back‑end framework and tests.
- Review partner and supplier test results and prescribe additional reliability testing on components, servers, and packaging as needed.
- Work in an agile software development team with very high production quality standards.
- Manage bug lifecycle and collaborate with inter‑groups to drive solutions.
- Bachelor’s Degree (or equivalent experience) in a STEM field.
- 5+ years proven experience; or master’s degree.
- Proven years of OS and server level automation, CI/CD process and Dev Ops experience using Python, SHELL, Ansible, Jenkins, C/C++, Java, JavaScript.
- Strong server and Linux (Ubuntu, Red Hat, CentOS, SuSE, Fedora and etc.) troubleshooting and debugging experience in a bare‑metal and KVM/VMWare/Hyper‑V environment.
- Good knowledge and hands‑on experience in model testing, AI tools/frameworks (Tensor Flow, PyTorch, Cursor and etc.), NLP and LLM benchmarking.
- Experience in using AI development tools for test plans creation, test cases development and test cases automation.
- Strong experience in firmware, BMC/OpenBMC, network protocol, internal/external enterprise storage devices, PCIe buses and devices, IO sub‑devices, CPU and memory, ACPI, UEFI spec, Redfish – huge plus.
- Proven years of experience in Git Hub/Gitlab/Gerrit, PXE, SLURM, Stack/Kubernetes/Docker – huge plus.
- AI related tools, LLM and NLP.
- Experience working with NVIDIA GPU hardware is a strong plus.
- Good to have solid understanding of virtualization in Linux (KVM, Docker orchestrated with Kubernetes).
- Background in parallel programming ideally CUDA/OpenCL is a plus.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 140,000 USD - 224,250 USD for Level 3, and 168,000 USD - 270,250 USD for Level 4.
You will also be eligible for equity and benefits.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. We do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
#J-18808-LjbffrPosition Requirements
10+ Years
work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×