Deep Learning Performance Architect - College Graduate
Job in Santa Clara, Santa Clara County, California, 95050, USA
Listed on 2026-03-04
Listing for: NVIDIA
Full-time position
Job specializations:
- IT/Tech: AI Engineer, Machine Learning/ML Engineer, Computer Science, Data Scientist
- Engineering: AI Engineer, Computer Science
Job Summary:
NVIDIA is looking for a Deep Learning Performance Architect to contribute to the development of next-generation architectures that accelerate AI and high-performance computing applications. The role involves developing innovative architectures, analyzing performance trade-offs, and collaborating with various teams to guide deep learning hardware and software direction.
Responsibilities:
• Develop innovative architectures to extend the state of the art in deep learning performance and efficiency
• Analyze performance, cost and power trade-offs by developing analytical models, simulators and test suites
• Understand and analyze the interplay of hardware and software architectures on future algorithms, programming models and applications
• Develop, analyze, and harness groundbreaking Deep Learning frameworks, libraries, and compilers
• Actively collaborate with software, product and research teams to guide the direction of deep learning HW and SW
Qualifications:
Required:
• MS or PhD in Computer Science, Computer Engineering, Electrical Engineering or equivalent experience
• Strong background in GPU or Deep Learning ASIC architecture for training and/or inference
• Experience with performance modeling, architecture simulation, profiling, and analysis
• Solid foundation in machine learning and deep learning
• Strong programming skills in Python, C, C++
Preferred:
• Background with deep neural network training, inference and optimization in leading frameworks (e.g. PyTorch, JAX, TensorRT)
• Experience with relevant libraries, compilers, and languages: cuDNN, cuBLAS, CUTLASS, MLIR, Triton, CUDA, OpenCL
• Experience with the architecture of or workload analysis on other DL accelerators
• Demonstrated self-motivation, critical thinking, and creative problem-solving
Company:
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI. Founded in 1993, the company is headquartered in Santa Clara, California, USA, and has more than 10,000 employees. The listing classifies the company as late stage.