Senior Deep Learning Compiler Engineer - PyTorch
Listed on 2025-10-08
IT/Tech
AI Engineer, Machine Learning/ML Engineer
Overview
Join to apply for the Senior Deep Learning Compiler Engineer - PyTorch role at NVIDIA
NVIDIA is seeking passionate engineers to build the next generation of tools used by AI developers and researchers worldwide. Our team is developing Thunder, an ambitious, source-to-source compiler built to unlock outstanding performance for PyTorch models on NVIDIA GPUs. This is a unique opportunity to contribute to a project that enhances the PyTorch ecosystem, working with modern compiler stacks like PyTorch 2.0's TorchDynamo and TorchInductor to create powerful, open-source solutions that benefit the entire community.
If you are driven to solve complex problems and want to make a foundational impact on the AI ecosystem, apply to join our collaborative and innovative team.
As a key member of our team, you will be contributing directly to the future of accelerated AI. Your role will be dynamic and deeply technical, placing you at the center of compiler innovation. You will lead the design, implementation, optimization, and maintenance of the core compiler technologies that accelerate massive deep learning workloads. This is a highly collaborative role where you'll work alongside the very engineers who built PyTorch for NVIDIA hardware, helping to pioneer new features and stay at the forefront of framework development.
You'll dive deep into performance analysis, scrutinizing workloads running on thousands of GPUs to find optimization opportunities that will shape the future design of Thunder. Furthermore, you will be part of a vibrant ecosystem, working closely with leading compiler, library, and systems teams (including experts behind nvFuser, TVM, XLA, and CUDA) to translate the latest research into practical, high-impact solutions for the open-source community.
We Need To See
We are looking for engineers who are excited about building powerful, user-centric tools and are comfortable working in a fast-paced, collaborative environment. Here is some of the expertise we would like to see:
- A Bachelor's, Master's, or Ph.D. in Computer Science or a related technical field (or equivalent experience).
- 8+ years of relevant work experience.
- A strong command of Python and experience building complex, well-tested software systems.
- Hands-on experience with deep learning frameworks like PyTorch or JAX. You understand how models are built and where the performance challenges lie.
- A solid foundation in compiler concepts such as abstract syntax trees (ASTs), intermediate representations (e.g., SSA form), program analysis, and code generation.
- Excellent communication and collaboration skills, essential for working effectively in a distributed, open-source environment.
- Previous contributions to deep learning compiler projects (e.g., TVM, MLIR, IREE) or deep learning frameworks themselves.
- Deep expertise in the internals of PyTorch, particularly its compiler stack (TorchDynamo, TorchInductor).
- Experience with JAX-like functional transformations and their application in a compiler context.
- Familiarity with parallel programming, distributed systems, and writing high-performance CUDA code.
- A track record of impactful participation in open-source communities, such as through code contributions, design discussions, or mentorship.
NVIDIA is at the forefront of breakthroughs in Artificial Intelligence, High-Performance Computing, and Visualization. Our teams are composed of driven, innovative professionals dedicated to pushing the boundaries of technology. We offer highly competitive salaries, an extensive benefits package, and a work environment that promotes diversity, inclusion, and flexibility. As an equal opportunity employer, we are committed to fostering a supportive and empowering workplace for all.
Employment type: Full-time
Seniority level: Mid-Senior level