Senior AI Inference Engineer Throughput LLM Serving
Job in
Santa Clara, Santa Clara County, California, 95053, USA
Listed on 2026-06-17
Listing for:
NVIDIA
Full Time
position Listed on 2026-06-17
Job specializations:
-
Software Development
AI Engineer (Applied/Software), Senior Developer, Machine Learning/ ML Engineer
Job Description & How to Apply Below
NVIDIA is seeking a Senior Software Engineer – AI Inference in Santa Clara, California. This role focuses on optimizing and contributing to open-source LLM serving engines like vLLM and SGLang. Candidates should have extensive experience in production software, solid systems engineering fundamentals, and strong programming skills in Python, C++, and/or CUDA.
The position offers a competitive salary range between $152,000 and $241,500 based on experience, alongside equity and benefits. NVIDIA values diversity and is an equal-opportunity employer.
#J-18808-LjbffrPosition Requirements
10+ Years
work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×