LLM Inference Engineer - Scalable GPU Serving
Job in
Palo Alto, Santa Clara County, California, 94306, USA
Listed on 2026-07-03
Listing for:
Hippocratic AI
Full Time
position Listed on 2026-07-03
Job specializations:
-
Software Development
AI Engineer (Applied/Software), Machine Learning/ ML Engineer
Job Description & How to Apply Below
Hippocratic AI is looking for an experienced LLM Inference Engineer based in Palo Alto, CA, to optimize their large language model (LLM) serving infrastructure. You'll design multi-node serving architectures and implement advanced optimization techniques while collaborating with a talented team.
The ideal candidate has hands-on experience with inference optimization, proficiency in Python and C++, and knowledge of GPU systems. Join us to help shape the future of AI deployment at scale!
#J-18808-LjbffrTo View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×