
ML Model Serving Engineer - Low-Latency Inference at Scale

Job in San Francisco, San Francisco County, California, 94199, USA
Listing for: Sesame
Full Time position
Listed on 2026-01-25
Job specializations:
  • Engineering
    AI Engineer, Data Engineer
Job Description & How to Apply Below
A cutting-edge technology company in San Francisco is seeking a Machine Learning Engineer to enhance its innovative voice companion technologies. You will optimize and build a high-performance ML serving layer, collaborating with engineers to create reliable and efficient systems. Ideal candidates will have deep expertise in PyTorch and performance engineering.

This role offers comprehensive employee benefits including health coverage, unlimited PTO, and 401k matching.