
Senior Machine Learning Engineer (Multimodal Perception, LLM/VLM)

Job in San Francisco, San Francisco County, California, 94199, USA
Listing for: Waymo
Apprenticeship/Internship position
Listed on 2026-05-29
Job specializations:
  • Engineering
    Computer Science, AI Engineer (Applied/Software), Artificial Intelligence, Robotics
Salary/Wage Range or Industry Benchmark: 80,000 - 100,000 USD yearly
Job Description & How to Apply Below
Position: Senior Machine Learning Engineer (Multimodal Perception, LLM/VLM)

Requirements

  • BS or MS in Computer Vision, Machine Learning, Robotics, or a related field
  • 4+ years of applied industry experience in autonomous vehicles, robotics, or complex ML systems
  • Fluency in Python or C++, with deep hands-on expertise in PyTorch or JAX for matrix manipulation and module implementation
  • Deep understanding and proven practical experience with model distillation frameworks and quantization techniques for real-time compute constraints
  • Demonstrated hands-on experience building, training, or deploying Multimodal Foundation Models or Vision-Language Models (VLMs)
  • (Desirable) PhD in Computer Vision, Machine Learning, Robotics, or a related field
  • (Desirable) Hands-on experience managing and optimizing large-scale teacher-student training loops
  • (Desirable) A proven track record of successfully deploying Vision-Language queries in highly constrained, real-time environments
  • (Desirable) Experience with large-scale distributed training, Parameter-Efficient Fine-Tuning (PEFT), or Reinforcement Learning from Human Feedback (RLHF) for Foundation Models and VLMs
  • (Desirable) Deep expertise in long-context temporal reasoning for sequential decision-making or complex video understanding
  • (Desirable) First-author publications in premier computer vision and machine learning conferences, such as CVPR, NeurIPS, ICCV, or ECCV
What the job involves
  • The Semantics team is a specialized subgroup within the Perception organization whose mission is to bring the immense reasoning power and innate world knowledge of massive foundation models directly onto the Waymo Driver
  • We focus on building an onboard multi-task, multimodal perception model designed to tackle highly complex and unpredictable "long-tail" scenarios
  • Architect and train large-scale, onboard ML perception models that are instrumental to ensuring vehicle safety and regulatory compliance
  • Drive cross-functional collaboration to engineer robust, high-reliability training pipelines within a dynamic, rapid-delivery environment
  • Leverage deep computer vision expertise to design novel, custom architectures from first principles to solve complex perception challenges
  • Contribute to a vibrant and positive team culture where diverse skill sets and backgrounds are valued. Support the growth of junior engineers and foster a high-performing, collaborative team environment
Position Requirements
10+ Years work experience