×
Register Here to Apply for Jobs or Post Jobs. X

ML Runtime Optimization Engineer - Lead

Job in Sunnyvale, Santa Clara County, California, 94087, USA
Listing for: Applied Intuition
Full Time position
Listed on 2026-02-28
Job specializations:
  • Software Development
    AI Engineer, Machine Learning/ ML Engineer, Software Engineer
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below

About the role

We are looking for a lead software engineer with deep experience in optimizing ML models and deploying them on production-grade embedded runtime environments. You’ll work across the entire ML framework stack (e.g. PyTorch, JAX, ONNX, Tensor

RT, CUDA, XLA, Triton).

At Applied Intuition, you will:
  • Drive ML performance optimization on multiple technologies for on-road and off-road ADAS / AD stacks targeting deployment on a variety of embedded compute platforms
  • Bring technical leadership to the ML model performance optimization team
  • Develop compute usage strategies to optimize efficiency and latency of model inference for compute boards selected by our customers
  • Work on model pruning and quantization, and support deployment on memory constrained platforms
  • Collaborate closely with ML engineers and software developers on technical efforts to find and optimize efficient model architecture solutions
  • Set up methodologies to profile the model performance on target embedded compute platforms and identify performance bottlenecks as part of stack integration
We re looking for someone who has:
  • Bachelors in Electrical Engineering or Computer Science, OR

    B.Sc. in Computer Science, Mathematics, Physics or a related field
  • 5+ years of experience with ML accelerators, GPU, CPU, SoC architecture and micro-architecture
  • Strong software development skills with the focus on embedded programming
  • Experience profiling and optimizing model performance on embedded compute platforms
  • Experience in working with deep learning frameworks (e.g., PyTorch, JAX, ONNX, etc.)
Nice to have:
  • M.Sc or PhD in a ML related area
  • Built an ML optimization framework from scratch before
  • Deployed ML solutions to embedded chips for real time robotics applications

Compensation and benefits may include base salary, equity, and benefits such as health, dental, vision, life and disability insurance, 401k with employer match, learning and wellness stipends, and paid time off. The base salary range for this position is provided for transparency where legally required and can vary based on experience and location.

Location:
See job posting subtitle for location details.

Applied Intuition is an equal opportunity employer. We value diversity and are committed to creating an inclusive environment for all employees.

#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary