ML Runtime Optimization Engineer - Lead
Listed on 2026-02-28
-
Software Development
AI Engineer, Machine Learning/ ML Engineer, Software Engineer
About the role
We are looking for a lead software engineer with deep experience in optimizing ML models and deploying them on production-grade embedded runtime environments. You’ll work across the entire ML framework stack (e.g. PyTorch, JAX, ONNX, Tensor
RT, CUDA, XLA, Triton).
- Drive ML performance optimization on multiple technologies for on-road and off-road ADAS / AD stacks targeting deployment on a variety of embedded compute platforms
- Bring technical leadership to the ML model performance optimization team
- Develop compute usage strategies to optimize efficiency and latency of model inference for compute boards selected by our customers
- Work on model pruning and quantization, and support deployment on memory constrained platforms
- Collaborate closely with ML engineers and software developers on technical efforts to find and optimize efficient model architecture solutions
- Set up methodologies to profile the model performance on target embedded compute platforms and identify performance bottlenecks as part of stack integration
- Bachelors in Electrical Engineering or Computer Science, OR
B.Sc. in Computer Science, Mathematics, Physics or a related field - 5+ years of experience with ML accelerators, GPU, CPU, SoC architecture and micro-architecture
- Strong software development skills with the focus on embedded programming
- Experience profiling and optimizing model performance on embedded compute platforms
- Experience in working with deep learning frameworks (e.g., PyTorch, JAX, ONNX, etc.)
- M.Sc or PhD in a ML related area
- Built an ML optimization framework from scratch before
- Deployed ML solutions to embedded chips for real time robotics applications
Compensation and benefits may include base salary, equity, and benefits such as health, dental, vision, life and disability insurance, 401k with employer match, learning and wellness stipends, and paid time off. The base salary range for this position is provided for transparency where legally required and can vary based on experience and location.
Location:
See job posting subtitle for location details.
Applied Intuition is an equal opportunity employer. We value diversity and are committed to creating an inclusive environment for all employees.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).