Research Scientist, Reinforcement Learning
Job in
Fremont, Alameda County, California, 94539, USA
Listed on 2026-06-01
Listing for:
Deeproute.ai
Full Time
position Listed on 2026-06-01
Job specializations:
-
Software Development
Machine Learning/ ML Engineer, Robotics, AI Engineer, Data Scientist
Job Description & How to Apply Below
You will work on applying RL in closed-loop, safety-critical environments, leveraging large-scale simulation and real-world driving data to improve safety, comfort, and robustness.
- Train and deploy RL policies in closed-loop driving environments
- Scale RL training using massively parallel simulation systems
- Design and optimize reward functions for complex driving behaviors
- Improve sim-to-real transfer for real-world robustness
- Collaborate with cross-functional teams to integrate models into production systems
Core Technical Skills
- Proficiency in modern RL algorithms: DQN, PPO, SAC, TD3, etc.
- Proficiency in modern RLHF algorithms: PPO, DPO, GRPO, etc.
- Hands-on experience training reward models and fine tuning LLM/VLM/VLA
- Knowledge of distributed RL training at scale
- Proficiency with massively parallel simulation environments
- Knowledge of sim-to-real transfer techniques and domain randomization
- Proficiency in Python, comfortable with C++
- Proficiency in deep learning frameworks such as Py Torch
- Experience with distributed training frameworks (Ray, Horovod, etc.)
- Knowledge of model optimization (quantization, pruning) and CUDA is a plus
- Knowledge of traffic rules, driving behavior modeling
- Publications in top-tier venues (ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV, ICRA, IROS, etc.)
- Open-source contributions to RL libraries or autonomous driving projects
- Previous experience with LLM fine-tuning using RLHF
- Knowledge of safe RL, interpretable AI, or robustness techniques
- Familiarity with autonomous vehicle regulations and safety standards
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×