Reinforcement Learning Research Engineer
Listed on 2026-06-07
-
IT/Tech
AI Engineer, Data Scientist
Get AI-powered advice on this job and more exclusive features.
This range is provided by Strativ Group. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.
Base pay range$/yr - $/yr
Direct message the job poster from Strativ Group
Location - Remote (US Based)
A scaling, SOTA Generative AI Startup operating with a world class team (Founders have multiple prior exits) with talent from Open AI, IBM, MIT and several top orgs, focused on pioneering work and advancements in large language models (LLMs), code generation, and code translation. Their projects directly involve industry leading partners where they’re applying advanced AI to solve meaningful, practical challenges with real-world impact.
BroadResponsibilities
- Build and maintain robust distributed training systems using PyTorch and JAX
- Build and train production-ready reinforcement learning infrastructure
- Develop orchestration tools that manage complex workflows across large-scale AI model training and evaluation.
- Drive innovation by researching and developing scalable reinforcement learning (RL) algorithms and training paradigms for complex, high-dimensional optimization and decision-making tasks, including recent advancements in RL for feedback-driven optimization in LLMs.
- Design and train large-scale RL environments for optimization problems spanning multiple industries.
- Engage with frontier research through open-source projects and potential publications.
- 2+ years of experience in distributed or decentralized RL (multi-agent preferred) using PyTorch and JAX.
- Research experience with RL for high-dimensional optimization problems, particularly in multi-agent reinforcement learning settings.
- Experience implementing advanced RL techniques such as task decomposition, hierarchical RL, goal-conditioned RL, or human-AI collaboration.
- Experience deploying and managing multi-GPU training infrastructure at scale.
- Eligible for TS/SCI clearance.
Get in touch today for more details and immediate consideration / interview!
Seniority levelMid-Senior level
Employment typeFull-time
Job functionResearch and Engineering
Software Development and Research Services
Referrals increase your chances of interviewing at Strativ Group by 2x
Apply BELOW
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).