×
Register Here to Apply for Jobs or Post Jobs. X

Reinforcement Learning Research Engineer

Job in Kahului, Maui County, Hawaii, 96732, USA
Listing for: Strativ Group
Full Time position
Listed on 2026-06-07
Job specializations:
  • IT/Tech
    AI Engineer, Data Scientist
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below

Get AI-powered advice on this job and more exclusive features.

This range is provided by Strativ Group. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.

Base pay range

$/yr - $/yr

Direct message the job poster from Strativ Group

Location - Remote (US Based)

A scaling, SOTA Generative AI Startup operating with a world class team (Founders have multiple prior exits) with talent from Open AI, IBM, MIT and several top orgs, focused on pioneering work and advancements in large language models (LLMs), code generation, and code translation. Their projects directly involve industry leading partners where they’re applying advanced AI to solve meaningful, practical challenges with real-world impact.

Broad

Responsibilities
  • Build and maintain robust distributed training systems using PyTorch and JAX
  • Build and train production-ready reinforcement learning infrastructure
  • Develop orchestration tools that manage complex workflows across large-scale AI model training and evaluation.
  • Drive innovation by researching and developing scalable reinforcement learning (RL) algorithms and training paradigms for complex, high-dimensional optimization and decision-making tasks, including recent advancements in RL for feedback-driven optimization in LLMs.
  • Design and train large-scale RL environments for optimization problems spanning multiple industries.
  • Engage with frontier research through open-source projects and potential publications.
Requirements
  • 2+ years of experience in distributed or decentralized RL (multi-agent preferred) using PyTorch and JAX.
  • Research experience with RL for high-dimensional optimization problems, particularly in multi-agent reinforcement learning settings.
  • Experience implementing advanced RL techniques such as task decomposition, hierarchical RL, goal-conditioned RL, or human-AI collaboration.
  • Experience deploying and managing multi-GPU training infrastructure at scale.
  • Eligible for TS/SCI clearance.

Get in touch today for more details and immediate consideration / interview!

Seniority level

Mid-Senior level

Employment type

Full-time

Job function

Research and Engineering

Software Development and Research Services

Referrals increase your chances of interviewing at Strativ Group by 2x

Apply BELOW

To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary