Reinforcement Learning Research Engineer Job Kahului area,Hawaii USA,IT/Tech

Get AI-powered advice on this job and more exclusive features.

This range is provided by Strativ Group. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.

Base pay range

$/yr - $/yr

Direct message the job poster from Strativ Group

Location - Remote (US Based)

A scaling, SOTA Generative AI Startup operating with a world class team (Founders have multiple prior exits) with talent from Open AI, IBM, MIT and several top orgs, focused on pioneering work and advancements in large language models (LLMs), code generation, and code translation. Their projects directly involve industry leading partners where they’re applying advanced AI to solve meaningful, practical challenges with real-world impact.

Broad

Responsibilities

Build and maintain robust distributed training systems using PyTorch and JAX
Build and train production-ready reinforcement learning infrastructure
Develop orchestration tools that manage complex workflows across large-scale AI model training and evaluation.
Drive innovation by researching and developing scalable reinforcement learning (RL) algorithms and training paradigms for complex, high-dimensional optimization and decision-making tasks, including recent advancements in RL for feedback-driven optimization in LLMs.
Design and train large-scale RL environments for optimization problems spanning multiple industries.
Engage with frontier research through open-source projects and potential publications.

Requirements

2+ years of experience in distributed or decentralized RL (multi-agent preferred) using PyTorch and JAX.
Research experience with RL for high-dimensional optimization problems, particularly in multi-agent reinforcement learning settings.
Experience implementing advanced RL techniques such as task decomposition, hierarchical RL, goal-conditioned RL, or human-AI collaboration.
Experience deploying and managing multi-GPU training infrastructure at scale.
Eligible for TS/SCI clearance.

Get in touch today for more details and immediate consideration / interview!

Seniority level

Mid-Senior level

Employment type

Full-time

Job function

Research and Engineering

Software Development and Research Services

Referrals increase your chances of interviewing at Strativ Group by 2x

Apply BELOW