×
Register Here to Apply for Jobs or Post Jobs. X

Research Engineer in Automated Machine Learning

Job in Toronto, Ontario, C6A, Canada
Listing for: Preference Model
Full Time position
Listed on 2026-06-22
Job specializations:
  • Engineering
  • IT/Tech
    Machine Learning/ ML Engineer
Job Description & How to Apply Below
Join Preference Model as a Research Engineer specializing in automated ML and RL environments. Help shape the future of self-directed learning by building robust training infrastructures.

In this role, you'll be at the intersection of research and engineering, pushing the boundaries of post-training on large language models. You will train and evaluate models using proprietary RL environments, architect infrastructure, and implement methodologies for RL agents. Your contributions will have a direct impact on data quality and model capabilities.

Key Responsibilities:

• Train and evaluate models on proprietary RL environments

• Architect and optimize RL training infrastructure

• Design and test training environments and evaluation methodologies

• Profile and optimize training runs for maximum throughput

Requirements:

• Experience with end-to-end LLM post-training pipelines

• Proficiency in Python and PyTorch or JAX

• Familiarity with modern RL training frameworks

• Experience building ML infrastructure at scale

Bring your research and engineering skills to Preference Model and drive innovation in ML.
#J-18808-Ljbffr
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary