More jobs:
Job Description & How to Apply Below
About
The Role
AI Trainer jobs in Canada focus on improving large language models through RLHF-style evaluations, prompt evaluation, data labeling, and QA evaluation across real AI/ML training pipelines. You will follow detailed annotation guidelines, verify training data quality, and provide structured feedback that improves helpfulness, correctness, and safety.
What You’ll Do
Perform RLHF evaluations (pairwise ranking, rubric-based scoring) and write clear rationales
Execute prompt evaluation for instruction-following, factuality, and safety
Label and validate datasets for NLP and content safety labeling
Run QA evaluation checks (consistency, agreement, systematic error discovery)
Document edge cases and build error taxonomies to drive model performance improvement
Collaborate on rubrics, gold sets, calibration, and regression testing for model updates
Skills And Qualifications
Experience with structured evaluation and guideline-driven judgment
Strong writing and documentation for rationales and edge-case notes
Familiarity with RLHF, LLM evaluation, and prompt evaluation workflows
Comfort with data labeling, QA evaluation, and training data quality processes
Bonus: multilingual evaluation, NER/classification tasks, or multimodal evaluation
How To Apply
Browse roles on Rex.zone and apply with a resume highlighting RLHF, data labeling, QA evaluation, annotation guidelines compliance, and examples of model evaluation feedback.
Hourly base pay range: $30–$50.
#J-18808-Ljbffr
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×