×
Register Here to Apply for Jobs or Post Jobs. X

Senior Research Scientist, Model Evaluation

Job in Toronto, Ontario, C6A, Canada
Listing for: Cohere
Full Time position
Listed on 2026-01-06
Job specializations:
  • Research/Development
Job Description & How to Apply Below

Senior Research Scientist, Model Evaluation

Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises building AI systems that power content generation, semantic search, RAG, and agents. We believe our work is instrumental to the widespread adoption of AI and that each person on the team contributes to increasing the capabilities of our models and the value they bring to customers.

Why this role?

Evaluation is critical to making progress in scaling intelligence. As models become superhuman in many real-world use cases, we continue to develop new evaluation techniques that accurately reflect current capabilities and set the agenda for future progress. In this role you will create next‑generation evaluation methods and infrastructure to measure LLM progress.

Responsibilities
  • Create ambitious new evaluation benchmarks that push the limits of what our models can accomplish.
  • Work cross‑functionally with teams to translate model feedback into trustworthy, repeatable evaluations.
  • Conduct research to advance the state-of-the-art in LLM evaluation methods, including training LLM judges, refining LLM‑based data synthesis pipelines, and improving evaluation efficiency.
  • Build scalable and reusable tools for digging into model performance.
Qualifications
  • Rapidly build prototypes that demonstrate LLM boundaries and develop resources to measure those capabilities.
  • Have spent significant time reviewing complex data and LLM outputs to ensure high data quality.
  • Are obsessive about rigorously measuring AI capabilities and ensuring measurements align with desired outcomes.
  • Have strong software engineering skills.

If some of the above doesn’t line up perfectly with your experience, we still encourage you to apply!

Inclusive Hiring

We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.

Perks
  • Open and inclusive culture and work environment.
  • Work closely with a team on the cutting edge of AI research.
  • Weekly lunch stipend, in‑office lunches & snacks.
  • Full health and dental benefits, including a separate budget for mental health.
  • 100% parental leave top‑up for up to 6 months.
  • Personal enrichment benefits towards arts, culture, fitness, well‑being, quality time, and workspace improvement.
  • Remote‑flexible offices in Toronto, New York, San Francisco, London, and Paris, plus a co‑working stipend.
  • 6 weeks of vacation (30 working days).
Seniority Level

Mid‑Senior level

Employment Type

Full‑time

Job Function

Other. Industries:
Software Development

#J-18808-Ljbffr
Position Requirements
10+ Years work experience
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary