Categories Research/Development Jobs by Country Newest Jobs Canada

Senior Scientist LLM Evaluation Methods

Job in Toronto, Ontario, C6A, Canada

Listing for: Cohere

Full Time position
Listed on 2026-06-07

Job specializations:

Research/Development

Salary/Wage Range or Industry Benchmark: 80000 - 100000 CAD Yearly CAD 80000.00 100000.00 YEAR

Position: Senior Scientist for LLM Evaluation Methods
Lead advancements in AI evaluation as a Senior Research Scientist with a focus on model assessment. This role combines prototype development and robust analysis to enhance LLM capabilities.

Your primary goal will be to innovate significant evaluation methods that mirror and propel the capabilities of large language models. You will drive the creation of evaluation benchmarks while working alongside cross-functional teams to improve AI accuracy and efficiency. Your software engineering acumen will be crucial in constructing tools for comprehensive LLM performance analysis.

Key Responsibilities:

• Develop next-gen evaluation techniques for AI models

• Create ambitious benchmarks for assessing LLM performance

• Work collaboratively to deliver reliable evaluation frameworks

• Advance state-of-the-art research in evaluation methods

• Build scalable tools for performance insights

Requirements:

• Solid background in software engineering

• Familiarity with LLM outputs and data quality control

• Experience in measurement protocols for AI capabilities

• Ability to rapidly prototype evaluation techniques

• Encouraged to apply even with differing experiences

Help steer the future of AI evaluation through innovative methods and rigorous analysis.
#J-18808-Ljbffr

Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
View / Apply for Jobs
Matching My Jurisdiction