Job Description & How to Apply Below
Lead advancements in AI evaluation as a Senior Research Scientist with a focus on model assessment. This role combines prototype development and robust analysis to enhance LLM capabilities.
Your primary goal will be to innovate significant evaluation methods that mirror and propel the capabilities of large language models. You will drive the creation of evaluation benchmarks while working alongside cross-functional teams to improve AI accuracy and efficiency. Your software engineering acumen will be crucial in constructing tools for comprehensive LLM performance analysis.
Key Responsibilities:
• Develop next-gen evaluation techniques for AI models
• Create ambitious benchmarks for assessing LLM performance
• Work collaboratively to deliver reliable evaluation frameworks
• Advance state-of-the-art research in evaluation methods
• Build scalable tools for performance insights
Requirements:
• Solid background in software engineering
• Familiarity with LLM outputs and data quality control
• Experience in measurement protocols for AI capabilities
• Ability to rapidly prototype evaluation techniques
• Encouraged to apply even with differing experiences
Help steer the future of AI evaluation through innovative methods and rigorous analysis.
#J-18808-Ljbffr
Position Requirements
10+ Years
work experience
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×