×
Register Here to Apply for Jobs or Post Jobs. X

Healthcare & Life Sciences AI Rater & Evaluator

Job in San Francisco, San Francisco County, California, 94199, USA
Listing for: LILT (Production)
Full Time position
Listed on 2026-03-01
Job specializations:
  • Healthcare
    Clinical Research, Medical Science, Data Scientist, Health Science
Salary/Wage Range or Industry Benchmark: 60000 - 80000 USD Yearly USD 60000.00 80000.00 YEAR
Job Description & How to Apply Below

Overview

LILT is building a global network of domain experts to support high-quality AI evaluation across training, benchmarking, red-teaming, and ongoing model monitoring. We are seeking healthcare and life sciences professionals to contribute expert judgment to human-in-the-loop AI evaluation workflows used by leading enterprises and hyperscalers.

This role is designed for professionals who understand how medical, clinical, scientific, and life sciences information is used in real-world healthcare and research environments and who can apply that expertise to evaluate, assess, and improve multilingual AI systems.

Your contribution of expertise will directly influence multilingual AI model quality, safety, and deployment readiness.

This role includes two distinct expert tracks, based on experience level and scope of responsibility.

Track A:
Healthcare & Life Sciences AI Rater

Raters execute structured evaluation tasks using clearly defined rubrics and instructions.

Responsibilities
  • Evaluate AI outputs related to healthcare, medical, and life sciences content

  • Perform structured scoring, comparison, classification, and judgment task

  • Assess clinical accuracy, scientific validity, completeness, and potential safety risk

  • Identify hallucinations, misleading medical guidance, unsupported claims, or unsafe recommendations

  • Apply domain-specific healthcare and life sciences guidelines consistently across tasks

Ideal Background
  • Healthcare professionals, clinical practitioners, life sciences researchers, or biomedical specialists

  • Experience interpreting medical literature, clinical guidelines, scientific research, or health data

  • Strong attention to detail and comfort working with structured evaluation criteria

Track B:
Healthcare & Life Sciences AI Evaluator (Senior Track)

Evaluators provide higher-level domain oversight and help shape how evaluation is performed.

Responsibilities
  • Validate and refine evaluation rubrics and edge-case handling

  • Perform adjudication where raters disagree

  • Conduct error analysis and qualitative reviews of model behavior

  • Partner with LILT research, product, and customer teams on evaluation design

  • Support red-teaming, clinical risk analysis, and model readiness assessments

Ideal Background
  • Senior clinicians, healthcare leaders, medical researchers, or life sciences subject matter experts

  • Experience defining standards, reviewing complex edge cases, or advising on clinical or scientific risk

  • Ability to clearly explain nuanced medical or scientific reasoning and tradeoffs

Evaluation Focus & Requirements Types of AI Evaluation Work

Depending on project demands, work may include:

  • Healthcare and medical content evaluation

  • Clinical reasoning and scientific accuracy assessment

  • Benchmarking and comparative model analysis

  • Safety-focused red-teaming and risk evaluation

  • Ongoing model monitoring and regression testing

What We Look For
  • Deep domain expertise in healthcare, medicine, or life sciences

  • Strong judgment and ability to apply criteria consistently

  • Comfort working with structured evaluation workflows

  • Ability to explain reasoning clearly, especially in safety-critical scenarios

  • Reliability, professionalism, and respect for quality standards

Engagement Model
  • Contract-based, flexible participation

  • Project-based work with clear expectations and timelines

  • Opportunities for recurring work based on performance and demand

  • Compensation communicated upfront per project or task type

Why This Work Matters

Your expertise helps ensure that AI systems:

  • Provide accurate and safe healthcare-related information

  • Align with clinical and scientific standards

  • Are trustworthy and responsible across languages

Language Requirements
  • Native or professional fluency in one or more supported languages is required

  • Supported languages span 30+ global languages

  • Language-specific nuance is assessed through screening and task-based evaluation, not separate job descriptions

  • English fluency is required for guidelines, feedback, and collaboration

AI is changing how the world communicates — and LILT is leading that transformation.

LILT's mission is to make the world's information available to everyone, no matter the language they speak. Join our global community who thrive on…

To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary