Healthcare & Life Sciences AI Rater & Evaluator Job San Francisco area,California USA,Healthcare

Overview

LILT is building a global network of domain experts to support high-quality AI evaluation across training, benchmarking, red-teaming, and ongoing model monitoring. We are seeking healthcare and life sciences professionals to contribute expert judgment to human-in-the-loop AI evaluation workflows used by leading enterprises and hyperscalers.

This role is designed for professionals who understand how medical, clinical, scientific, and life sciences information is used in real-world healthcare and research environments and who can apply that expertise to evaluate, assess, and improve multilingual AI systems.

Your contribution of expertise will directly influence multilingual AI model quality, safety, and deployment readiness.

This role includes two distinct expert tracks, based on experience level and scope of responsibility.

Track A:
Healthcare & Life Sciences AI Rater

Raters execute structured evaluation tasks using clearly defined rubrics and instructions.

Responsibilities

Evaluate AI outputs related to healthcare, medical, and life sciences content
Perform structured scoring, comparison, classification, and judgment task
Assess clinical accuracy, scientific validity, completeness, and potential safety risk
Identify hallucinations, misleading medical guidance, unsupported claims, or unsafe recommendations
Apply domain-specific healthcare and life sciences guidelines consistently across tasks

Ideal Background

Healthcare professionals, clinical practitioners, life sciences researchers, or biomedical specialists
Experience interpreting medical literature, clinical guidelines, scientific research, or health data
Strong attention to detail and comfort working with structured evaluation criteria

Track B:
Healthcare & Life Sciences AI Evaluator (Senior Track)

Evaluators provide higher-level domain oversight and help shape how evaluation is performed.

Responsibilities

Validate and refine evaluation rubrics and edge-case handling
Perform adjudication where raters disagree
Conduct error analysis and qualitative reviews of model behavior
Partner with LILT research, product, and customer teams on evaluation design
Support red-teaming, clinical risk analysis, and model readiness assessments

Ideal Background

Senior clinicians, healthcare leaders, medical researchers, or life sciences subject matter experts
Experience defining standards, reviewing complex edge cases, or advising on clinical or scientific risk
Ability to clearly explain nuanced medical or scientific reasoning and tradeoffs

Evaluation Focus & Requirements Types of AI Evaluation Work

Depending on project demands, work may include:

Healthcare and medical content evaluation
Clinical reasoning and scientific accuracy assessment
Benchmarking and comparative model analysis
Safety-focused red-teaming and risk evaluation
Ongoing model monitoring and regression testing

What We Look For

Deep domain expertise in healthcare, medicine, or life sciences
Strong judgment and ability to apply criteria consistently
Comfort working with structured evaluation workflows
Ability to explain reasoning clearly, especially in safety-critical scenarios
Reliability, professionalism, and respect for quality standards

Engagement Model

Contract-based, flexible participation
Project-based work with clear expectations and timelines
Opportunities for recurring work based on performance and demand
Compensation communicated upfront per project or task type

Why This Work Matters

Your expertise helps ensure that AI systems:

Provide accurate and safe healthcare-related information
Align with clinical and scientific standards
Are trustworthy and responsible across languages

Language Requirements

Native or professional fluency in one or more supported languages is required
Supported languages span 30+ global languages
Language-specific nuance is assessed through screening and task-based evaluation, not separate job descriptions
English fluency is required for guidelines, feedback, and collaboration

AI is changing how the world communicates — and LILT is leading that transformation.

LILT's mission is to make the world's information available to everyone, no matter the language they speak. Join our global community who thrive on…


Increase/decrease your Search Radius (miles)



Job Posting Language