
Research Scientist - Post Training

Job in San Francisco, San Francisco County, California, 94199, USA
Listing for: Ersilia
Apprenticeship/Internship position
Listed on 2026-06-26
Job specializations:
  • Research/Development
    AI Evaluation, AI Business & Operations, Research Scientist
Salary/Wage Range or Industry Benchmark: 250,000 - 450,000 USD yearly
Job Description & How to Apply Below

About Us

We build training data and evaluation infrastructure that frontier AI labs use to improve their models. We partner with the world's leading labs to design high‑signal datasets and run rigorous evaluations that go beyond static benchmarks. We're a small, early team (post‑Series A) where individual contributors have direct impact on how the next generation of models learns and improves.

The Role

We're building out our post‑training research team and hiring 2–3 Research Scientists to work together on this mission. Your job is to prove that our data works. You'll design and run training experiments that isolate the impact of our datasets on model behavior, including SFT and RL‑based post‑training, to measure how different data sources shift capability, generalization, and alignment. Working closely with partner labs, you'll turn our datasets into clear, defensible evidence that the data improves performance under these conditions.

It's experimental, high‑leverage work at the edge of model development.

What You'll Do
  • Run controlled SFT and RL experiments to measure the impact of our datasets on model performance.
  • Quantify lift across capabilities: reasoning, tool use, long‑horizon tasks, and domain‑specific workflows.
  • Share findings directly with partner labs to deepen relationships and drive sales.
  • Collaborate with internal SPLs to iterate on data quality based on your results.
  • Work closely with the other Research Scientists on this team to build shared experimental infrastructure and benchmarks.

What We're Looking For
  • Strong familiarity with LLM training and evaluation methodologies (SFT, RL post‑training).
  • Genuine obsession with how data structure, selection, and quality drive model behavior.
  • Ability to design lightweight experiments, move fast, and extract actionable insights from messy results.
  • Comfort working across domains—finance, software engineering, policy, and more.
  • A bias toward building over theorizing.

Nice‑to‑Have Requirements
  • Prior work or internship at an RL environment company, AI safety org, or benchmarking org.
  • Experience running controlled training experiments end‑to‑end.
  • Published research on model evaluation, post‑training, or data curation.
  • Strong software engineering skills alongside research instincts.

Compensation

US $250K–$450K total compensation + equity.
