
Applied Researcher, AI Quality

Remote / Online - Candidates ideally in
Fort Wayne, Allen County, Indiana, 46801, USA
Listing for: GitHub, Inc.
Remote/Work from Home position
Listed on 2026-06-06
Job specializations:
  • IT/Tech
    Data Scientist, AI Engineer, Data Science Manager, Machine Learning/ ML Engineer
Salary/Wage Range or Industry Benchmark: 80,000 - 100,000 USD yearly
Job Description & How to Apply Below
Position: Staff Applied Researcher, AI Quality

Locations

In this role you can work from Remote, United States

Overview

At GitHub, we’re building the next generation of AI‑powered developer experiences. We’re looking for a Staff Applied Researcher with deep expertise in Large Language Model (LLM) evaluation and LLM agents, strong engineering instincts, and a bias for action to help shape the future of GitHub Copilot and our AI platform.

This is a high‑impact role where you will design evaluation systems that directly influence how millions of developers experience AI every day.

Responsibilities

Lead Model Quality & Evaluation
  • Design next‑generation evaluation frameworks for code generation, reasoning, safety, multimodal tasks, and agentic workflows.

  • Develop scalable automatic metrics, LLM‑judge systems, reward models, and human‑in‑the‑loop evaluation pipelines.

  • Establish high‑signal, repeatable methodologies that influence product decisions across GitHub AI.

Drive Applied Research & Engineering
  • Build and optimize evaluation tooling, datasets, benchmarking systems, and experimentation pipelines.

  • Create and onboard new benchmarks covering the hardest tasks for coding agents.

  • Collaborate closely with engineering teams to productionize research, validate improvements, and accelerate model iteration cycles.

  • Own end‑to‑end quality insights for the models behind GitHub Copilot and new AI features.

  • Work closely with product development, engineering, and design teams to integrate advanced research findings into practical applications, ensuring alignment with product goals and user needs.

Influence, Mentor & Lead
  • Shape GitHub’s strategy for model quality, alignment, and evaluation.

  • Mentor other researchers and engineers, helping elevate technical standards across the organization.

  • Drive clarity in ambiguous problem spaces and champion fast, high‑quality execution.

Qualifications

Required Qualifications

  • Bachelor's degree in Data Science, Mathematics, Physics, Statistics, Economics, Operations Research, Computer Science, or related field AND 8+ years' experience in data science (e.g., managing structured and unstructured data, applying statistical techniques) or related field
  • OR master's degree in Data Science, Mathematics, Physics, Statistics, Economics, Operations Research, Computer Science, or related field AND 6+ years' experience in data science (e.g., managing structured and unstructured data, applying statistical techniques) or related field
  • OR doctorate in Data Science, Mathematics, Physics, Statistics, Economics, Operations Research, Computer Science, or related field AND 4+ years' experience in data science (e.g., managing structured and unstructured data, applying statistical techniques) or related field
  • OR equivalent experience.
  • 3+ years of engineering experience in Python/TypeScript, including building production‑grade evaluation or data/ML pipelines at scale.
  • Proven track record shipping research or evaluation systems in production environments.
  • Strong cross‑functional communication and influence skills.

Preferred Qualifications

  • Experience with LLM judge systems, reward modeling, alignment, or safety evaluations.
  • Background in code generation, developer tools, or AI‑assisted programming.
  • Experience with large‑scale experimentation and online/offline evaluation strategies.
  • Open‑source contributions or experience working with developer communities.
  • Experience designing and leading complex research projects from ideation to implementation.
  • Ability to define and articulate data‑driven strategies that consider cross‑functional impacts and align with organizational priorities, particularly in a software development platform context.

Compensation

The base salary range for this job is USD $ - USD $ /Yr.

These pay ranges are intended to cover roles based across the United States. An individual's base pay depends on various factors, including geographical location and a review of the applicant's experience, knowledge, skills, and abilities. At GitHub, certain roles are eligible for benefits and additional rewards, including annual bonus and stock. These rewards are allocated based on individual impact in role.

In addition, certain roles also have the opportunity to earn sales…
