×
Register Here to Apply for Jobs or Post Jobs. X

Head of Research

Job in San Francisco, San Francisco County, California, 94199, USA
Listing for: GAMUT
Full Time position
Listed on 2026-06-25
Job specializations:
  • Software Development
    AI Reliability/ Performance Engineer, AI Engineer (Applied/Software)
Salary/Wage Range or Industry Benchmark: 50000 USD Yearly USD 50000.00 YEAR
Job Description & How to Apply Below

About Datawizz

Datawizz is building the agent workforce. We're a seed‑stage team backed by Human Capital (SpaceX, Snowflake, Anduril) building a platform that helps enterprises build and deploy agents — with the right permissions, guardrails, and activity logging to actually ship within companies. Early customers are already automating several hours of their workday.

We're hiring people who want to build, not manage. If you want to work on hard problems with a small team that ships, this is the place.

The Role

As Head of Research, you’ll own Datawizz’s research agenda and build an applied research organization focused on making agents radically more reliable and capable. You’ll define the roadmap across agent planning, tool orchestration, evaluation, and safety; lead hands‑on experimentation; and partner closely with engineering to product ionize breakthroughs that drive measurable quality and reliability wins for customers.

You will:
  • Set the research strategy and roadmap for agent planning, tool orchestration, multi‑agent coordination, and evaluation.
  • Build, lead, and mentor a high‑caliber applied research team.
  • Design and run rigorous experiments (ablations, offline/online A/Bs), defining clear metrics for reliability, accuracy, and performance.
  • Own our evaluation stack: datasets, benchmarks, human‑in‑the‑loop reviews, and reliability/safety assessments.
  • Develop novel methods (e.g., agent planning algorithms, tool selection strategies, multi‑agent coordination, safety/alignment techniques) and ship reference implementations.
  • Collaborate with engineering to transfer research into production and measure real‑world impact.
  • This role is in‑office, 5 days/week, based in San Francisco.
You might be a great fit if you have experience with:
  • Leading applied ML/NLP research teams and shipping work into production at a startup or high‑growth company.
  • LLM and agent internals: prompting strategies, tool use, planning/reasoning architectures, multi‑agent systems, and evaluation methodology.
  • Building evaluation frameworks (task suites, synthetic data, human eval pipelines) and tying metrics to product outcomes.
  • Large‑scale agent systems: orchestration frameworks, tool APIs, distributed execution, observability, and logging infrastructure.
  • Strong coding skills in Python and a bias toward hands‑on experimentation and rapid iteration.
  • Data curation and labeling workflows, with attention to privacy, safety, and robustness.
  • Communicating research clearly and partnering cross‑functionally with engineering and product.
  • (Nice to have) Publications or notable open‑source contributions; patents; early‑stage 0→1 experience.
Benefits
  • Competitive salary, based on experience level (Annual compensation range: $50,000-$500,000)
  • Meaningful equity
  • Opportunity to be a founding member of a growing company
#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary