×
Register Here to Apply for Jobs or Post Jobs. X

Software Engineer, AI Data Platform

Job in San Francisco, San Francisco County, California, 94199, USA
Listing for: Labelbox
Full Time position
Listed on 2026-06-17
Job specializations:
  • Software Development
    AI Engineer (Applied/Software), Machine Learning/ ML Engineer
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below
Position: Staff Software Engineer, AI Data Platform

Staff Software Engineer, AI Data Platform Shape the Future of AI

At Labelbox, we're building the critical infrastructure that powers breakthrough AI models at leading research labs and enterprises. Since 2018, we've been pioneering data‑centric approaches that are fundamental to AI development, and our work becomes even more essential as AI capabilities expand exponentially.

About Labelbox

We're the only company offering three integrated solutions for frontier AI development:

  • Enterprise Platform & Tools
    :
    Advanced annotation tools, workflow automation, and quality control systems that enable teams to produce high-quality training data at scale
  • Frontier Data Labeling Service
    :
    Specialized data labeling through Alignerr, leveraging subject matter experts for next‑generation AI models
  • Expert Marketplace
    :
    Connecting AI teams with highly skilled annotators and domain experts for flexible scaling
Why Join Us
  • High-Impact Environment
    :
    We operate like an early‑stage startup, focusing on impact over process. You'll take on expanded responsibilities quickly, with career growth directly tied to your contributions.
  • Technical Excellence
    :
    Work at the cutting edge of AI development, collaborating with industry leaders and shaping the future of artificial intelligence.
  • Innovation at Speed
    :
    We celebrate those who take ownership, move fast, and deliver impact. Our environment rewards high agency and rapid execution.
  • Continuous Growth
    :
    Every role requires continuous learning and evolution. You'll be surrounded by curious minds solving complex problems at the frontier of AI.
  • Clear Ownership
    :
    You'll know exactly what you're responsible for and have the autonomy to execute. We empower people to drive results through clear ownership and metrics.
Role Overview

Labelbox is the RL data factory for advancing frontier agent capabilities. We build the data, evaluations, and infrastructure that frontier labs use to train and judge their agents. We're looking for talented, experienced engineers to join us. The bar is high: engineers who have strong judgment and set technical direction, quickly build prototypes that scale into the reliable systems, and are at the frontier of agent‑first engineering practices and innovating to accelerate the speed of the business.

What

you may work on
  • Eval systems that run millions of agent trajectories to measure model and product quality.
  • Fine‑tuning pipelines that turn evaluation signals into measurable agent improvements.
  • Agent‑first product surfaces: UX and infrastructure for workflows where the user is a model or an agent operator.
  • The systems behind hundreds of thousands of AI interviews used to source and match freelance workers to projects.
  • Infrastructure that scales to the throughput frontier labs actually need.
  • Integration of the latest models and capabilities into production within days of release.
What we're looking for
  • 4+ year track record of shipping systems customers and other engineers rely on
  • You build full stack prototypes fast and they hold up. The v1 you ship becomes the foundation the rest of the team builds on.
  • Strong system and API design judgement
  • Hard architecture and product calls land with you. You make them, defend them under pressure, and update fast when someone else is right.
  • You ship production code with coding agents daily. You know where they break and what it takes to make them reliable to further accelerate the team's velocity.
  • You set direction by being the example. Other engineers reach for your designs and your code as the reference.
  • You move fast in ambiguous, startup‑pace environments with influence over authority.
  • You have worked in all parts of the stack
  • Deep proficiency in Type Script and/or Python.
Nice to have
  • Production experience building LLM‑ or agent‑driven products.
  • Designing evaluations for LLMs and agents, or producing high‑quality data for ML systems.
  • Background in production distributed systems, ML infrastructure, or data systems at scale.
Our Tech Stack
  • Frontend: React.js with Redux, Type Script
  • Backend: Node.js, Type Script, Python, some Java & Kotlin
  • APIs: GraphQL

Labelbox strives to ensure pay parity across the organization and discuss compensation transparently. The…

To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary