More jobs:
Applied Research Scientist, Agents
Job in
San Francisco, San Francisco County, California, 94199, USA
Listed on 2026-06-12
Listing for:
Labelbox
Full Time
position Listed on 2026-06-12
Job specializations:
-
IT/Tech
AI Engineer (Applied/Software), Data Scientist, Machine Learning/ ML Engineer, Artificial Intelligence
Job Description & How to Apply Below
At Labelbox, we're building the critical infrastructure that powers breakthrough AI models at leading research labs and enterprises. Since 2018, we've been pioneering data-centric approaches that are fundamental to AI development, and our work becomes even more essential as AI capabilities expand exponentially.
About Labelbox
We're the only company offering three integrated solutions for frontier AI development:
Advanced annotation tools, workflow automation, and quality control systems that enable teams to produce high-quality training data at scale
Specialized data labeling through Alignerr, leveraging subject matter experts for next-generation AI models
Connecting AI teams with highly skilled annotators and domain experts for flexible scaling
- High-Impact Environment:
We operate like an early-stage startup, focusing on impact over process. You'll take on expanded responsibilities quickly, with career growth directly tied to your contributions. - Technical Excellence:
Work at the cutting edge of AI development, collaborating with industry leaders and shaping the future of artificial intelligence. - Innovation at Speed:
We celebrate those who take ownership, move fast, and deliver impact. Our environment rewards high agency and rapid execution. - Continuous Growth:
Every role requires continuous learning and evolution. You'll be surrounded by curious minds solving complex problems at the frontier of AI. - Clear Ownership:
You'll know exactly what you're responsible for and have the autonomy to execute. We empower people to drive results through clear ownership and metrics.
As an Applied Research Engineer at Labelbox, you'll sit at the junction of advanced AI research and real product impact, with a focus on the data that makes modern agents work-browser interactions, SWE/code traces, GUI sessions, and multi-turn workflows. You'll drive the data landscape required to advance capable, adaptable agents and help shape Labelbox's strategy for collecting, synthesizing, and evaluating it.
You will possess expertise in LLM agents and planning/execution loops, plus creativity in tackling problems across data design, interaction, and measurement. You'll publish meaningful results, collaborate with customer researchers in frontier AI labs, and turn prototypes into reliable, scalable features.
Your Impact
- Create frameworks and tools to construct, train, benchmark and evaluate autonomous agent capabilities.
- Design agent-focused data programs using supervised fine-tuning (SFT) and reinforcement learning (RL) methodologies.
- Develop data pipelines from diverse sources like code repositories, web browsers, and computer systems.
- Implement and adapt popular open-source agent libraries and benchmarks with proprietary datasets and models.
- Engage with research teams in frontier AI labs and the wider AI community to understand evolving agent data needs for frontier models and share best practices.
- Collaborate closely with frontier AI lab customers to understand requirements and guide model development.
- Publish research findings in academic journals, conferences, and blog posts.
- Ph.D. or Master's degree in Computer Science, Machine Learning, AI, or related field.
- At least 3 years of experience addressing sophisticated ML problems with successful delivery to customers.
- Experience building and training autonomous agents-tool use, structured outputs, multi-step planning-across browsers/GUI, codebases, and databases using SFT and RL.
- Constructed and evaluated agentic benchmarks (e.g. SWE-bench, Web Arena, τ-bench, OSWorld) and reliability/efficiency suites (e.g. WABER).
- Adept at interpreting research literature and quickly turning new ideas into prototypes.
- Deep understanding of frontier models (autoregressive, diffusion), post-training (SFT, RLVR, RLAIF, RLHF, et al.), and their human data requirements.
- Proficient in Python, data science libraries and deep learning frameworks (e.g., PyTorch, JAX, Tensor Flow).
- Strong analytical and problem-solving abilities in ambiguous situations.
- Excellent communication skills.
- Track record of publications in top-tier AI/ML venues (e.g., ACL, EMNLP, NAACL, NeurIPS, ICML, ICLR, etc.).
At Labelbox Applied Research, we're committed to pushing the boundaries of AI and data-centric machine learning, with a particular focus on advanced human-AI interaction techniques. We believe that high-quality human data and sophisticated human feedback integration methods are key to unlocking the next generation of AI capabilities. Our research team works at the intersection of machine learning, human-computer interaction, and AI ethics to develop innovative solutions that can be practically applied in real-world scenarios.
We foster an environment of intellectual curiosity, collaboration, and innovation. We encourage our researchers to explore new ideas, engage in open discussions, and…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×