Research Engineer; RL Environment
Job in
Greater London, London, Greater London, W1B, England, UK
Listed on 2026-06-23
Listing for:
Axiōma Search
Full Time
position Listed on 2026-06-23
Job specializations:
-
Software Development
Backend Developer, Python, AI Engineer (Applied/Software)
Job Description & How to Apply Below
Location: Greater London
Overview
Agents do not improve in a vacuum. They need environments to operate in, tasks to solve, and clear signals for what good looks like. This role exists to build that layer.
This is a VC-backed challenger lab building state-of-the-art computer‑use agents. Recent progress has made performance highly competitive on computer‑use style benchmarks, and the company has launched a more visible product layer to make the technology easier to demonstrate.
The team you'd be joining builds the playground itself: synthetic websites, structured workflows, task sets, and evaluation environments where agents can act, fail, retry, and learn.
Responsibilities- Build training and evaluation environments for agentic systems.
- Create synthetic websites, workflows, and task suites that reflect useful real‑world work.
- Define reward signals and success criteria for agent behavior in structured environments.
- Turn documentation, tools, and existing workflows into interactive agent tasks.
- Improve the realism, coverage, and difficulty of training environments over time.
- Partner with research teams to convert product failures into better environments and tasks.
- Build internal tooling to generate, run, and measure large task sets.
- Strong software engineering skills, ideally in Python plus web or backend systems.
- Experience with RL, reward design, or synthetic data generation.
- Experience building internal tools, simulations, evaluation systems, or synthetic environments.
- Ability to structure ambiguous workflows into clear tasks with measurable outcomes.
- Good product instinct for what makes an environment realistic and useful for agents.
- Comfortable working at the intersection of engineering, research, and experimentation.
- High ownership and a practical mindset.
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×