×
Register Here to Apply for Jobs or Post Jobs. X

Machine Learning Engineer; LLMs

Remote / Online - Candidates ideally in
London, UK
Listing for: AgileRL Ltd
Remote/Work from Home position
Listed on 2026-02-25
Job specializations:
  • Software Development
    Machine Learning/ ML Engineer, Software Engineer, AI Engineer, Data Engineer
Salary/Wage Range or Industry Benchmark: 150000 GBP Yearly GBP 150000.00 YEAR
Job Description & How to Apply Below
Position: Machine Learning Engineer (LLMs)
Machine Learning Engineer (Large Language Models)

We are seeking a talented and experienced Machine Learning Engineer to join our team. This engineer will contribute to the further development of Arena, a web-based software platform for LLM- and RLOps, and our open source reinforcement learning library.

As a Machine Learning Engineer, you will be responsible for designing, implementing, and maintaining the infrastructure, tools, and services that enable businesses to build and deploy reinforcement learning models efficiently and effectively.

Responsibilities
  • Collaborate with the team to understand requirements and design new features of the Arena platform and open source framework.
  • Develop scalable and reliable infrastructure to support LLM training, reinforcement fine tuning, model deployment, and management.
  • Integrate existing machine learning frameworks and libraries into the platform and open source framework, providing a range of algorithms, environments, and tools for reinforcement learning model development.
  • Stay up to date with the latest advancements in AI, MLOps, reinforcement learning algorithms, tools, and techniques, and incorporate them into the platform as appropriate.
  • Provide technical guidance and support to internal users and external customers using the Arena platform and open source framework.
Requirements
  • Master's or Ph.D. degree in Computer Science, Engineering, or a related field, or 3+ years of relevant industry experience.
  • Solid understanding of LLM training, reinforcement learning algorithms and concepts, with hands on experience in building and training AI models.
  • Strong programming skills, with experience using ML frameworks and libraries (e.g. PyTorch, Tensor Flow, Ray, Gym, TRL, Deep Speed, VLLM), and MLOps tools.
  • Experience in building machine learning platforms or tooling for industrial or enterprise settings.
  • Proficiency in data management techniques, including storage, retrieval, and pre processing of large scale datasets.
  • Familiarity with model deployment and management, including the development of APIs, deployment pipelines, and performance optimisation.
  • Experience in designing and developing cloud based infrastructure for distributed computing and scalable data processing.
  • Deep understanding of software engineering and machine learning principles and best practices.
  • Strong problem solving and communication skills, and the ability to work independently as well as in a team environment.
Compensation
  • Competitive salary + significant stock options.
  • 30 days of holiday, plus bank holidays, per year.
  • Flexible working from home and 6 month remote working policies.
  • Enhanced parental leave.
  • Learning budget of £500 per calendar year for books, training courses and conferences.
  • Company pension scheme.
  • Regular team socials and quarterly all company parties.
Join the fast-growing Agile

RL team and play a key role in the development of cutting edge reinforcement learning tooling and infrastructure.

Full name Email address Linked In Country of residence When can you start?

Note:

for the following longer form questions we have received an overwhelming number of applications with answers that are AI generated. Any application that uses AI generated answers will not be considered.

What motivates you to apply to this role, and what are you looking forward to in contributing towards the Agile

RL mission? (200 words max) What are 3 reinforcement learning capabilities or algorithmic improvements you believe would be most valuable to add to the Arena platform or Agile

RL open source framework, and what challenges would they help users solve? (200 words max) What unique experience do you have with developing, implementing, or researching reinforcement learning algorithms and systems that makes you the ideal candidate for this role? (200 words max) Upload your CV Upload File Max file size 10MB. I agree to the Privacy Policy

Build, tune and deploy RLmodels at lightning speed.

Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary