×
Register Here to Apply for Jobs or Post Jobs. X

Artificial Intelligence Researcher

Job in Nottingham, Nottinghamshire, NG1, England, UK
Listing for: microTECH Global LTD
Full Time position
Listed on 2026-05-31
Job specializations:
  • IT/Tech
    Data Scientist, Machine Learning/ ML Engineer, AI Engineer
Job Description & How to Apply Below

This range is provided by micro

TECH Global LTD. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.

Base pay range

Direct message the job poster from micro

TECH Global LTD

Software and IT Consultant at micro

Tech Global LTD

Job Title:

AI Researcher

Location:

Cambridge or London, UK

This is a permanent position with candidates required to do hybrid working in either Cambridge or London.

Our client are looking for AI Researchers specialising in Reinforcement Learning with Human Feedback (RLHF) and Generative AI. In this role, you will design and optimise the algorithms that align large-scale generative models with human preferences, ensuring they are safe, controllable, and capable of producing high-quality outputs across multiple modalities. You’ll sit at the intersection of RL, LLMs, and generative modelling, helping us build the next generation of foundation models.

Responsibilities
  • Develop and refine RLHF algorithms for large language and generative models.
  • Research and implement deep reinforcement learning methods (policy gradients, actor‑critic, off‑policy learning) for model alignment.
  • Train, fine‑tune, and evaluate LLMs and diffusion models at scale.
  • Design experiments to align generative outputs with human and organisational preferences.
  • Collaborate with researchers, engineers, and human feedback teams to build scalable alignment pipelines.
  • Publish findings in top‑tier AI conferences and contribute to open‑source frameworks.
Key Requirements
  • PhD in Computer Science, Machine Learning, or related field.
  • Publications at NeurIPS, ICML, ICLR, ACL, or related venues.
  • Deep expertise in Reinforcement Learning (policy optimisation, reward modelling, RLHF).
  • Strong knowledge of deep learning frameworks (PyTorch, JAX, Tensor Flow).
  • Proficiency in Python and standard ML libraries.
  • Solid foundations in probability, optimisation, and statistics.
  • Experience working with large‑scale distributed training on GPUs/TPUs.
Seniority level

Mid‑Senior level

Employment type

Full‑time

Job function

Software Development, IT System Custom Software Development, Semiconductor Manufacturing

Referrals increase your chances of interviewing at micro

TECH Global LTD by 2x

Apply BELOW

Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary