×
Register Here to Apply for Jobs or Post Jobs. X

Applied Scientist, Observability and Triage, Prime Video

Job in Greater London, London, Greater London, W1B, England, UK
Listing for: Amazon
Full Time position
Listed on 2026-06-04
Job specializations:
  • IT/Tech
    Machine Learning/ ML Engineer, Data Scientist, Data Analyst, AI Engineer
Salary/Wage Range or Industry Benchmark: 80000 - 100000 GBP Yearly GBP 80000.00 100000.00 YEAR
Job Description & How to Apply Below
Location: Greater London

Applied Scientist, Observability and Triage, Prime Video

Job :  | Amazon Development Centre (London) Limited

Come build the future of entertainment with us. Are you interested in shaping the future of movies and television? Do you want to define the next generation of how and what Amazon customers are watching?

Prime Video is a premium streaming service that offers customers a vast collection of TV shows and movies — all with the ease of finding what they love to watch in one place. We offer customers thousands of popular movies and TV shows including Amazon Originals and exclusive licensed content, as well as exciting live sports events. Members can also subscribe to add‑on channels, cancel at any time, and rent or buy new‑release movies and TV box sets on the Prime Video Store.

Prime Video is a fast‑paced, growth business available in over 200 countries and territories worldwide. The team works in a dynamic environment where innovating on behalf of our customers is at the heart of everything we do. If this sounds exciting to you, please read on.

The Observability and Triage team is looking for an Applied Scientist for our London office with experience in generative AI and large models. This wide‑impact role works with development teams across the UK, India, and the US. The green‑field project will deliver features that reduce the operational load for internal Prime Video builders, and you will develop AI‑driven solutions that automatically detect anomalies, identify root causes, recommend resolutions, and take action for operational incidents.

You will consume petabytes of data daily from multiple metric, log, and data‑based events and experiment with how to shape the future using this data.

You will bring strong technical ability, excellent teamwork and communication skills, and a strong motivation to deliver customer value from your research. Our position offers opportunities to grow your technical and non‑technical skills and make a global impact.

Key job responsibilities
  • Design and develop machine learning and generative AI systems for automated incident triage, root‑cause analysis, and resolution recommendation at scale
  • Rapidly prototype and evaluate hypotheses in a high‑ambiguity environment, leveraging both quantitative experimentation and domain expertise in operational systems
  • Build evaluation frameworks (including LLM‑as‑a‑Judge approaches) to measure model accuracy across triage accuracy and root‑cause prediction
  • Collaborate with software engineering teams to integrate ML models into production observability systems serving hundreds of development teams
  • Communicate results and insights to both technical and non‑technical audiences, including through publications, presentations, and written reports
A day in the life

On a typical day, you analyse patterns across thousands of operational incidents to improve an automated triage model, then design an experiment to test whether a new generative‑AI based approach better identifies root causes for complex multi‑service incidents. Your internal customers are Prime Video development teams who rely on your solutions to reduce the time and effort spent responding to operational events.

You collaborate closely with software engineers and operational stakeholders across the world to ensure your research translates into production systems that measurably reduce customer impact.

About the team

Our team builds AI‑powered observability and triage solutions for Prime Video development teams, consuming petabytes of data daily to automatically detect, diagnose, and recommend resolutions for operational incidents.

Basic Qualifications
  • Experience programming in Java, C++, Python or related language
  • Experience in any of the following areas: algorithms and data structures, parsing, numerical optimisation, data mining, parallel and distributed computing, high‑performance computing
  • Experience in building machine learning models for business application
  • PhD, or Master’s degree in CS, CE, ML or equivalent relevant work experience
Preferred Qualifications
  • Experience using Unix/Linux
  • Experience in professional software development

Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice (https://(Use the "Apply for this Job" box below). ) to know more about how we collect, use and transfer the personal data of our candidates.

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

#J-18808-Ljbffr
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary