More jobs:
Research Scientist Intern, Audio Quality AI; PhD
Job in
Redmond, King County, Washington, 98052, USA
Listed on 2026-06-15
Listing for:
Meta
Apprenticeship/Internship
position Listed on 2026-06-15
Job specializations:
-
IT/Tech
Artificial Intelligence, Data Scientist, AI Engineer (Applied/Software), Machine Learning/ ML Engineer -
Research/Development
Artificial Intelligence, Data Scientist
Job Description & How to Apply Below
Apply nowThe Meta Reality Labs Research Team brings together a world-class team of researchers, developers, and engineers to create the future of virtual and augmented reality, which together will become as universal and essential as smartphones and personal computers are today. And just as personal computers have done over the past 45 years, AR, VR and MR will ultimately change everything about how we work, play, and connect.
We are developing all the technologies needed to enable breakthrough AR glasses and VR headsets, including optics and displays, computer vision, audio, graphics, brain-computer interfaces, haptic interaction, eye/hand/face/body tracking, perception science, and true telepresence. Some of those will advance much faster than others, but they all need to happen to enable AR, VR and MR that are so compelling that they become an integral part of our particular, the Meta Reality Labs Research audio team is focused on two goals;
creating virtual sounds that are perceptually indistinguishable from reality, and redefining human hearing. See more about our work here:
Inside Facebook Reality Labs Research:
The future of audio and Filter Out the Noise With Conversation Focus. These two initiatives will allow us to connect people by allowing them to feel together despite being physically apart, and allow them to converse in even the most difficult listening environments.
Meta Reality Labs Research is looking for an intern who is passionate about speech perception and audio quality to investigate why processed speech sometimes sounds degraded or robotic. The project focuses on identifying systematic phonemic errors as causal factors in perceived quality degradation, and linking these errors to human quality and intelligibility judgments. A core method is to explore the capabilities of audio vs video LLMs.
This is fundamentally a speech-perception research role; multimodal/LLM methods are a supporting tool rather than the central focus.
Our internships are twelve (12) to twenty four (24) weeks long and we have various start dates throughout the year.
-## Research Scientist Intern, Audio Quality with AI (PhD) Responsibilities
* Investigate systematic phonemic errors as causal factors in perceived speech quality degradation, and link them to human perceptual judgments
* Build and curate datasets and benchmarks of speech for phoneme-level analysis
* Explore and compare the capabilities of audio and video (multimodal) LLMs as tools to support this analysis
* Relate findings to human perceptual data (quality preference and intelligibility) and translate them into actionable insights for research and engineering teams
* Where appropriate, adapt multimodal models to the task in a supporting capacity
* Collaborate with researchers, engineers, and cross-functional partners to define goals, communicate findings, and drive improvements in speech quality
* Develop tools and infrastructure to streamline and scale the analysis
* Stay current with research in speech perception and audio quality and intelligibility assessment, and incorporate best practices into Meta's workflows
* Disseminate results through internal reports and presentations, and, when appropriate, external publications##
Minimum Qualifications
* Currently has, or is in the process of obtaining, a PhD degree in the field of Speech and Hearing Science, Auditory Neuroscience, Computational Neuroscience, Computer Science, Artificial Intelligence, Generative AI, Transformer Models, Machine Learning, Signal Processing or Computer vision
* 3+ years experience with Python, Matlab, or similar
* 3+ years experience with machine learning software platforms such as PyTorch, Tensor Flow, etc
* Background in speech perception, psychoacoustics, or acoustic phonetics
* Experience deploying novel audio computational models and LLMs
* Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment##
Preferred Qualifications
* Experience building novel audio computational models and LLMs
* Demonstrated software engineer experience via an internship, work experience, coding competitions, or…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×