More jobs:
Senior Research Scientist; Speech
Job in
San Francisco, San Francisco County, California, 94199, USA
Listed on 2026-01-01
Listing for:
Aldea
Full Time
position Listed on 2026-01-01
Job specializations:
-
Engineering
Artificial Intelligence, AI Engineer -
IT/Tech
Artificial Intelligence, AI Engineer
Job Description & How to Apply Below
About Aldea
Aldea is a multi-modal foundational AI company reimagining the scaling laws of intelligence. We believe today's architectures create unnecessary bottlenecks for the evolution of software. Our mission is to build the next generation of foundational models that power a more expressive, contextual, and intelligent human‑machine interface.
The Role
We are seeking a Foundational AI Research Scientist (Speech) to advance the frontier of speech understanding and generation. You will lead applied research in speech‑to‑text (STT), text‑to‑speech (TTS), and speech‑to‑speech modeling, designing architectures and training strategies that redefine fidelity, cont rollability, and efficiency in voice‑based systems.
This role blends deep research expertise with strong engineering intuition. You'll drive end‑to‑end experimentation—from model design and training‑pipeline setup to empirical validation—and help translate breakthroughs into production‑grade systems.
What You’ll Do
• Research and prototype novel architectures for STT, TTS, and speech‑to‑speech modeling.
• Design and execute experiments validating new methods for scalability, performance, and quality.
• Collaborate cross‑functionally with engineering teams to integrate research into real‑world products.
• Stay current with foundational research in speech processing and generative modeling.
Minimum Qualifications
• Ph.D. in Computer Science, Engineering, or a related field.
• 3+ years of relevant industry experience.
• Demonstrated experience in training or researching TTS, STT, or speech‑to‑speech models.
• Deep understanding of modern sequence modeling architectures including State Space Models (SSMs), Sparse Attention mechanisms, Mixture of Experts (MoE), and Linear Attention variants.
• Proven experience with pre‑training foundational models from scratch on large‑scale datasets.
• Track record of working with massive multi‑modal datasets (audio, text, and speech corpora at scale).
• Deep expertise in PyTorch, Transformers, and modern deep‑learning frameworks.
• Ability to translate complex research ideas into high‑performance, maintainable code.
• Evidence of research excellence through impactful technical contributions.
Nice to Have
• Experience with voice‑based AI applications or multi‑speaker synthesis.
• Publication record in top‑tier venues (ICML, NeurIPS, ICLR, ICASSP, Interspeech).
• Background in cross‑lingual or multilingual speech systems.
• Experience with data curation, filtering, and quality assessment pipelines for speech data.
Compensation & Benefits
• Competitive base salary.
• Performance‑based bonus aligned with research milestones.
• Equity participation.
• Comprehensive health, dental, and vision coverage.
• Flexible paid time off.
Aldea is proud to be an equal‑opportunity employer. We are committed to building a diverse and inclusive culture that celebrates authenticity to win as one. We do not discriminate on the basis of race, religion, color, national origin, gender, gender identity, sexual orientation, age, marital status, disability, protected veteran status, citizenship or immigration status, or any other legally protected characteristics.
Aldea uses E‑Verify to confirm employment eligibility in compliance with federal law. For more information please visit: (Use the "Apply for this Job" box below)..e‑verify.gov
Please note:
We do not accept unsolicited resumes from recruiters or employment agencies and will not be responsible for any fees related to unsolicited resumes.
#JLjbffr
Position Requirements
10+ Years
work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×