Multimodal Spatial Generative AI Researcher
Listed on 2026-05-31
-
IT/Tech
AI Engineer, Artificial Intelligence, Machine Learning/ ML Engineer, Computer Science
About Dolby
Join the leader in entertainment innovation and help us design the future. At Dolby, science meets art, and high tech means more than computer code. As a member of the Dolby team, you’ll see and hear the results of your work everywhere, from movie theaters to smartphones. We continue to revolutionize how people create, deliver, and enjoy entertainment worldwide. To do that, we need the absolute best talent.
We’re big enough to give you all the resources you need, and small enough so you can make a real difference and earn recognition for your work. We offer a collegial culture, challenging projects, and excellent compensation and benefits, not to mention a Flex Work approach that is truly flexible to support where, when, and how you do your best work.
Technology Group (ATG)
The ATG is the research division of the company. ATG’s mission is to look ahead, deliver insights, and innovate technological solutions that will fuel Dolby’s continued growth. Our researchers have a broad range of expertise related to computer science and electrical engineering, such as AI/ML, algorithms, digital signal processing, audio engineering, image processing, computer vision, data science & analytics, distributed systems, cloud, edge & mobile computing, computer networking, and IoT.
Position OverviewDolby’s Research Division is seeking a Multimodal Spatial Experiences Researcher to help shape the future of generative volumetric media creation and distribution. Joining the Multimodal Experience Lab within Dolby’s Advanced Technology Group, you will collaborate with world‑class researchers to advance generative AI techniques that produce spatially consistent volumetric scenes suitable for streaming distribution in Dolby's volumetric formats.
What You’ll Do- Conduct cutting‑edge research in generative AI video models and their application to volumetric scene creation and augmentation.
- Develop methods to generate spatially consistent 3D Gaussian Splat representations from generative AI video outputs, targeting Dolby's volumetric distribution formats.
- Incorporate multiple modalities including vision, audio, language, and beyond to achieve more than is possible with any one modality alone.
- Define and drive research roadmaps in collaboration with a top‑tier research team.
- Apply AI and deep learning techniques to develop novel solutions and enhance existing approaches for volumetric content creation.
- Design and validate algorithms using robust methodologies, quality metrics, and test content.
- Share insights through publications, technical papers, and internal presentations.
- Transfer research outcomes to product groups and contribute to patent applications.
- Stay ahead of the curve by monitoring the latest advances in generative AI, computer graphics and computer vision.
- Ph.D. in Computer Science, Electrical Engineering, Applied Math, Physics, or related field.
- Strong publication record in top‑tier conferences and journals.
- Deep expertise in generative AI video models (e.g., diffusion models, video generation/editing architectures) and their application to 3D-consistent content creation.
- Strong understanding of 3D scene representations, particularly NeRF or Gaussian Splatting.
- Experience with integrating multiple modalities is a plus.
- Proven experience in computer graphics / computer vision R&D.
- Proficiency in programming languages such as Python, C/C++, or MATLAB.
- Excellent written and verbal communication skills.
- Collaborative team mindset with the ability to work across research and engineering.
- Strong background in deep learning‑based methods.
- Experience with real‑time graphics APIs (OpenGL, CUDA, Direct
X, Vulkan). - Familiarity with game engines (Unreal, Unity).
- Experience with XR environments, capture, and devices.
- Experience with video codec pipelines and streaming distribution architectures.
- Familiarity with 3D generative models (e.g., score distillation, multi‑view diffusion, feed‑forward 3D generation).
The Atlanta Area base salary range for this full‑time position is $137,600–$168,300, which can vary if outside this location, plus bonus, benefits, and some roles may also…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).