Principal Research Scientist – Foundation Models Vision AI & Physical AI
Listed on 2026-05-24
-
Research/Development
Robotics, Artificial Intelligence, Research Scientist -
Engineering
Robotics, AI Engineer (Applied/Software), Artificial Intelligence, Research Scientist
Principal Research Scientist – Foundation Models for Vision AI & Physical AI
Location:
Seattle, WA or Palo Alto, CA (Hybrid/Remote)
Full-time with Centific
About the TeamCentific’s Physical AI Lab is building the next generation of embodied intelligence at the intersection of multimodal foundation models, simulation, agentic AI, and real-world robotics. Our mission is to move from perception and reasoning to robust real-world action across safety, industrial, healthcare, warehouse, autonomous systems, and smart environment use cases.
We are looking for a Research Leader with deep foundational model-building experience, a strong publication record, and the ability to translate frontier research into deployable systems and long-term IP.
The RoleAs Principal Research Scientist, you will define and drive Centific’s research agenda in Vision AI, multimodal foundation models, simulation-first learning, agentic AI, and embodied intelligence. You will lead a small team of researchers, engineers, and interns while contributing directly to model design, large-scale training, benchmarking, and external scientific visibility.
This role is for someone who has gone beyond applying existing models and has materially advanced architectures, training methods, datasets, or evaluation frameworks in AI, robotics, vision, autonomous driving, or multimodal learning.
What You’ll Do- Lead high-impact research in multimodal foundation models, world models, embodied AI, vision-language-action systems, and agentic AI.
- Develop new approaches for perception, temporal reasoning, spatial intelligence, affordance understanding, autonomous decision-making, and sim2real transfer.
- Advance challenging robotics capabilities including dexterous manipulation, contact-rich interaction, bimanual coordination, long-horizon task execution, navigation in dynamic environments, and robust action under uncertainty.
- Contribute to large-scale model building, including multimodal pretraining, distributed training, fine-tuning, distillation, and evaluation of models for vision, robotics, and autonomous systems.
- Help shape research relevant to autonomous driving and mobile autonomy, including scene understanding, multimodal sensor reasoning, planning-aware perception, and edge-case robustness.
- Guide integration of research with simulation and digital twin platforms such as Isaac Sim, Isaac Lab, Mu Jo Co , Omniverse, or related environments.
- Establish rigorous benchmarks and reproducible evaluation frameworks for robustness, safety, generalization, manipulation success, policy performance, and real-world deployment readiness.
- Mentor Ph.D. interns and engineers, and help build a strong research culture grounded in rigor, speed, originality, and scientific excellence.
- Ph.D. in Computer Science, Robotics, Machine Learning, Computer Vision, Autonomous Systems, or a related field.
- Strong publication record in top venues such as CVPR, ICCV, ECCV, NeurIPS, ICLR, ICML, CoRL, RSS, or leading autonomous driving/robotics venues.
- 5+ years of research experience in academia, industry, or advanced R&D environments.
- Demonstrated experience building or advancing large-scale foundational models, novel architectures, or training methods in multimodal AI, vision, robotics, autonomous driving, embodied AI, world models, or simulation-based learning.
- Deep expertise in PyTorch and/or JAX, GPU training, distributed experimentation, and large-scale model development.
- Proven ability to lead ambitious technical programs and mentor junior researchers.
- Publications or patents in multimodal foundation models, dexterous robotics, autonomous driving, spatial intelligence, simulation-based learning, manipulation, or embodied AI.
- Strong experience in Vision AI, including perception, tracking, grounding, 3D scene understanding, video understanding, sensor fusion, or multimodal reasoning.
- Familiarity with agentic AI systems, tool-using agents, planning frameworks, and memory-based architectures; experience with agentic memory, knowledge graphs, or long-horizon reasoning systems is a plus.
- Experience with Isaac Sim, Mu Jo Co , OpenUSD/Omniverse, Open3D,…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).