Advanced Technology: AI/ML Research Scientist
Listed on 2026-06-18
-
IT/Tech
AI Engineer (Applied/Software), Data Scientist, AI Business & Operations, Machine Learning/ ML Engineer
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.
Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference.
About The TeamCerebras builds wafer-scale AI processors—single chips delivering tens of PB/s of memory bandwidth and a dataflow architecture that accelerates at a granularity no multi-device system can match. The Advanced Technology Group (ATG) is Cerebras ’ pathfinding organization. We work ahead of product to explore new architectures, demonstrate breakthrough performance on scientific and AI workloads, and shape the technical roadmap for future Cerebras hardware and software.
Our work regularly appears at top-tier venues (Supercomputing, SIAM, IEEE, and NeurIPS ) and directly influences the design of next-generation wafer-scale systems.
The Role
Most AI research today is shaped by the constraints of existing hardware. This role starts from the other direction: what would you build if the architecture let you rethink the fundamentals? You will design and develop AI models and training methodologies on wafer-scale hardware, working at the level of optimization theory, model architecture, and statistical foundations rather than assembling existing components.
The ATG sits at the intersection of AI, computational science, and computer architecture, and your work will draw on all three. You will collaborate closely with Cerebras’ ASIC, compiler, kernel, and AI teams as well as external partners at universities and national laboratories.
What You Will Do- Design AI models and training methods from first principles,leveragingarchitectural properties of wafer-scale hardware that are unavailable on conventional platforms.
- Investigate how techniques from computational science—numerical methods, PDE solvers, simulation—can inform and advance AI modeldesign, and explore hybrid workflows that couple simulation and learning.
- Develop a deep understanding of the hardware substrate and use it to guide algorithmic choices: model structure, optimization strategy, memory access patterns, numerical precision.
- Publish findings and present at top-tiervenues (NeurIPS, ICML, ICLR, etc.); represent Cerebras in the broader AI/MLresearch community.
- Inform the design of future Cerebras hardware and software by identifying the computational patterns that matter most for next-generation AI workloads.
- PhD in Machine Learning, Computer Science, Applied Mathematics, Statistics, Physics, or a related quantitative field preferred
; exceptional candidates without a graduate degree whodemonstrateequivalent depth through published research, significant open-source contributions, or a strong industrytrack recordare encouraged to apply. - Mathematical maturity: comfort with the theory behind gradient methods,losslandscapes, generalization, and the relationship between model structure and data statistics.
- Track recordof published research at top-tierAI or computational science venues.
- Proficiency in Python andPyTorch; comfort with C or other low-level languages is a strong signal.
- Excellent communication and interpersonal skills:able to present complex technical material to both ML and systems audiences, and to collaborate effectively in a fast-paced, small-team environment.
- You will have direct access to hardware that changeswhat’salgorithmically possible. Tens of PB/s of memory bandwidth and fine-grained dataflow execution open design spaces thatdon’texist on GPU clusters.
- You will work alongside researchers in computational science, computer architecture, and performance engineering. The synthesis across these fields is central to ATG’s approach.
- Your research will influence silicon - ATG’s findings directly shape the design of future Cerebras chips and systems.
Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.
This website or its third-party tools process personal data. For more details, to review our CCPA disclosure notice.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).