Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach enables Cerebras to deliver industry-leading training and inference speeds and allows machine learning users to run large-scale ML applications with less hardware management.
Cerebras' customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras to deploy scale and transform key workloads with ultra-high-speed inference.
Thanks to the wafer-scale architecture, Cerebras Inference offers fast Generative AI inference, significantly faster than GPU-based hyperscale cloud inference services. This speed improvement enhances the user experience of AI applications, enabling real-time iteration and more capable computation.
About The Role
As a Senior Software Engineer in the ML Integration and Quality team, you will bring together and deliver all software and hardware components for the Cerebras AI platform. You will focus on software component integration and quality, and on pre-deployment and production validation for Cerebras training and inference solutions. You will influence testing practices, debugging methodology, and cross-team communication, and advocate for world-class products.
Responsibilities
- Develop and execute a comprehensive integration and QA strategy aligned with the roadmap of the Cerebras AI solution.
- Apply solid software integration methodologies, communicate effectively, and ensure quality.
- Break down complex tasks, solve problems, and assist with debugging.
- Automate workflows and testbed setups, and build tools for monitoring and debugging.
- Identify potential issues by creatively testing Cerebras software.
- Contribute to developing software specifications with a focus on ML.
- Drive quality of various software and hardware components to ensure accuracy, performance, and usability of ML training and inference.
- Work effectively in a fast-paced environment, making prioritizations and judgments that impact productivity.
- Define and implement quality metrics to measure product and process quality, providing actionable insights and recommendations to drive continuous improvement.
- Provide regular updates on quality, key metrics, and risks to engineering and business stakeholders.
- Collaborate with software and product teams to develop clear acceptance criteria and deliver a quality product.
- Demonstrate ownership and a quality-driven approach in all deliverables.
Skills & Qualifications
- 5+ years of relevant industry experience in software integration and development.
- Strong automation and programming skills using Python, C++, or Go.
- Experience testing compute/machine learning/networking/storage systems within a large-scale enterprise environment.
- Experience debugging issues across distributed scale-out systems.
- Experience understanding complex systems and creating thorough test plans.
- Experience working effectively across teams, including product development, product management, customer operations, and field teams.
- Excellent verbal and written communication.
- Strong organizational skills, teamwork, and a can-do attitude.
- Experience working with geographically dispersed teams across time zones.
Preferred Skills & Qualifications
- Experience with ML workloads such as LLM/Multimodal training or related areas.
- Experience with hardware architecture, performance optimizations, compilers, and ML frameworks.
- Experience with distributed systems, cloud environments, and microservices deployment and debugging.
- This role follows a hybrid schedule, requiring in-office presence 3 days per week; fully remote is not an option.
- Office locations: Sunnyvale, Toronto.
People who are serious about software build their own hardware. At Cerebras we have built a breakthrough architecture unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team cite five main reasons for joining Cerebras:
Read our blog: Five Reasons to Join Cerebras in 2026.
Apply today and join the forefront of groundbreaking advancements in AI!
Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We strive daily to create a work environment that empowers people to do their best work through continuous learning and support.