Software Engineer, ML System Architecture
Listed on 2026-02-16
-
IT/Tech
Machine Learning/ ML Engineer, AI Engineer
Responsibilities
Leveraging substantial data and computing resources and through continued investment in these domains, we have developed a proprietary general‑purpose model with multimodal capabilities. In the Chinese market, Doubao models power over 50 Byte Dance apps and business lines, including Doubao, Coze, and Dreamina, and are available to external enterprise clients via Volcano Engine. Today, the Doubao app stands as the most widely used AIGC application in China.
The Machine Learning (ML) System sub-team combines system engineering and the art of machine learning to develop and maintain massively distributed ML training and inference system/services around the world, providing high‑performance, highly reliable, scalable systems for LLM/AIGC/AGI. In our team, you ll have the opportunity to build large‑scale heterogeneous systems integrating with GPU/NPU/RDMA/Storage and keep them running stable and reliable, enrich your expertise in coding, performance analysis and distributed systems, and be involved in the decision‑making process.
You ll also be part of a global team with members from the United States, China and Singapore working collaboratively towards unified project direction.
- Design and development of Machine Learning infrastructure for LLM/AIGC, etc.
- Build a super large machine learning system integrating GPUs, RDMA networking, and high‑performance storage.
- Address technical challenges such as high stability and availability of the system.
- Organize and coordinate multiple teams to complete the construction of the system, including Data center, network, computing, storage, and resource teams.
Minimum Qualifications
- Proficient in 1 to 2 programming languages such as C++/Go/Python/Shell in a Linux environment.
- Understanding of distributed systems principles and experience in design, development and maintenance of large‑scale machine learning systems.
- Familiarity with Kubernetes architecture and extensive experience in system‑level development and tuning.
- Excellent logical analysis ability, with the capacity to reasonably abstract and split business logic.
- Strong sense of responsibility, good learning ability, communication skills and self‑drive.
Preferred Qualifications
- Familiar with the ML infrastructure of large model training and inference.
- Experience in one of the following fields: AI Infrastructure, HW/SW Co‑Design, High Performance Computing, ML Hardware Architecture (GPU, Accelerators, Networking).
Founded in 2023, the Byte Dance Doubao (Seed) Team is dedicated to pioneering advanced AI foundation models. Our goal is to lead in cutting‑edge research and drive technological and societal advancements. Our research areas span deep learning, reinforcement learning, Language, Vision, Audio, AI Infra and AI Safety. Our team has labs and research positions across China, Singapore, and the US.
Why Join Byte DanceInspiring creativity is at the core of Byte Dance s mission. Our innovative products are built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and enrich life - a mission we work towards every day. As Byte Dancers, we strive to do great things with great people.
We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our Company, and our users. When we create and grow together, the possibilities are limitless. Join us.
Byte Dance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At Byte Dance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach.
We are passionate about this and hope you are too.
Byte Dance is committed…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).