Research Engineer Graduate (Seed-Infra-Machine Learning Sys Training-US) - 2026 Start (PhD)

Job in Seattle, King County, Washington, 98127, USA
Listing for: ByteDance
Apprenticeship/Internship position
Listed on 2026-01-01
Job specializations:
  • IT/Tech
    Machine Learning/ ML Engineer, AI Engineer
Position: Research Engineer Graduate (Seed-Infra-Machine Learning Sys Training-US) - 2026 Start (PhD)

Location: Seattle

Team: Technology

Employment Type: Regular

Job Code: A123005

Responsibilities

The Seed-Infra team combines ML system engineering and the art of machine learning to develop and maintain massively distributed ML training and inference systems and services around the world, providing high-performance, highly reliable, scalable systems for LLM/AIGC/AGI. In our team, you'll have the opportunity to build a large-scale heterogeneous system that integrates GPU/NPU/RDMA/Storage and keep it running stably and reliably, to deepen your expertise in coding, performance analysis, and distributed systems, and to be involved in the decision-making process.

You'll also be part of a global team with members from the United States, China, and Singapore working collaboratively towards a unified project direction. We are looking for talented individuals to join our team in 2026. As a graduate, you will get opportunities to pursue bold ideas, tackle complex challenges, and unlock limitless growth. Launch your career where inspiration is infinite. Successful candidates must be able to commit to an onboarding date by end of year 2026.

Please state your availability and graduation date clearly in your resume.

Responsibilities:
Responsible for the machine learning system development of the company's large-scale models, researching new applications and solutions of related technologies in areas such as search, recommendation, advertising, content creation, conversation, and customer service, meeting the growing demand for intelligent interaction from users, and comprehensively improving users' lifestyles and communication methods in the future world.

The main work directions include:

• Responsible for the design and development of the architecture of large-scale machine learning systems, solving technical difficulties such as high concurrency, high reliability, and high scalability of the system.

• Covering various sub-directions of machine learning systems, including resource scheduling, model training, model inference, data management, and workflow orchestration.

• Responsible for the research and introduction of advanced technologies in machine learning systems, such as the latest hardware architecture, heterogeneous computing systems, and compiler-based optimization technologies.

• Working closely with the algorithm teams to jointly optimize algorithms and systems.

Qualifications

Minimum Qualifications

• Final year or recent PhD graduate with a background in Computer Science, related technical field or equivalent industrial research experience.

• Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment.

• Excellent coding ability, a solid foundation in data structures and basic algorithms, and proficiency in C/C++ or Python; winners of ACM/ICPC, NOI/IOI, and other competitions are preferred.

• Familiar with at least one mainstream machine learning framework (TensorFlow/PyTorch/JAX).

• Mastery of the principles of distributed systems, with experience in the design, development, and maintenance of large-scale distributed systems.

• Strong sense of responsibility, good learning ability, communication ability, and self-motivation.

• Good communication and collaboration skills, able to explore new technologies with the team and promote technological progress.

Preferred Qualifications

• Prior experience in large-scale projects or papers with great influence in the field of large models.

• Familiar with NLP- and CV-related algorithms and technologies, and experienced in large model training and RL algorithms.

• Experience in one of the following fields: CUDA, RDMA, AI Infrastructure, HW/SW Co-Design, High-Performance Computing (CUTLASS, NCCL), ML Hardware Architecture (GPU, Accelerators, Networking), ML for Systems, and Distributed Storage.

• Demonstrated related technical experience from previous internships, work experience, coding competitions, or publications.

• Curiosity towards new technologies and entrepreneurship.

• High levels of creativity and quick problem-solving capabilities.

By…