More jobs:
PhD Intern, LLM Model Research
Job in
San Jose, Santa Clara County, California, 95199, USA
Listed on 2026-07-02
Listing for:
ByteDance
Apprenticeship/Internship
position Listed on 2026-07-02
Job specializations:
-
Research/Development
AI Business & Operations, Data Scientist
Job Description & How to Apply Below
Student Researcher (Seed – LLM – Model) – 2026 Start (PhD) | Byte Dance
This is a PhD Internship at Byte Dance, located in the United States, targeting a 2026 start. Byte Dance, a global technology company, is dedicated to inspiring creativity and enriching life through innovative products. This role is crucial for advancing foundational algorithm research for large language models, ensuring their performance, efficiency, and stability for various downstream applications. PhD interns actively contribute to the company’s products, research, future plans, and emerging technologies.
TL;DR
- Role:
Internship - Type:
Full‑time (for the duration of the internship) - Location:
In‑person, United States - Pay: $60 hourly
- Team:
Seed‑LLM‑Model team, focused on foundational algorithm research for LLM models - Mission:
Conduct cutting‑edge research and development in LLM and Multi Modal Machine Learning to solve practical industry problems. - Tech Stack:
PyTorch, Tensor Flow, Megatron, FSDP, Deepspeed, Python, C++
- Research:
Research and develop cutting‑edge algorithms for large language models and Multi Modal Machine Learning. - Innovate:
Conduct in‑depth research on advanced technologies in LLM and Multi Modal Machine Learning fields. - Apply:
Apply cutting‑edge LLM/Multi Modal ML technologies to solve practical problems within the industry. - Contribute:
Participate in foundational algorithm research, specifically focusing on model architecture, optimization, and stability. - Publish:
Pursue opportunities to publish top international papers and apply for patents based on research contributions.
- Background:
Currently pursuing a PhD in artificial intelligence, computer science, automation, mathematics, or a related technical discipline. - Experience:
Solid foundation in data structure and algorithm design; proficient in deep learning frameworks like PyTorch and Tensor Flow; proficient in distributed large language model training frameworks such as Megatron, FSDP, or Deepspeed. - Skills:
Proficient in Python/C++; good reading and writing skills; solid foundation in mathematics; strong sense of responsibility, proactive, with good communication and teamwork skills. - Bonus:
Experience with pre‑trained basic technologies including efficient training and encapsulated deployment services (NLP, CV, video, Multi Modal Machine Learning, and their downstream applications); published papers in accredited academic conferences; excellent results in Multi Modal Machine Learning, Computer Vision, or Machine Learning competitions.
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×