LLM Distillation Research Intern
Listed on 2026-05-29
-
Research/Development
Research Scientist, Clinical Research, Data Scientist, Research Analyst
Overview
Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.
We're looking for a research intern to work on distillation of large language models (LLMs) -- i.e., training smaller and more efficient LLMs from larger models without serious drops in performance. We're looking for distilled models for some of the applications we are building on the Special Projects team in Microsoft Research (MSR). You would be applying cutting-edge distillation methods such as the approach used to train Phi in the “Textbooks Are All You Need” paper.
The standard approach to distillation encourages the distilled model to emulate the hidden states of the larger teacher model. In this internship, we're looking to augment that standard approach with methods that align more structured domain knowledge that we might see in a knowledge graph, simulator/process model, or some other structured representation of knowledge.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Qualifications
Required Qualifications
- Currently enrolled in a PhD program,
- OR a research master’s degree program with the intention of completing a PhD, in Computer Science, Artificial Intelligence, or a related field.
- At least 1 year in programming languages used in AI research, such as Python, and familiarity with ML frameworks (e.g., Tensor Flow, PyTorch).
Other Requirements
- Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
- In addition to the qualifications above, you’ll need to submit a minimum of two reference letters for this position. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates.
You may wish to alert your letter writers in advance, so they will be ready to submit your letter.
Preferred Qualifications
- Background in machine learning, particularly in natural language processing and model distillation.
- Experience with techniques for distilling large language models.
- Knowledge of structured domain knowledge integration into AI models.
- Proficient analytical, problem-solving, and research skills.
- Ability to work collaboratively in a fast-paced research environment.
The base pay range for this internship is - Applied Sciences IC2 : USD $5,090 - $10,120 per month. There is a different range applicable to specific work locations, with the San Francisco Bay area and New York City Metropolitan area, and the base pay range for this role in those locations is USD $6,690 -$11,030 per month.
The base pay range for this internship is
- Applied Sciences IC3 : USD $6,290 - $12,170 per month. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $8,060 - $13, 240 per month.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
Responsibilities
Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).