Student Researcher Seed Infra-Compiler PhD
Listed on 2026-06-16
-
IT/Tech
Machine Learning/ ML Engineer, AI Engineer (Applied/Software), Data Scientist
Student Researcher - (Seed Infra-Compiler) - 2026 Start (PhD)
Location:
San Jose
Team:
Technology
Employment Type:
Intern
Job Code: A220086
About the TeamThe Seed Infrastructures team oversees distributed training, reinforcement learning framework, high-performance inference, and heterogeneous hardware compilation technologies for AI foundation models.
Responsibilities- Contribute to AI compiler optimizations for training and inference workloads
- Develop and extend MLIR-based compiler passes for graph lowering, optimization, and code generation
- Optimize model execution on GPU and NPU accelerators, focusing on performance, memory efficiency, and scalability
- Support model deployment pipelines, including compilation, packaging, and runtime integration
- Assist with distributed training and inference acceleration, such as parallel execution, communication optimization, and runtime scheduling
- Benchmark, profile, and analyze performance of large-scale models across different hardware backends
- Collaborate with researchers and engineers to translate model and system requirements into compiler and runtime improvements
Minimum Qualifications
- Currently pursuing a PhD degree in Computer Science, Electrical Engineering, or related technical fields
- Experience using or developing open source frameworks for LLM inference such as vLLM or SGLang. Proficient in at least one deep learning framework (e.g., PyTorch, Megatron, Deep Speed, JAX), with experience in model inference workflows
- Understanding of modern computing systems, including hardware, storage, and networking, and how they impact ML workloads
- Familiarity with compilers or model optimization pipelines (e.g., PyTorch Dynamo), or related model execution workflows
- Able to commit to working for 12 weeks in 2026
Preferred Qualifications
- Experience with distributed or large-scale ML systems, including training or inference pipelines and related optimizations (e.g., FSDP, Deep Speed, Megatron, GSPMD)
- Experience with GPU/TPU/NPU programming and performance optimization, or high-performance computing and communication (e.g., CUDA, Triton, NCCL, RDMA)
- Understanding of AI compiler and model optimization stacks (e.g., torch.fx, PyTorch Dynamo, XLA, MLIR)
The hourly rate range for this position in the selected city is $60- $60.
BenefitsInterns have day one access to health insurance, life insurance, wellbeing benefits and more. Interns also receive 10 paid holidays per year and paid sick time (56 hours if hired in first half of year, 40 hours if hired in second half of year). Interns who are not working 100% remote may also be eligible for housing allowance.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).