Student Researcher Seed Infra-Compiler PhD Job San Jose area,California USA,IT/Tech

Position: Student Researcher - (Seed Infra-Compiler) - 2026 Start (PhD)

Student Researcher - (Seed Infra-Compiler) - 2026 Start (PhD)

Location:

San Jose

Team:
Technology

Employment Type:

Intern

Job Code: A220086

About the Team

The Seed Infrastructures team oversees distributed training, reinforcement learning framework, high-performance inference, and heterogeneous hardware compilation technologies for AI foundation models.

Responsibilities

Contribute to AI compiler optimizations for training and inference workloads
Develop and extend MLIR-based compiler passes for graph lowering, optimization, and code generation
Optimize model execution on GPU and NPU accelerators, focusing on performance, memory efficiency, and scalability
Support model deployment pipelines, including compilation, packaging, and runtime integration
Assist with distributed training and inference acceleration, such as parallel execution, communication optimization, and runtime scheduling
Benchmark, profile, and analyze performance of large-scale models across different hardware backends
Collaborate with researchers and engineers to translate model and system requirements into compiler and runtime improvements

Qualifications

Minimum Qualifications

Currently pursuing a PhD degree in Computer Science, Electrical Engineering, or related technical fields
Experience using or developing open source frameworks for LLM inference such as vLLM or SGLang. Proficient in at least one deep learning framework (e.g., PyTorch, Megatron, Deep Speed, JAX), with experience in model inference workflows
Understanding of modern computing systems, including hardware, storage, and networking, and how they impact ML workloads
Familiarity with compilers or model optimization pipelines (e.g., PyTorch Dynamo), or related model execution workflows
Able to commit to working for 12 weeks in 2026

Preferred Qualifications

Experience with distributed or large-scale ML systems, including training or inference pipelines and related optimizations (e.g., FSDP, Deep Speed, Megatron, GSPMD)
Experience with GPU/TPU/NPU programming and performance optimization, or high-performance computing and communication (e.g., CUDA, Triton, NCCL, RDMA)
Understanding of AI compiler and model optimization stacks (e.g., torch.fx, PyTorch Dynamo, XLA, MLIR)

Job Information

The hourly rate range for this position in the selected city is $60- $60.

Benefits

Interns have day one access to health insurance, life insurance, wellbeing benefits and more. Interns also receive 10 paid holidays per year and paid sick time (56 hours if hired in first half of year, 40 hours if hired in second half of year). Interns who are not working 100% remote may also be eligible for housing allowance.

#J-18808-Ljbffr