Sr. Multimodal Model Training and Inference Optimization Engineer San Jose Regular
Listed on 2026-02-16
-
IT/Tech
Machine Learning/ ML Engineer, AI Engineer, Data Engineer, Data Scientist
Sr. Multimodal Model Training and Inference Optimization Engineer
Location:
San Jose
Team:
Technology
Employment Type:
Regular
Job Code:
A193505
Share this listing:
ResponsibilitiesAbout the team
The Vision-Applied Research team focuses on applied research in Generative AI and CV/Multimodal Understanding, and delivering intelligent solutions to Byte Dance products, e.g., Tik Tok, Cap Cut, and Lemon8, enabling users to make and share creative content in a much easier way. The team has research groups dedicated to generative models for content creation, image generation, video synthesis, intelligent image/video editing, and virtual humans.
We are seeking an experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model training and inference, including distributed training/inference and acceleration. The ideal candidate will work at the cutting edge of AI efficiency, enhancing the performance, scalability, and deployment of large-scale generative AI models.
- Optimize large model training pipelines to improve efficiency, speed, and scalability.
- Develop and improve distributed training strategies such as data parallelism, model parallelism, pipeline parallelism and communication to accelerate model training.
- Benchmark and profile deep learning models to identify performance bottlenecks and optimize computational resources.
Minimum Qualifications:
- M.S or PhD in Computer Science, Electrical Engineering, Artificial Intelligence, or a related field.
- 3 years+ experience in AI model training optimization.
- Strong software engineering skills, including proficiency in Python, C++, and CUDA.
- Strong proficiency in deep learning frameworks such as PyTorch, Megatron and Deepspeed.
- Experience with distributed training techniques such as data parallelism, model parallelism, and pipeline parallelism.
- Knowledge of transformers and diffusion models.
Preferred Qualifications:
- Candidates with publications at conferences such as MLSys, NeurIPS, ICLR, or ICML are preferred.
- Strong communication and teamwork skills.
- Self-motivated and strong problem-solving skills.
- Ability to work collaboratively in multi-functional teams.
- Experienced in implementing and optimizing complex and performance-critical systems.
The base salary range for this position in the selected city is $208800 - $438000 annually.
Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.
Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).
The Company reserves the right to modify or change these benefits programs at any time, with or without notice.
For Los Angeles County (unincorporated) CandidatesFounded in 2012, Byte Dance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including Tik Tok, Lemon8, Cap Cut and Pico as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, Byte Dance has made it easier and more fun for people to connect with, consume, and create content.
WhyJoin Byte Dance
Inspiring creativity is at the core of Byte Dance's mission. Our innovative products are built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and enrich life - a mission we work towards every day.
As Byte Dancers, we strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).