×
Register Here to Apply for Jobs or Post Jobs. X

Research Scientist Graduate; Video Quality Analysis & Coding Strategy - Global Frontier Tech R

Job in San Jose, Santa Clara County, California, 95199, USA
Listing for: ByteDance
Full Time position
Listed on 2026-05-22
Job specializations:
  • Engineering
    Software Engineer, AI Engineer, Computer Science
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below
Position: Research Scientist Graduate (Video Quality Analysis & Coding Strategy) - Global Frontier Tech R[...]

Responsibilities

We are looking for talented individuals to join our team in 2027. As a graduate, you will get opportunities to pursue bold ideas, tackle complex challenges, and unlock limitless growth. Launch your career where inspiration is infinite at our Company.

Successful candidates must be able to commit to an onboarding date by end of year 2027. Please state your availability and graduation date clearly in your resume.

About The Team

At Multimedia Lab, we push the boundaries of what’s possible in multimedia technology. Our mission is to pioneer cutting-edge research across image and video understanding, generation, processing, compression, and transmission—and transform these innovations into real-world products that delight hundreds of millions of users globally.

The ideal candidate combines deep technical expertise with a strong record of innovation, thrives on solving challenging problems at scale, and is passionate about shaping the future of multimedia experiences. This is an opportunity to work alongside top talent, drive frontier research, and turn breakthrough ideas into impactful technologies used around the world.

Topic Content:

Multimodal Foundation Models for Intelligent Multimedia Processing

Explore next-generation multimedia technologies powered by multimodal foundation models, including perceptual quality modeling, generative enhancement, temporal video understanding, user-centric evaluation, and intelligent visual representation/compression, to advance video quality, efficiency, and user experience in future multimedia systems.

Challenges for the analysis, understanding, and quality assessment and enhancement based on multimodal large models:

  • Modeling complex time sequences in long multimodal videos
  • Building few-shot grounding-based models for quality assessment
  • Creating interactive video processing/enhancement models aligned with user preferences

Research value of analysis, understanding, and quality assessment and enhancement based on multimodal large models:

  • Enhance semantic understanding and event localization in medium- and long-length videos, improving processing efficiency, and support key areas such as ads recommendation, content comprehension, video value evaluation, and transcoding enhancement
  • Lower the cost of quality annotation, enable interpretable assessment of local degradation, boost generalization across different content types, and support pixel-level quality inspection and optimization

Responsibilities

  • Design video analysis (ROI/SOD, content understanding, temporal grounding etc.) and quality assessment algorithms, and participate in database creation, algorithm design/development/optimization, etc.
  • Participate in designing strategy and solution for E2E video quality optimization with a combination of video analysis, processing and encoding algorithms
  • Apply designed algorithms for VOD / Live streaming monitoring, data analysis, objective evaluation for algorithms etc.
  • Collaborate with cross-functional teams to integrate algorithms into production workflows and validate their impact through A/B testing.

Qualifications

Minimum qualifications:

  • Individuals who are completing or recently completed a PhD in Software Development, Computer Science, Computer Engineering, or a related technical discipline.
  • In-depth knowledge of video analysis algorithms or subjective/objective video quality algorithms, and state-of-the-art technologies
  • Proficient in one of the following: C, C++, Python

Preferred Qualifications

  • Familiar with ML and image processing tools, including sklearn, opencv, ffmpeg, etc
  • Familiar with deep learning frameworks (Tensorflow/Pytorch)
  • Familiar with Transformer architectures and mainstream multi-modal large models (MLLMs), and hands-on implementation or research experience preferred.
  • Familiar with Linux development environments, shell scripting, HDFS etc
  • Knowledge of common video processing algorithms, such as supper resolution, defusion model, etc.
  • Great communication, eager to learn, and always passionate about turning cutting-edge technologies into real life use cases.

Job Information

【For Pay Transparency】Compensation Description (Annually). The base salary range for…

To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary