More jobs:
Job Description & How to Apply Below
Sony Research India is offering outstanding career opportunities around frontline technologies such as AI and data analytics.
Key Responsibilities:
Internships at Sony Research India (SRI) are aimed at offering students with an opportunity to industry exposure and a production level project experience. The primary responsibility in this role will be to positively contribute to the growth and development of innovative technologies ing the internship, you will work closely with research scientists and other members in the team on various development and optimization tasks in object detection, tracking and recognition, video generation, content moderation, large vision models, etc.
In addition, you’ll be expected to actively work on AI development activities daily, be it training, testing, fixing bugs or other routines for building a robust model.
For this internship, we are seeking candidates with demonstrable skills and knowledge in generative AI for video generation and understanding. The ideal candidate should have hands-on experience with video generation models, Vision-Language Models (VLMs), diffusion-based video synthesis, transformer-based video architectures and video editing models. It is essential to have strong coding skill in Python and PyTorch, along with hands-on experience implementing generative models.
Work Location:
Remote work within India.
Duration of the paid internship:
The internship will be for 6 months starting April First week of 2026.
The working hours are from 9:00 to 18:00 (Monday to Friday) fulltime.
Essential
Education:
Candidates pursuing or has completed MS/MSc/MTech/PhD level degree in Computer Science, Electronics Engineering, Data Science, Information Science, Artificial Intelligence, Computer Applications or other closely related technical discipline, will be considered for the internship program.
Must Have
Skills:
Strong understanding of advanced generative techniques including video diffusion models, latent video representations, temporal transformers, and cross-frame attention mechanisms for maintaining temporal consistency.
Familiarity with generative AI architectures such as Diffusion Models, Transformers, and GANs, particularly applied to video synthesis, multimodal alignment (text–audio–visual), video editing, and controllable content generation.
Knowledge of large vision and vision-language models (VLMs) for video understanding and generation, including efficient fine-tuning, optimization, and scalable training for video-based tasks.
Good to have
Skills:
Prior experience with text-to-video, image-to-video, video-to-video translation, or controllable video editing.
Demonstrate research capabilities through relevant publication record in top-tier conferences like CVPR, ECCV, ICCV, AAAI, ACM MM, WACV, NeurIPS.
Strong knowledge and hands-on programming experience with Python and PyTorch, along with relevant ML/DL libraries.
Our Values:
Dreams & Curiosity:
Pioneer the future with dreams and curiosity.
Diversity:
Pursue the creation of the very best by harnessing diversity and varying viewpoints.
Integrity & Sincerity:
Earn the trust for Sony brand through ethical and responsible conduct.
Sustainability:
Fulfil our stakeholder responsibilities through disciplined business practices.
Sony Research India is committed to equal opportunity in all its employment practices, policies and procedures and to ensuring that no worker or potential worker will receive less favourable treatment due to any characteristic protected under applicable local laws.
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×