Computer Vision Intern Job Delhi Delhi India,IT/Tech

Sony Research India is driving cutting-edge research and development in various locations around the globe, including laboratories in Japan, the United States, Europe, and Asia. We endeavor to create new technology, products, and services while sustaining Sony Group’s diverse businesses in electronics, entertainment, and financial fields. For our research Centre to blaze a trail in the latest technologies, we seek to foster the growth of a diverse pool of research and engineering talent and create a technology talent bank to drive research excellence worldwide.

Sony Research India is offering outstanding career opportunities around frontline technologies such as AI and data analytics.

Key Responsibilities:

Internships at Sony Research India (SRI) are aimed at offering students with an opportunity to industry exposure and a production level project experience. The primary responsibility in this role will be to positively contribute to the growth and development of innovative technologies ing the internship, you will work closely with research scientists and other members in the team on various development and optimization tasks in object detection, tracking and recognition, video generation, content moderation, large vision models, etc.

In addition, you’ll be expected to actively work on AI development activities daily, be it training, testing, fixing bugs or other routines for building a robust model.

For this internship, we are seeking candidates with demonstrable skills and knowledge in generative AI for video generation and understanding. The ideal candidate should have hands-on experience with video generation models, Vision-Language Models (VLMs), diffusion-based video synthesis, transformer-based video architectures and video editing models. It is essential to have strong coding skill in Python and PyTorch, along with hands-on experience implementing generative models.

Work Location:

Remote work within India.

Duration of the paid internship:
The internship will be for 6 months starting April First week of 2026.
The working hours are from 9:00 to 18:00 (Monday to Friday) fulltime.

Essential

Education:

Candidates pursuing or has completed MS/MSc/MTech/PhD level degree in Computer Science, Electronics Engineering, Data Science, Information Science, Artificial Intelligence, Computer Applications or other closely related technical discipline, will be considered for the internship program.

Must Have

Skills:

Strong understanding of advanced generative techniques including video diffusion models, latent video representations, temporal transformers, and cross-frame attention mechanisms for maintaining temporal consistency.
Familiarity with generative AI architectures such as Diffusion Models, Transformers, and GANs, particularly applied to video synthesis, multimodal alignment (text–audio–visual), video editing, and controllable content generation.
Knowledge of large vision and vision-language models (VLMs) for video understanding and generation, including efficient fine-tuning, optimization, and scalable training for video-based tasks.

Good to have

Skills:

Prior experience with text-to-video, image-to-video, video-to-video translation, or controllable video editing.
Demonstrate research capabilities through relevant publication record in top-tier conferences like CVPR, ECCV, ICCV, AAAI, ACM MM, WACV, NeurIPS.
Strong knowledge and hands-on programming experience with Python and PyTorch, along with relevant ML/DL libraries.

Our Values:
Dreams & Curiosity:
Pioneer the future with dreams and curiosity.
Diversity:
Pursue the creation of the very best by harnessing diversity and varying viewpoints.
Integrity & Sincerity:
Earn the trust for Sony brand through ethical and responsible conduct.
Sustainability:
Fulfil our stakeholder responsibilities through disciplined business practices.

Sony Research India is committed to equal opportunity in all its employment practices, policies and procedures and to ensuring that no worker or potential worker will receive less favourable treatment due to any characteristic protected under applicable local laws.


Increase/decrease your Search Radius (miles)



Job Posting Language