×
Register Here to Apply for Jobs or Post Jobs. X

Research Intern, Vision Foundation Model and Generative AI

Job in Gaithersburg, Montgomery County, Maryland, 20883, USA
Listing for: SwiftCruit
Apprenticeship/Internship position
Listed on 2026-06-30
Job specializations:
  • Research/Development
    AI Business & Operations
Salary/Wage Range or Industry Benchmark: 50 USD Hourly USD 50.00 HOUR
Job Description & How to Apply Below

Sony AI America, a branch of Sony AI, is a remotely distributed organization spread across the U.S. and Canada. Sony AI is Sony’s new research organization pursuing the mission to use AI to unleash human creativity. Sony AI works closely with Sony’s other business units, including Sony Interactive Entertainment LLC., Sony Pictures Entertainment Inc., and Sony Music Entertainment. With some 900 million Sony devices in hands and homes worldwide today, a vast array of Sony movies, television shows and music, and the Play Station Network, Sony creates and delivers more entertainment experiences to more people than anyone else on earth.

Research

Intern – Multimodal Foundation Model for Vision

Sony AI is seeking research interns to join our team. The team focuses on fundamental and applied research with a focus on building next‑generation foundation models for vision in a responsible manner. The role of a research intern is to develop efficient and effective methodologies and prototype solutions. You will work with a productive team of world‑class scientists and engineers to tackle the most challenging problems in foundation models and generative AI, including low‑cost yet powerful vision foundation models (VFM), vision‑language models (VLM), unified models, automatic model compression, optimization and deployment on cloud and edge.

Your work will be published in papers and contribute to improving the experience of billions of customers.

Roles and Responsibilities
  • Conduct fundamental and innovative development in low‑cost yet powerful vision‑language models (VLM), unified models, automatic model compression, optimization and deployment on cloud and edge.
  • Design or implement state‑of‑the‑art techniques on model compression, inference speed‑up, deployment on hardware, tool automation.
  • PoC for various vision+text, generation relevant tasks (VQA, captioning, understanding, etc.) and hardwares.
  • Contribute to library and tool development to support business; or publish influential research in top‑tier conferences and journals.
Required Qualifications and Skills
  • Currently has, or is in the process of obtaining, a master/PhD degree in computer science or related field.
  • Very self‑motivated and capable of proposing and implementing innovative ideas.
  • Solid presentation and communication skills to internal and external audiences.
  • Publications or expertise in compact foundation model development and deployment; influential open‑source projects or paper publication at top conferences such as CVPR, ICCV, ECCV, NeurIPS, ICML, ACL, etc.
  • Better to have front‑end development experience.
  • Solid coding skills in Python, PyTorch, etc.
Working Location

Location flexible (Tokyo, Europe, US)

The target hourly rate for this internship is $50.00 per hour. The individual will be paid hourly and eligible for overtime.

All qualified applicants will receive consideration for employment without regard to any basis protected by applicable federal, state, or local law, ordinance, or regulation.

#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary