×
Register Here to Apply for Jobs or Post Jobs. X

AI Research Scientist, Computer Vision – Facebook Video Intelligence

Job in Menlo Park, San Mateo County, California, 94029, USA
Listing for: Meta
Full Time position
Listed on 2025-12-31
Job specializations:
  • IT/Tech
    Computer Science, Artificial Intelligence, AI Engineer, Data Scientist
Salary/Wage Range or Industry Benchmark: 200000 - 250000 USD Yearly USD 200000.00 250000.00 YEAR
Job Description & How to Apply Below

Video Intelligence Research Scientist, Computer Vision – Facebook

The Video Intelligence team is an applied AI research team within the Facebook pillar. This role involves developing advanced video generation and understanding foundation models, enabling innovative AI‑driven video creation experiences and enhancing our ability to comprehend video content.

Responsibilities
  • Build multimodal foundation models such as text‑to‑video, image‑to‑video, video understanding, and unified native video generative models.
  • Design core foundation model architectures and conduct progressive pre‑training.
  • Post‑train foundation models using techniques such as supervised fine‑tuning (SFT), reinforcement learning from human feedback (RLHF), direct preference optimization (DPO), and low‑rank adaptation (LoRA).
  • Conduct research to develop state‑of‑the‑art GenAI models for the Facebook family of apps.
Minimum Qualifications
  • Currently has or is pursuing a bachelor’s degree in Computer Science, Computer Engineering, or a relevant technical field; degree must be completed before employment.
  • PhD in Computer Science, Machine Learning, or a relevant field is preferred.
  • Experience training multimodal, computer vision, or large language models.
  • Programming experience in Python and frameworks such as PyTorch.
  • Must obtain and maintain work authorization.
Preferred Qualifications
  • First‑authored publications at peer‑reviewed conferences (e.g., ICLR, NeurIPS, ICML, KDD, CVPR, ICCV, ACL).
  • Experience collaborating across product, engineering, and research teams.
  • Experience building text‑to‑video, image‑to‑video, video understanding, and/or unified native video generative models.
Compensation

117,000 $ US/year to 173,000 $ US/year + bonus + equity + benefits.

Equal Employment Opportunity

Meta is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics.

Meta is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, please let us know.

About Meta

Meta builds technologies that help people connect, find communities, and grow businesses. Today we are shifting from 2D screens to immersive experiences in augmented and virtual reality.

#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary