×
Register Here to Apply for Jobs or Post Jobs. X

AI Research Scientist, Audio-Visual Understanding

Job in New York, New York County, New York, 10261, USA
Listing for: Meta
Full Time position
Listed on 2026-02-16
Job specializations:
  • IT/Tech
    Data Scientist, Artificial Intelligence, AI Engineer, Machine Learning/ ML Engineer
Salary/Wage Range or Industry Benchmark: 154000 - 217000 USD Yearly USD 154000.00 217000.00 YEAR
Job Description & How to Apply Below
Position: AI Research Scientist, Audio-Visual Understanding, FAIR
Location: New York

Summary

Meta is seeking a Research Scientist to join Fundamental AI Research (FAIR), a research organization focused on making significant advances in AI. Our organization is driven by advancing the science of intelligence and developing technology toward achieving superintelligence. We are seeking researchers with experience in computer vision, speech and multimodal learning to join our team and help build the perceptual foundations for real-time embodied conversational agents.

This role offers the opportunity to collaborate with a highly interdisciplinary team of scientists, engineers, and cross‑functional partners, with access to cutting‑edge technology, resources, and research facilities.

Responsibilities
  • Develop joint audio‑visual understanding systems that integrate visual and auditory signals for advanced perception.
  • Build and evaluate audiovisual language models for social interactions and understanding, including predicting social intent, semantic function, and reasoning from human‑centric inputs.
  • Contribute to benchmarks and evaluation frameworks for visual social understanding and interactions.
  • Train and optimize state‑of‑the‑art machine learning and neural network methodologies.
  • Conduct and collaborate on research projects within a globally‑based team.
  • Minimum Qualifications
  • Bachelor's degree in Computer Science, Computer Engineering, a relevant technical field, or equivalent practical experience.
  • A PhD in AI, computer science, data science, or related technical fields.
  • Experience holding an industry, postdoctoral, faculty, or government researcher position.
  • Research background in machine learning, artificial intelligence, computational statistics, or applied mathematics, or related areas.
  • Research publications reflecting experience in theoretical or empirical research.
  • Experience in developing and debugging in Python or similar programming languages.
  • Experience in analyzing and collecting data from various sources.
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment.
  • Preferred Qualifications
  • Demonstrated research and software engineering experience via an internship, work experience, coding competitions, or widely used contributions in open source repositories (e.g., Git Hub).
  • Experience with audio‑visual learning or multimodal fusion techniques.
  • Familiarity with human action recognition, social signal processing, or human‑centric video understanding.
  • Experience with long‑form video understanding, video‑language models, or streaming perception systems.
  • Experience with vision‑language models (VLMs) such as LLaVA, GPT‑4V, Gemini, or similar architectures.
  • Experience with temporal modeling, video transformers, or recurrent architectures for sequential data.
  • Public Compensation

    $154,000/year to $217,000/year + bonus + equity + benefits

    Industry

    Internet

    Equal Opportunity

    Meta is proud to be an Equal Employment Opportunity and affirmative action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

    We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E‑Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.

    Accommodations

    Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations‑

    #J-18808-Ljbffr
    To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
    (If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
     
     
     
    Search for further Jobs Here:
    (Try combinations for better Results! Or enter less keywords for broader Results)
    Location
    Increase/decrease your Search Radius (miles)

    Job Posting Language
    Employment Category
    Education (minimum level)
    Filters
    Education Level
    Experience Level (years)
    Posted in last:
    Salary