×
Register Here to Apply for Jobs or Post Jobs. X

AI Research Scientist - Safety Alignment Team

Job in Menlo Park, San Mateo County, California, 94029, USA
Listing for: Meta
Full Time position
Listed on 2026-05-29
Job specializations:
  • IT/Tech
    AI Engineer (Applied/Software), Data Scientist, Machine Learning/ ML Engineer
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below

About the Role

Meta is seeking AI Research Scientists to join the Safety Alignment team within Meta Superintelligence Labs. This team is dedicated to advancing the safe development and deployment of superintelligent AI. The mission is to pioneer robust safety alignment techniques that empower Meta’s most ambitious AI capabilities, ensuring billions of users experience products and services securely and responsibly.

Responsibilities
  • Design, implement, and evaluate novel safety alignment techniques for large language models and multimodal AI systems.
  • Create, curate, and analyze high-quality datasets for safety alignment.
  • Fine-tune and evaluate LLMs to adhere to Meta’s safety policies and evolving global standards.
  • Build scalable infrastructure and tools for safety evaluation, monitoring, and rapid mitigation of emerging risks.
  • Work closely with researchers, engineers, and cross-functional partners to integrate safety alignment into Meta’s products and services.
  • Lead complex technical projects end-to-end.
Minimum Qualifications
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.
  • PhD in Computer Science, Machine Learning, or a relevant technical field.
  • 3+ years of industry research experience in LLM/NLP, computer vision, or related AI/ML model training.
  • Experience as a technical lead on a team and/or leading complex technical projects from end-to-end.
  • Publications at peer-reviewed conferences (e.g., ICLR, NeurIPS, ICML, KDD, CVPR, ICCV, ACL).
  • Programming experience in Python and hands-on experience with frameworks such as Py Torch .
Preferred Qualifications
  • Hands-on experience applying RL techniques (e.g.,
    RLHF, PPO, DPO, GRPO, RLVF, reward modeling
    ) to fine-tune large language models for safety and policy adherence.
  • Experience developing, fine-tuning, or evaluating LLMs across multiple languages and modalities (text, image, voice, video).
  • Demonstrated experience to innovate in safety alignment, including custom guideline enforcement, dynamic policy adaptation, and rapid hotfixing of model vulnerabilities.
  • Experience designing, curating, and evaluating safety datasets, including adversarial and borderline prompt pairs for risk mitigation.
  • Experience with distributed training of LLMs (hundreds/thousands of GPUs), scalable safety mitigations, and automation of safety tooling.
#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary