
Senior Data Scientist - LLMs, RAG & Multimodal AI (Remote)

Remote / Online - Candidates ideally in
San Francisco, San Francisco County, California, 94199, USA
Listing for: Proximity Works
Remote/Work from Home position
Listed on 2026-01-02
Job specializations:
  • IT/Tech
    AI Engineer, Machine Learning/ML Engineer
Salary/Wage Range or Industry Benchmark: USD 250,000 per year
Job Description & How to Apply Below
Position: Senior Data Scientist - LLMs, RAG & Multimodal AI (Remote | Immediate joiner)

Overview

Join Proximity Works, one of the world’s most ambitious AI technology companies, shaping the future of Sports, Media, and Entertainment. Since 2019, Proximity Works has created and scaled AI-driven products used by 697 million daily users, generating $73.5 billion in enterprise value for our partners. With headquarters in San Francisco and offices in Los Angeles, Dubai, Mumbai, and Bangalore, we partner with some of the biggest global brands to solve complex problems with cutting-edge AI.

Role Summary

This is a hands-on applied science role at the frontier of AI. You will design, fine-tune, and optimize large-scale language and multimodal models, with a strong focus on retrieval and search. You will productionize retrieval-augmented pipelines, develop ranking and relevance techniques, and define robust evaluation frameworks. You will work closely with engineering and product teams to build systems that combine language, vision, and retrieval modalities, powering high-quality, real-world search and discovery experiences at scale.

What You’ll Do
  • Design, fine-tune, and optimize LLMs for applied multimodal generation use cases.
  • Build and productionize RAG pipelines that combine embedding-based search, metadata filtering, and LLM-driven re-ranking/summarization.
  • Apply prompt engineering, RAG techniques, and model distillation to improve grounding, reduce hallucinations, and ensure output reliability.
  • Define and implement evaluation metrics across semantic search (nDCG, Recall@K, MRR) and generation quality (grounding accuracy, hallucination rate).
  • Optimize inference pipelines for latency-sensitive use cases, using strategies like token budgeting and prompt compression to meet sub-100ms response targets.
  • Train and adapt models via transfer learning, LoRA/QLoRA, and checkpoint reloading, ensuring robust deployment in production environments.
  • Collaborate with product and research teams to explore innovative multimodal integrations for user-facing applications.
What Success Looks Like
  • Deployment of production-ready LLM + RAG pipelines powering global-scale search and discovery applications.
  • Demonstrable improvements in grounding accuracy and hallucination reduction across deployed systems.
  • Consistent delivery of sub-100ms inference latency for generation workloads.
  • Adoption of rigorous evaluation metrics that drive continuous model improvement.
  • Effective cross-functional collaboration with engineering, product, and research teams.
What You’ll Need
  • Strong background in NLP, machine learning, and multimodal AI.
  • Proven hands-on experience in LLM fine-tuning, RAG, distillation, and evaluation of foundation models.
  • Expertise in semantic search and retrieval pipelines (e.g., FAISS, Weaviate, Vespa, Pinecone).
  • Demonstrated ability to deploy models at scale, including distributed inference setups.
  • Solid understanding of evaluation frameworks for ranking, retrieval, and generation.
  • Proficiency in Python, PyTorch/TensorFlow, and modern ML toolkits.
  • Experience in multimodal AI (bridging text, vision, or speech with LLMs).
  • Track record of shipping latency-sensitive AI products.
  • Strong communication skills and the ability to collaborate with cross-functional global teams.
Success Traits
  • Builder’s mindset
  • High ownership
  • Analytical clarity
  • Collaborative spirit
  • Global mindset
  • Growth orientation

Why Join Proximity Works
  • Work directly on frontier AI problems with some of the world’s largest sports, media, and entertainment brands.
  • Be part of a global-first, high-performance engineering culture.
  • Competitive compensation aligned with global markets, with remote-first flexibility.
  • Annual global off-sites with Proxonauts from San Francisco, Dubai, India, and beyond.
  • High autonomy, direct accountability, and the opportunity to ship AI systems at scale.
Position Requirements
10+ years of work experience