
Senior Director, Applied Research

Job in San Francisco, San Francisco County, California, 94199, USA
Listing for: Capital One
Full Time, Part Time position
Listed on 2026-06-04
Job specializations:
  • IT/Tech
    Data Scientist, Artificial Intelligence
Salary/Wage Range or Industry Benchmark: 150,000 - 200,000 USD yearly
Job Description & How to Apply Below
* PhD in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields plus 6 years of experience in Applied Research or M.S. in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields plus 8 years of experience in Applied Research
* At least 5 years of people leadership experience
* PhD focus on NLP or Master's with 10 years of industrial NLP research experience
* Core contributor to a team that has trained a large language model from scratch (10B+ parameters, 500B+ tokens)
* Numerous publications at ACL, NAACL, EMNLP, NeurIPS, ICML, or ICLR on topics related to the pre-training of large language models (e.g. technical reports of pre-trained LLMs, SSL techniques, model pre-training optimization)
* Has worked on an LLM (open source or commercial) that is currently available for use
* Demonstrated ability to guide the technical direction of a large-scale model training team
* Experience working with 500+ node clusters of GPUs
* Has worked on an LLM scaled to 70B parameters and 1T+ tokens
* Experience with common training optimization frameworks (DeepSpeed, NeMo)
* PhD focus on topics in geometric deep learning (Graph Neural Networks, Sequential Models, Multivariate Time Series)
* Member of technical leadership for model deployment for a very large user behavior model
* Multiple papers on topics relevant to training models on graph and sequential data structures at KDD, ICML, NeurIPS, ICLR
* Worked on scaling graph models to greater than 50M nodes
* Experience with large-scale deep-learning-based recommender systems
* Experience with production real-time and streaming environments
* Contributions to common open source frameworks (PyTorch Geometric, DGL)
* Proposed new methods for inference or representation learning on graphs or sequences
* Worked with datasets of 100M+ users
* PhD focused on topics related to optimizing training of very large language models
* 5+ years of experience and/or publications on one of the following topics: Model Sparsification, Quantization, Training Parallelism/Partitioning Design, Gradient Checkpointing, Model Compression
* PhD focused on topics related to guiding LLMs with further tasks (Supervised Fine-Tuning, Instruction Tuning, Dialogue Fine-Tuning, Parameter Tuning)
* Demonstrated knowledge of principles of transfer learning, model adaptation and model guidance
* Experience deploying a fine-tuned large language model

Capital One offers a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being. Learn more at the . Eligibility varies based on full or part-time status, exempt or non-exempt status, and management level.
Position Requirements
10+ Years work experience