×
Register Here to Apply for Jobs or Post Jobs. X

Research Engineer- Language Models

Job in Palo Alto, Santa Clara County, California, 94306, USA
Listing for: Fastino
Full Time position
Listed on 2026-02-16
Job specializations:
  • IT/Tech
    AI Engineer, Machine Learning/ ML Engineer, Data Scientist, Artificial Intelligence
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below
Position: Research Engineer- Large Language Models

Full-time | Remote (UK) with trips to Silicon Valley office | Reports to Founders

Introduction
  • Join us at Fastino as we build the next generation of LLMs. Our team, boasting alumni from Google Research, Apple, Stanford, and Cambridge is on a mission to develop specialized, efficient AI.
  • Fastino's GLiNER family of open source models has been downloaded more than 5 million times and is used by companies such as NVIDIA, Meta, and Airbnb
  • Fastino has raised $25M (as featured in Tech Crunch) through our seed round and is backed by leading investors including Microsoft, Khosla Ventures, Insight Partners, Github CEO Thomas Dohmke, Docker CEO Scott Johnston, and others.
What You’ll Work On
  • Experiment with novel language model architectures, helping drive and execute Fastino s research roadmap
  • Optimize Fastino’s multimodal models to improve response quality, instruction adherence, and overall performance metrics
  • Architect data processing pipelines, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories
  • Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards
  • Build robust and real-world motivated evaluations
  • Partner with Fastino engineering team to ship model updates directly to customers
  • Establish best practices for code health and documentation on the team, to facilitate collaboration and reliable development
What We’re Looking For
  • Required - Great velocity for building and shipping agents / AI products.
  • Optional - Advanced degree (Master s or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies
  • Optional - Demonstrated ability to do independent research in Academic or Industry settings
  • Optional - Substantial industry experience in large-scale deep learning model training, with demonstrated expertise in at least one of Large Language Models, Vision-Language Models, Diffusion Models, or comparable generative AI architectures
  • Optional - Comprehensive technical proficiency and practical experience with leading deep learning frameworks, including advanced competency in one of PyTorch, JAX, Tensor Flow, or equivalent platforms for model development and optimization
#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary