LLM Inference KV Cache Architect

Job in San Jose, Santa Clara County, California, 95199, USA
Listing for: ByteDance
Full Time position
Listed on 2026-06-17
Job specializations:
  • Software Development
    AI Engineer (Applied/Software)
Salary/Wage Range or Industry Benchmark: 80,000 - 100,000 USD yearly
Job Description & How to Apply Below

ByteDance is seeking a systems researcher or engineer in San Jose, California, to design and maintain high-performance KV caching for large language models (LLMs). The role focuses on making model serving more efficient, improving latency and throughput, and optimizing the use of attention key-value states in collaborative AI systems.

The ideal candidate holds a PhD in a relevant field and has a strong background in distributed systems, memory management, and performance optimization. Join us to work on cutting-edge AI technology and contribute to impactful projects.
