Data Scientist - Model Optimization
Job in
Burlingame, San Mateo County, California, 94012, USA
Listed on 2026-06-01
Listing for:
Quadric
Full Time
position Listed on 2026-06-01
Job specializations:
-
Engineering
AI Engineer (Applied/Software) -
IT/Tech
AI Engineer (Applied/Software), Data Scientist, Machine Learning/ ML Engineer
Job Description & How to Apply Below
Role:
You will be joining the data science team focused on model optimization for Quadric's custom GPNPU architecture. You will research, prototype, and implement novel quantization algorithms tailored to our hardware constraints. Beyond applying existing techniques, you'll develop custom low-precision methods that maximize performance on the Chimera GPNPU. Your work will directly shape the quantization capabilities in the Chimera SDK and influence future hardware features.
This California Bay Area based engineering role is intended to be primarily in-office at our Burlingame location, with the ability to commute regularly. We believe strong technical collaboration, rapid iteration, and shared problem-solving are well supported by working together in person. The team and company also gather periodically for onsite meetings and offsite events to connect, collaborate, and align on priorities.
Responsibilities:
* Design statistically rigorous experiments to compare PTQ, QAT, and mixed-precision schemes on vision, language, and multimodal models.
* Implement custom quantization algorithms from scratch, adapting existing techniques or developing novel approaches to match Chimera GPNPU's unique architectural features and numerical formats.
* Build calibration datasets; develop Python notebooks/dashboards to track accuracy, latency, power, and memory trade-offs.
* Perform layer-level error analysis to guide numerical-format choices.
* Partner with compiler team to convert your findings into turnkey SDK flows and reference configs.
* Publish internal white papers, external benchmarks, and present results to customers and at industry events.
* Monitor academic literature in compression and efficient inference; translate promising ideas into reproducible prototypes.
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×