ML Compiler Engineer

San Bruno, United States | Posted on 04/15/2026

Headquartered in Silicon Valley, femtoAI (formerly known as Femtosense) was founded in 2018 by researchers from the Brains in Silicon Lab at Stanford University. Our technology takes inspiration from the principles of neuromorphic computing, such as sparsity, to empower intelligence in everyday devices.

We pioneered a high-performance AI accelerator integrated with an end-to-end embedded AI platform, enabling low-latency operation with less energy at a fraction of the cost. From wearables and household appliances to robotics and autonomous vehicles, femtoAI brings the power of AI to everyday devices.

Job Description

You will work on a custom ML compiler that transforms modern ML and DSP models into highly efficient programs for our accelerator.

This role spans the full compiler stack—from ingesting models and transforming intermediate representations to optimizing execution under tight memory and latency constraints.

What You’ll Do

Build and maintain model ingestion pipelines (e.g., PyTorch / ONNX → internal IR)
Implement graph transformations such as:
  Operator decomposition and canonicalization
  Shape inference and layout transformations
Develop and extend intermediate representations (e.g., MLIR)
Implement optimization passes including:
  Operator fusion and graph partitioning
  Basic scheduling and tiling strategies
  Memory planning and reuse
Debug correctness and numerical issues across transformations
Collaborate with hardware and ML teams to improve system performance

Requirements

2+ years of experience in compilers and/or edge-AI
Proficiency in Python and/or C++
Experience with at least one of the following:
  MLIR, LLVM, TVM, XLA, or similar
  Graph-level transformations or ML model internals
Understanding of deep learning models (conv, sequence models, etc.)
Ability to reason about correctness and performance tradeoffs

Nice to Have

Experience with optimization techniques (tiling, scheduling, memory reuse)
Familiarity with ONNX or PyTorch internals
Exposure to quantization or low-precision computation
Interest in hardware‑aware ML systems

Benefits

401(k)
Medical insurance
Vision insurance
Disability insurance
Paid maternity leave
Paid paternity leave
Child care support

femtoAI is an equal opportunity employer committed to a diverse workforce, and we strive to create an inclusive working environment that empowers everyone to do their best work. We do not discriminate on the basis of race, ethnicity, religion, gender, gender identity, sexual orientation, age, marital status, veteran status, or disability status.
