×
Register Here to Apply for Jobs or Post Jobs. X

Senior Performance Architect

Job in Burlingame, San Mateo County, California, 94012, USA
Listing for: quadric.io, Inc
Part Time position
Listed on 2026-02-23
Job specializations:
  • Engineering
    Software Engineer, Systems Engineer, AI Engineer
Salary/Wage Range or Industry Benchmark: 120000 - 160000 USD Yearly USD 120000.00 160000.00 YEAR
Job Description & How to Apply Below

Quadric has created an innovative general purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code.

As a Senior Performance Architect, you will be the critical link between software and hardware, responsible for understanding how code executes on Quadric s architecture and identifying opportunities for optimization. You will analyze workloads from high-level C++ and Python down through generated assembly to pinpoint performance bottlenecks. This is a hands-on role: beyond analysis, you will prototype solutions yourself - whether that means writing optimized code, modifying compiler passes, or building proof-of-concept implementations to validate proposed fixes before handing off to the appropriate team for productization.

This role requires regular work from the Quadric office in Burlingame, CA, a minimum of 2–3 days per week, with some weeks requiring more days onsite based on business needs. Candidates must be able to commute to the office.

Responsibilities
  • Analyze application performance across the full stack: C++/Python source, compiler output, assembly, and hardware execution
  • Identify and localize performance bottlenecks to specific code regions, assembly sequences, or architectural limitations
  • Implement proof-of-concept fixes and optimizations to validate proposed solutions before broader rollout
  • Develop and maintain profiling infrastructure, benchmarks, and performance regression tests
  • Collaborate with compiler engineers to improve code generation and optimization passes
  • Work with hardware architects to identify microarchitectural improvements and validate performance models
  • Create performance models that predict workload behavior and guide optimization priorities
  • Document findings and communicate performance insights to both technical and non-technical stakeholders
  • Support customer engagements by analyzing their workloads and recommending optimizations
Qualifications
  • BS/MS in Computer Science, Computer Engineering, or Electrical Engineering with 5+ years of performance analysis experience
  • Strong proficiency in C++ and Python; ability to read, reason about, and write optimized code at the assembly level
  • Hands-on mentality: comfortable implementing proof-of-concept solutions, not just identifying problems
  • Deep understanding of computer architecture: pipelines, caches, memory hierarchies, SIMD/vector execution
  • Experience with profiling tools (perf, VTune, custom trace analysis) and performance debugging methodologies
  • Ability to trace performance issues from application behavior down to microarchitectural root causes
  • Strong analytical and problem-solving skills with attention to detail
  • Excellent communication skills; ability to explain complex performance issues to diverse audiences
  • Experience working cross-functionally with compiler, runtime, and hardware teams
Nice to Have
  • Experience with ML/AI workloads and frameworks (PyTorch, Tensor Flow, ONNX)
  • Background in compiler development or code generation
  • Experience with GPU, DSP, or custom accelerator architectures
  • Familiarity with cycle-accurate simulation and performance modeling tools
Expected Outcomes in First 12 Months
  • Establish systematic performance analysis methodology and tooling for Quadric's software stack
  • Identify and drive resolution of top performance bottlenecks in key customer workloads
  • Build performance models that accurately predict workload behavior within 10-15% of actual measurements
  • Become the go-to expert for performance questions spanning the hardware/software boundary
Benefits
  • Competitive salary and meaningful equity
  • Medical, dental, and vision coverage starting on day one
  • 401(k) retirement plan
  • Flexible paid time off (unlimited, non-accrual) to support work-life balance
  • When working in-office, enjoy company-provided lunches and a stocked kitchen
  • Convenient office location within walking distance of the Caltrain station
  • Support for commuting, including monthly parking or Caltrain passes
  • Downtown Burlingame office location, close to shops, cafes, and local amenities
  • A politics-free, highly collaborative environment where talented people can do their best work and make an immediate impact
  • The opportunity to build long-term career relationships in a company that values strong personal connections alongside professional excellence
#J-18808-Ljbffr
Position Requirements
10+ Years work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary