×
Register Here to Apply for Jobs or Post Jobs. X

AI Accuracy Architect

Job in San Diego, San Diego County, California, 92189, USA
Listing for: Nutanix
Full Time position
Listed on 2026-06-29
Job specializations:
  • IT/Tech
    AI Engineer (Applied/Software), Systems Engineer
  • Engineering
    AI Engineer (Applied/Software), Systems Engineer
Salary/Wage Range or Industry Benchmark: 158400 - 237600 USD Yearly USD 158400.00 237600.00 YEAR
Job Description & How to Apply Below
Company:

Qualcomm Technologies, Inc.

Job Area:

Engineering Group, Engineering Group  Machine Learning Engineering General

Summary:

Qualcomm is leveraging its leadership in compute, connectivity, and AI acceleration to play a central role in the evolution of Cloud AI. The Qualcomm Cloud AI team develops hardware and software platforms enabling efficient, high quality inference of large scale foundation models.

We are seeking a Staff Engineer – AI Accuracy Architect to lead accuracy centric architecture and optimization for LLMs, VLMs, and emerging multimodal models, working closely with compiler, performance, and model optimization teams. This role spans Day0 hardware enablement through production deployment and requires deep expertise in quantization, numerics, and accuracy–performance tradeoffs across the inference stack.

This is a senior technical role with broad cross-functional impact.

- Key Responsibilities Own accuracy architecture for LLM, VLM, and multimodal inference, balancing model quality, performance, power, and hardware constraints.

Lead Day0 enablement of cutting edge models on current and future Qualcomm AI platforms in partnership with compiler, performance, firmware, and silicon teams.

Design, implement, and evaluate quantization strategies (e.g., PTQ, QAT, mixed precision, per-channel/group-wise), understanding their impact on accuracy, latency, throughput, and memory.

Analyze and resolve accuracy regressions and numerical stability issues across kernels, compilers, runtimes, and hardware.

Partner with performance engineers to co-optimize kernels and execution strategies where accuracy and performance intersect.

Drive model conversion, optimization, and deployment using PyTorch and ONNX, with accuracy validation as a first class requirement.

Define accuracy evaluation metrics and tooling to track regressions and improvements over time.

Serve as a technical authority and mentor on accuracy, quantization, and numerics across teams.

Engage with customers and partners to debug complex accuracy issues and deliver production ready solutions.

- Required Qualifications Extensive hands on experience with LLMs and/or VLMs in production or preproduction environments.

Expert level understanding of quantization and numerics, including precision tradeoffs and accumulation behavior.

Deep knowledge of transformer architectures, attention mechanisms, and MoEs.Proven ability to balance accuracy, performance, and hardware constraints.

Experience across compiler, kernel, and hardware abstraction layers.

Strong Python skills and ability to scale accuracy experiments.

Solid foundation in computer architecture and ML accelerators.

Strong technical leadership and communication skills.

MS in CS, CE, EE, or related field, or equivalent experience.

- Preferred Qualifications

PhD in a related field.

Experience with ML compilers and torch.compile

Background in numerical methods, linear algebra, and accuracy evaluation frameworks.

Minimum Qualifications:

Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 4+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.

ORMaster's degree in Computer Science, Engineering, Information Systems, or related field and 3+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.

ORPhD in Computer Science, Engineering, Information Systems, or related field and 2+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.

Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disabili or call Qualcomm's toll-free number found here . Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process.

Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary