×
Register Here to Apply for Jobs or Post Jobs. X

AI Performance Engineer; Cloud AI Engineering), Sr Sr

Job in San Diego, San Diego County, California, 92189, USA
Listing for: Qualcomm
Full Time position
Listed on 2025-12-01
Job specializations:
  • Engineering
    AI Engineer, Systems Engineer, Software Engineer, Computer Science
Job Description & How to Apply Below
Position: AI Performance Engineer (Cloud AI Engineering), Sr | Staff | Sr. Staff

3 days ago Be among the first 25 applicants

Company

Qualcomm Technologies, Inc.

Job Area

Engineering Group, Engineering Group >
Machine Learning Engineering

General Summary

AI Performance Engineer (Cloud AI Engineering)
Qualcomm is utilizing its traditional strengths in digital wireless technologies to play a central role in the evolution of Cloud AI. We are investing in several supporting technologies including Deep Learning. The Qualcomm Cloud AI team is developing hardware and software solutions for Inference Acceleration.

We are hiring an AI Performance Engineers at multiple levels to join our dynamic, collaborative team. This role spans the full product lifecycle—from cutting-edge research and development to commercial deployment—and demands strategic thinking, strong execution, and excellent communication skills.

This Role Involves The Following Activities

  • Convert, optimize and deploy models for efficient inference using PyTorch, ONNX.
  • Work at the forefront of GenAI by understanding advanced algorithms (e.g. attention mechanisms, MoEs) and numerics to identify new optimization opportunities.
  • Performance analysis and optimization of LLM, VLM, and diffusion models for inference. Scale performance for throughput and latency constraints.
  • Mapping the next generation AI workloads on top of current and future hardware designs.
  • Work closely with customers to drive solutions by collaborating with internal compiler, firmware and platform teams.
  • Analyze complex performance or stability issues to work towards final root cause of underlying problems.
  • Create engineering solutions to deliver continuous insights into performance of AI workloads guiding the improvements over time.
  • Design and implement high-level kernels, e.g. in Triton, with a focus on generating efficient, low-level code.
Candidates For This Position Will Demonstrate The Following
  • Hands-on experience in building and optimizing language models, notably in PyTorch, ONNX, preferably in production-grade environments.
  • Deep understanding of transformer architectures, attention mechanisms and performance trade-offs.
  • Experience in workload mapping strategies exhibiting sharding or various parallelisms.
  • Strong Python programming skills.
  • Proactive learning about the latest inference optimization techniques.
  • Understanding of computer architecture, ML accelerators, in-memory processing and distributed systems.
  • Strong communication, problem-solving skills and ability to learn and work effectively in a fast-paced and collaborative environment.
  • MS in Computer Science, Machine Learning, Computer Engineering or Electrical Engineering.
Bonus Skills
  • Background in neural network operators and mathematical operations, including linear algebra and math libraries.
  • Understanding of machine learning compilers.
  • Experience in converging accuracy and its evaluation methods.
  • Knowledge of torch.compile or torch

    Dynamo.
  • PhD in Computer Science, Computer Engineering or Machine Learning
Minimum Qualifications
  • Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 6+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
  • OR Master’s degree in Computer Science, Engineering, Information Systems, or related field and 5+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
  • OR PhD in Computer Science, Engineering, Information Systems, or related field and 4+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.

Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disabili or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process.

Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations…

To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary