×
Register Here to Apply for Jobs or Post Jobs. X

GPU Architect, Platform Architecture

Job in Cupertino, Santa Clara County, California, 95014, USA
Listing for: Apple Inc.
Full Time position
Listed on 2026-02-24
Job specializations:
  • IT/Tech
    AI Engineer, Machine Learning/ ML Engineer
Salary/Wage Range or Industry Benchmark: 150000 - 200000 USD Yearly USD 150000.00 200000.00 YEAR
Job Description & How to Apply Below

Imagine what you could do here! At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. Dynamic, inquisitive people and inspiring, innovative technologies are the norm here. The people who work here have reinvented entire industries with all Apple Hardware products.

The same passion for innovation that goes into our products also applies to our practices strengthening our commitment to leave the world better than we found it!

Description

The Platform Architecture GPU group is looking for a talented GPU Architect to join the Neural Accelerator effort with strong skills in performance analysis and development at the level of ML frameworks and lower-level kernel implementations.

Responsibilities
  • Analyze the performance of linear algebra and machine learning algorithms on Apple GPU platforms, pursuing investigations wherever they take you in Apple software.
  • With our partner teams in Software Engineering and Hardware Technologies, formulate system-level strategies to address performance problems and unlock the next level of AI performance for our users.
  • Work closely with MLX, MPS and CoreML teams on real ML use-cases in an end-to-end co-design effort, from early design exploration up to product launch.
Minimum Qualifications
  • Experience with software and hardware performance analysis and optimization
  • Experience in GPU programming models such as Metal, CUDA, or similar
  • Experience with ML frameworks, for example MLX, Pytorch, or similar
Preferred Qualifications
  • MS or PhD in Computer Science, Electrical Engineering, or equivalent
  • 20+ years of relevant industry experience
  • Experience working specifically in CUDA C++ on ML and/or linear algebra algorithms
  • Experience optimizing LLM inference for low latency at the implementation level
  • Experience optimizing LLM inference at scale in the cloud or datacenter
  • Ability to communicate across both hardware and software organizations

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant .

Apple accepts applications to this posting on an ongoing basis.

#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary