Sr. Machine Learning Engineer, Foundation Models - AI, Search & Platforms
Listed on 2026-03-15
-
Software Development
AI Engineer, Machine Learning/ ML Engineer
Overview
Apple | Posted Mar 11 | Full-time
Summary:
Do you think differently, are eager to break the status quo, are bold and ambitious, aren’t afraid to take risks and are passionate to build best-in-class technology? If yes, Apple is the place for you. At Apple, we think differently and push the boundaries of computing and intelligence. We build products that bring smiles to people’s faces. Foundation Model Infrastructure team, within AI, Search & Knowledge Platforms Technologies, is the backbone of Apple Intelligence.
It builds frameworks, services and tools that power the largest Apple foundation models on servers. Our infrastructure powers a wide range of Apple services and products, serving millions of queries every day with low latency and optimized compute. As part of this group, you will have a chance to bring intelligence to billions of users and work on optimizing billions of parameters across language, vision and speech models using state-of-the-art technologies at Apple’s scale.
- Work alongside the Foundation Model Research team to optimize inference for cutting-edge model architectures.
- Collaborate with product teams to build production-grade solutions to launch models serving millions of customers in real time.
- Build tools to understand bottlenecks in inference for different hardware and use cases.
- Mentor and guide engineers in the organization.
- 7+ years of experience leading and driving complex, ambiguous projects.
- Experience with high throughput services, particularly at supercomputing scale.
- Proficient with running applications on Cloud (AWS / Azure or equivalent) using Kubernetes, Docker, etc.
- Familiar with GPU programming concepts using CUDA.
- Familiar with one of the popular ML Frameworks like PyTorch or Tensor Flow.
- Proficient in building and maintaining systems written in modern languages (e.g., Go, Python).
- Familiar with fundamental Deep Learning architectures such as Transformers, Encoder/Decoder models.
- Familiarity with Nvidia Tensor
RT-LLM, vLLM, Deep Speed, Nvidia Triton Server, etc. - Experience writing custom CUDA kernels using CUDA or OpenAI Triton.
Apple base pay is part of the total compensation package and is determined within a range based on skills, qualifications, experience, and location. The base pay range for this role is between $171,600 and $302,200. Apple employees may participate in discretionary employee stock programs, stock purchase plans, and receive comprehensive medical and dental coverage, retirement benefits, product discounts, and access to education-related reimbursement.
The role may be eligible for discretionary bonuses or relocation assistance. Learn more about Apple Benefits.
Note:
Apple benefits, compensation, and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.
Apple accepts applications to this posting on an ongoing basis.
About the companyApple
Be vigilant about potential scams, phishing attempts, or fraudulent activities, and seek credible sources or reviews to assess the trustworthiness of the company. Remember, your personal and financial security is paramount. Support Finity is not responsible for any consequences that may arise from disclosing such information to unauthorized or fraudulent entities.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).