Machine Learning Platform Engineer, Apple Services Engineering

Job in Seattle, King County, Washington, 98127, USA
Listing for: Apple Inc.
Full Time position
Listed on 2026-05-21
Job specializations:
  • Software Development
    Software Engineer, Cloud Engineer - Software, Python, DevOps
Salary/Wage Range or Industry Benchmark: 125,000 - 150,000 USD yearly
Job Description & How to Apply Below

Seattle, Washington, United States Software and Services

We're building the evaluation platform that will serve all of Apple's generative AI and agent systems. Evaluating non‑deterministic AI systems is one of the hardest unsolved problems in production ML, and one Apple has to get right. We're building the platform that makes it tractable for every team here. This is a hands‑on engineering role with a lot of autonomy.

You'll write a lot of Python and own meaningful pieces of the platform end‑to‑end. You'll be partnering closely with research engineers, model and serving teams, product and feature teams, and the infra and data platform groups this work integrates with.

Responsibilities
  • Build and ship:
    Take ownership of features and services within the evaluation platform: APIs, SDKs, orchestration components, evaluation runners. You'll have the room to make calls on your own work and the support to deliver it well.
  • Productionize ML research:
    Partner with research engineers to take their prototype code and turn it into reliable services. You'll learn their world quickly and translate research patterns into clean Python that holds up under real load.
  • Move fast, responsibly:
    You'll get scoped problems with room to figure out the how. We trust you to balance speed with care, to know when something needs a quick prototype and when it needs a design doc, tests, and a careful rollout.
  • Improve as you go:
    Notice the rough edges and pick them up. The flaky test, the slow build, the confusing API, the runbook that's out of date. We want someone who leaves the codebase a little better every week.
  • Developer experience:
    Help build the SDKs and abstractions that other Apple teams use to evaluate their models and agents. You'll feel the friction of bad ergonomics directly, which puts you in a great position to fix it.
  • Operational ownership:
    Your code runs in production. You write the tests, set up the CI, add the metrics, and stay close when something breaks. You don't need to be an SRE, but you take care of what you ship.

Minimum Qualifications
  • 4-8 years of software engineering experience building and shipping production services.
  • Strong Python. You're fluent with FastAPI, Pydantic, and the modern Python ecosystem. You write code that’s clean, tested, and easy for the next person to pick up.
  • Builder's mindset. You enjoy shipping. You're comfortable iterating quickly on scoped problems and knowing when to slow down for the parts that need it.
  • Fluency with AI coding tools. You actively use tools like Claude Code (or equivalents) in your day-to-day workflow, including features like skills, slash commands, and agent-style workflows. You have a good intuition for when to lean on them, when to steer them, and how to get high-quality output.
  • Familiarity with the agentic LLM landscape. You stay…