Software Engineer, Agentic Evaluation
Listed on 2026-06-05
-
Software Development
Software Engineer, Machine Learning/ ML Engineer
Cupertino, California, United States Machine Learning and AI
We're a team at Apple building software that helps shape the next generation of Siri and AI-powered experiences. The work spans frameworks, tooling, and infrastructure — including a strong focus on how we evaluate and measure the quality of what we ship. We can't say much about specifics, but the problems are new, the surface area is large, and the reach is enormous.
We're a collaborative, humble, and curious group that learns from each other and builds together.
You'll work alongside engineers, designers, and researchers to design and build software end-to-end — from early prototypes to production systems running on real devices. You'll have meaningful autonomy in how you get there, and the opportunity to shape both what we build and how we know it's working. The work is hard enough to stretch you, and the team is generous enough to support you while you grow.
Responsibilities- Designing and building software that powers new Siri and AI experiences
- Prototyping quickly to explore what works — then making it real
- Partnering closely with ML, Design, and other engineering teams to shape features, not just implement specs
- Contributing to architecture decisions and helping set the technical bar for the team
- Investing in the quality, reliability, and evaluability of what you ship — through testing, tooling, and infrastructure the team can trust
- 3+ years of software engineering experience with strong CS fundamentals
- Proficiency in Swift, Objective-C, Python, or another modern language — strong engineers in adjacent stacks will pick up the rest
- You've shipped software that people used, and you're ready to own bigger pieces end-to-end
- Expert in using generative AI models for coding — you've integrated tools like Claude, Cursor, or Codex deeply into how you work, and have a point of view on where they help and where they don't
- An interest in software evaluation and quality — you care about whether what you build actually works, and want to be on a team that takes measurement seriously
- Comfortable with ambiguity; when you're stuck, you dig in
- Strong communication and a track record of working well across teams
- BS in Computer Science or equivalent experience
- Experience in one or more iOS/macOS domains: system services, UI frameworks, concurrent application architecture, or performance
- Familiarity with how AI systems are evaluated — offline eval, human eval, A/B, or model-graded approaches
- Proficiency with one or more scripting languages (Python, Ruby, Bash)
- You seek out feedback and learn fast from those around you
- Close to the frontier — curious about new models and techniques, and have a point of view on where human-AI interaction is headed
- Base pay range: $147,400 – $272,100, depending on skills, qualifications, experience, and location
- Apple employees can become shareholders through discretionary employee stock programs, purchase stock at a discount through Employee Stock Purchase Plan, and receive other benefit programs
- Benefits include comprehensive medical and dental coverage, retirement benefits, discounted products and free services, and tuition reimbursement for formal education related to career advancement
- Role may be eligible for discretionary bonuses, commission payments, and relocation assistance
Apple is an equal‑opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).