×
Register Here to Apply for Jobs or Post Jobs. X

ASE Observability SRE

Job in Seattle, King County, Washington, 98127, USA
Listing for: Apple
Full Time position
Listed on 2026-02-07
Job specializations:
  • IT/Tech
    Cloud Computing, SRE/Site Reliability
Job Description & How to Apply Below

Summary

People at Apple don’t just build products — they craft the kind of experience that have revolutionized entire industries. The diverse collection of our people and their ideas inspire innovation in everything we do. Imagine what you could do here! Join Apple, and help us leave the world better than we found it. The Apple Service Engineering(ASE) team builds and provides systems and infrastructure that fuel Apple’s services (such as iCloud, iTunes, Siri, and Maps).

We are the foundation on which Apple’s software developers build the products that our customers love. We are looking for passionate and talented Site Reliability Engineers to continue our focus in providing our customers the highest quality Apple Services experience. Our services have to scale globally, stay highly available, and  just work.” If you love designing, engineering and running systems and infrastructure that will help millions of customers, then this is the place for you!

The Cloud Monitoring SRE organization is specifically tasked with enabling other teams to better understand their infrastructure and services, providing world-class observability capabilities. Keeping Apple services up and running 100% of the time is a challenging job. Accurately monitoring the health of every application and infrastructure that comprises the Apple ecosystem 100% of the time is an order of magnitude more challenging.

As a Site Reliability Engineer on the Cloud Monitoring Team at Apple you will be working to improve the reliability and performance of the software systems that provide visibility into the services & infrastructure that runs Apple. Our monitoring, alerting, and visualization platform analyzes billions of metrics per minute and comprises the central nervous system of Apple s architecture. You will work shoulder-to-shoulder with our engineering teams to design and build the next generation of cloud and systems monitoring infrastructure, focusing on automation, availability, performance, and above all efficiency at  reach every user on the planet  scale.

You will dive deep into gnarly operational issues; from the software, systems, automation, and process perspectives. You will understand the challenges around integrating disparate infrastructures into new facilities, processes and procedures.

Description

Apple Services Engineering infrastructure is BIG. Operating at our scale, across multiple geographically dispersed data centers and servicing hundreds of millions of users presents unique challenges. As an SRE at Apple, you ll need to solve these problems using data, teamwork, and your own expertise. SREs at Apple own the full infrastructure stack; from device driver performance debugging to content delivery network traffic management — our responsibilities are both broad and deep.

ASE runs the majority of its systems on Linux. We run a mix of open source, vendor licensed, and internally developed tools to perform functions such as system configuration management, provisioning, software deployment, logging, and monitoring. You ll learn these tools and have opportunities to improve them. Our team is collaborative; we work closely with the development teams we support to deliver the best results for Apple.

We think critically and strive to balance the best solution with the need to get things done for each engineering challenge we face. Good ideas are heard and results are rewarded.

Minimum Qualifications
  • B.S. in Computer Science or a related field.
  • Minimum 4+ years of industry experience.
  • Proven experience developing production-grade software in Python, Go, or Java.
  • Strong sense of ownership and integrity demonstrated through clear communication and collaboration
  • Experience and confidence around incident response and incident management
  • Experience/knowledge in managing and scaling distributed systems in a public, private, or hybrid cloud environment
  • Experience/knowledge with the Prometheus ecosystem
  • Acute drive to automate manual operations and to improve them through repeated iteration
  • Comfortable with Open Source configuration management and orchestration tools (such as Helm, Puppet, and Spinnaker)
  • Familiarity with…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary