Senior Manager, Site Reliability Engineering; SRE - Hybrid - Seattle
Listed on 2026-02-16
IT/Tech
Cloud Computing, Systems Engineer
We’re looking for a strategic and hands‑on Senior Manager of Site Reliability Engineering to lead our SRE team in delivering resilient, scalable, and high‑performing systems. This role is central to our mission of operational excellence and customer satisfaction. You’ll guide a team of talented engineers, champion automation, and collaborate across disciplines to ensure our infrastructure supports business growth and innovation.
A day in the life:
- Lead & Inspire: Build and mentor a high‑performing SRE team. Foster a culture of ownership, innovation, and continuous learning.
- Drive Reliability: Ensure the availability and performance of critical services through proactive monitoring, incident response, and root cause analysis.
- Automate Everything: Reduce manual toil by implementing automation across deployment, recovery, and scaling processes.
- Monitor & Observe: Define and execute observability strategies using New Relic, Splunk, and other tools to detect and resolve issues before they impact users.
- Collaborate & Align: Partner with engineering, product, and operations teams to align reliability goals with business priorities.
- Plan for Scale: Lead capacity planning and performance tuning for services running on AWS EKS and other cloud‑native platforms.
- Measure & Improve: Establish and track SLOs, SLAs, and error budgets. Continuously refine processes to improve system reliability and team efficiency.
Qualifications:
- Experience: 5+ years in SRE, DevOps, or infrastructure engineering, with 2+ years in a leadership role.
- Technical Depth: Expertise in cloud platforms (especially AWS), container orchestration (Kubernetes, EKS), and CI/CD pipelines.
- Programming Skills: Proficiency in Python, Go, or Java.
- Tool Mastery: Hands‑on experience with New Relic, Splunk, and Kubernetes.
- Problem Solver: Strong analytical skills and a passion for root cause analysis and continuous improvement.
- Communicator: Clear, concise, and collaborative communicator who thrives in cross‑functional environments.
- Education: Bachelor’s degree in Computer Science, Engineering, or equivalent experience.
Bonus Points:
Experience with large‑scale distributed systems.
Familiarity with ITIL or similar incident management frameworks.
Cloud certifications (e.g., AWS Solutions Architect, Google Cloud Professional Engineer).
- Medical/Vision, Dental, Retirement and Paid Time Away
- Life Insurance and Disability
- Merchandise Discount and EAP Resources
Applicants with disabilities who require assistance or accommodation should contact the nearest Nordstrom location, which can be identified at
Pay Range Details: $ - $ Annual
This position may be eligible for performance‑based incentives/bonuses. Benefits include 401(k), medical/vision/dental/life/disability insurance options, PTO accruals, holidays, and more.
For Los Angeles or San Francisco applicants:
Nordstrom conducts background checks after conditional offer and considers qualified applicants with criminal histories in compliance with local laws. For additional state and location specific notices, please refer to the Legal Notices on the Nordstrom Careers site.