Senior Manager, Site Reliability Engineering; SRE - Hybrid - Seattle Job Seattle area, Washington USA, IT/Tech

Position: Senior Manager, Site Reliability Engineering (SRE) - Hybrid - Seattle

We’re looking for a strategic and hands‑on Senior Manager of Site Reliability Engineering to lead our SRE team in delivering resilient, scalable, and high‑performing systems. This role is central to our mission of operational excellence and customer satisfaction. You’ll guide a team of talented engineers, champion automation, and collaborate across disciplines to ensure our infrastructure supports business growth and innovation.

A day in the life

Lead & Inspire:
Build and mentor a high‑performing SRE team. Foster a culture of ownership, innovation, and continuous learning.
Drive Reliability:
Ensure the availability and performance of critical services through proactive monitoring, incident response, and root cause analysis.
Automate Everything:
Reduce manual toil by implementing automation across deployment, recovery, and scaling processes.
Monitor & Observe:
Define and execute observability strategies using New Relic, Splunk, and other tools to detect and resolve issues before they impact users.
Collaborate & Align:
Partner with engineering, product, and operations teams to align reliability goals with business priorities.
Plan for Scale:
Lead capacity planning and performance tuning for services running on AWS EKS and other cloud‑native platforms.
Measure & Improve:
Establish and track SLOs, SLAs, and error budgets. Continuously refine processes to improve system reliability and team efficiency.

You own this if you have

Experience:
5+ years in SRE, DevOps, or infrastructure engineering, with 2+ years in a leadership role.
Technical Depth:
Expertise in cloud platforms (especially AWS), container orchestration (Kubernetes, EKS), and CI/CD pipelines.
Programming Skills:
Proficiency in Python, Go, or Java.
Tool Mastery:
Hands-on experience with New Relic, Splunk, and Kubernetes.
Problem Solver:
Strong analytical skills and a passion for root cause analysis and continuous improvement.
Communicator:
Clear, concise, and collaborative communicator who thrives in cross-functional environments.
Education:
Bachelor’s degree in Computer Science, Engineering, or equivalent experience.

Bonus Points:

Experience with large‑scale distributed systems.
Familiarity with ITIL or similar incident management frameworks.
Cloud certifications (e.g., AWS Solutions Architect, Google Cloud Professional Engineer).

Benefits

Medical/Vision, Dental, Retirement and Paid Time Away
Life Insurance and Disability
Merchandise Discount and EAP Resources

Applicants with disabilities who require assistance or accommodation should contact the nearest Nordstrom location, which can be identified at

Pay Range Details

$ - $ Annual
This position may be eligible for performance‑based incentives/bonuses. Benefits include 401(k), medical/vision/dental/life/disability insurance options, PTO accruals, holidays, and more.

For Los Angeles or San Francisco applicants:
Nordstrom conducts background checks after conditional offer and considers qualified applicants with criminal histories in compliance with local laws. For additional state and location specific notices, please refer to the Legal Notices on the Nordstrom Careers site.



