Senior Director - Reliability Operations
Job in
Coppell, Dallas County, Texas, 75019, USA
Listed on 2026-05-16
Listing for:
GPS Services, Inc.
Full Time
position Listed on 2026-05-16
Job specializations:
-
IT/Tech
IT Project Manager, Systems Engineer, Cloud Computing, IT Support
Job Description & How to Apply Below
This leader drives operational excellence through a proactive reliability strategy that combines process discipline, automation, observability, and real-time insights. They will partner closely with engineering, infrastructure, cybersecurity, and product teams to build and sustain resilient systems that power Gap Inc.’s digital and in-store experiences.
As a thought leader, the Sr. Director will shape the long-term vision for operational reliability and service management—defining modern capabilities, optimizing service performance, and establishing an innovation-driven reliability culture.
What You'll Do Strategic Leadership & Vision Define and execute the enterprise Reliability Operations strategy, ensuring alignment with business objectives and technology roadmaps.
Lead transformation of ITIL functions into agile, data-driven service management capabilities across incident, problem, change, and configuration management.
Partner with senior technology and business leaders to embed reliability and performance metrics into product development and operational planning.
Operational Excellence & Reliability Engineering Lead Site Reliability Engineering (SRE) practices across platforms and services—driving automation, self-healing capabilities, and proactive monitoring to achieve measurable service resiliency improvements.
Establish standards for availability, latency, scalability, and operational efficiency through engineering-driven reliability principles.
Champion reliability by design—ensuring observability, capacity planning, and chaos testing are core to delivery processes.
Mission Control & Live Sight Insights Oversee the Mission Control organization responsible for real-time system monitoring, incident command, and critical event management.
Drive adoption of Live Sight Insights to create predictive and actionable intelligence on service health and performance trends.
Enable enterprise visibility of key metrics through intuitive dashboards and business-impact-based alerting models.
Service Now Governance Ownership Own the Service Now Platform governance strategy and roadmap, ensuring it enables ITIL process excellence, automation, and collaboration on cross-enterprise workflow integration.
Collaborate with product and engineering teams to provide industry best practices for Service Now’s capabilities including IT, HR, Security, and Enterprise Operations.
Lead a platform governance mindset—focusing on reliability, scalability, and ease of use.
People Leadership & Culture Build, inspire, and develop a high-performing global Reliability Operations team that embodies accountability, collaboration, and innovation.
Foster a culture of data-driven decision making, continuous learning, and operational excellence.
Serve as a mentor and coach to emerging leaders—raising the organizational bar for reliability engineering and service leadership.
Cross-Functional Partnership Work closely with Software Engineering, Infrastructure, Cybersecurity, and Business Technology teams to ensure reliability objectives are integrated end-to-end.
Partner with Enterprise Architecture and Program Management to align technology investments with reliability outcomes.
Act as a trusted advisor to executive leadership on reliability strategy, risk posture, and performance health of the enterprise environment.
Who You Are Proven strategic leader with success driving operational transformation at scale in global, complex environments for more than 10 years.
Deep expertise in ITIL frameworks, SRE principles, Service Now platform administration and architecture, and modern observability practices.
Strong technical understanding across infrastructure, cloud operations, automation, and service management ecosystems.
Exceptional ability to influence at all levels—translating technical reliability concepts into business impact and strategic value.
Passionate about developing people and creating a culture of ownership, reliability, and continuous improvement.
Demonstrated track record of leading large, diverse teams and delivering measurable improvements in service reliability, performance, and user satisfaction.
A high performing leader—operating with strategic agility, executive presence, and the ability to build organizational alignment through clarity, accountability, and purpose.
#J-18808-Ljbffr
Position Requirements
10+ Years
work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×