×
Register Here to Apply for Jobs or Post Jobs. X

Sr. Director, Platform Engineering

Job in Coppell, Dallas County, Texas, 75019, USA
Listing for: Gap
Full Time position
Listed on 2026-05-25
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer
Job Description & How to Apply Below
About the Role The Senior Director, Platform Engineering leads the strategy, development, and operations of our enterprise Cloud Platform across Azure and GCP. This role owns the Continuous Delivery Platform built on Kubernetes, drives Fin Ops discipline to optimize cloud spend, and ensures teams can build, ship, and run software with speed and confidence. You will shape the modern Dev Ops tech stack and champion infrastructure automation through Terraform, enabling engineering teams across the organization to deliver r leadership will directly impact developer productivity, system reliability, and the pace of innovation.

What You'll DoCLOUD PLATFORM STRATEGY & ENGINEERINGDefine and execute the multi-cloud platform strategy across Azure and GCP, ensuring architectural consistency, security, and scalability.

Lead the design and evolution of shared platform services — networking, identity, compute, storage, and observability — as self-service capabilities for product engineering teams.

Own the API gateway and service mesh layer (Istio), enabling secure, observable, and resilient service-to-service communication across the platform.

Evaluate emerging cloud-native technologies and make build-vs-buy decisions that balance innovation with operational sustainability.

Partner with Security, Architecture, and Compliance teams to embed governance and policy-as-code into every layer of the platform.

CONTINUOUS DELIVERY & DEVELOPER EXPERIENCEOwn and evolve the enterprise Continuous Delivery Platform built on Kubernetes, enabling teams to build, test, ship, and schedule workloads reliably.

Drive adoption of modern CI/CD pipelines, container orchestration, Git Ops workflows, and progressive delivery practices (canary, blue-green, feature flags).Champion developer experience (Dev Ex) as a first-class product — reducing friction from code commit to production through self-service tooling, golden paths, and internal developer portals.

Establish and track platform adoption metrics (deployment frequency, lead time, change failure rate, MTTR) aligned with DORA benchmarks.

INFRASTRUCTURE AUTOMATION & DEVOPS EXCELLENCELead the infrastructure-as-code practice using Terraform, ensuring all cloud resources are provisioned, versioned, and managed through automated, repeatable pipelines.

Drive the Dev Ops culture and toolchain strategy — standardizing on modern practices for configuration management, secrets management, service mesh, and policy enforcement.

Build and maintain reusable Terraform modules, landing zones, and account/project vending solutions that accelerate onboarding of new workloads and teams.

Ensure infrastructure changes flow through the same CI/CD rigor as application code, with automated testing, drift detection, and compliance checks.

OBSERVABILITY, MONITORING & EVENT STREAMINGDefine and own the enterprise observability strategy — ensuring comprehensive monitoring, logging, tracing, and alerting across all platform services and application workloads.

Lead the implementation and standardization of monitoring tool chains (e.g., Prometheus, Grafana, Datadog, Azure Monitor, Google Cloud Operations Suite) to provide real-time visibility into system health and performance.

Own the platform's event streaming and messaging infrastructure built on Apache Kafka, enabling reliable, high-throughput, real-time data pipelines across the organization.

Establish SLIs, SLOs, and error budgets as the foundation for reliability decisions, partnering with product engineering teams to drive a culture of proactive incident prevention.

Ensure distributed tracing and service dependency mapping are in place across the Istio service mesh, enabling rapid root cause analysis during incidents.

API GATEWAY & SERVICE MESH MANAGEMENTLead the strategy and operations of the Istio service mesh and API gateway layer, providing traffic management, mutual TLS, rate limiting, and fine-grained access control across microservices.

Define and enforce API lifecycle management standards — versioning, deprecation policies, schema governance, and developer documentation.

Partner with application teams to optimize service-to-service communication patterns,…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary