Enterprise Infrastructure Mgr; Enterprise Capacity & Performance Engineering
Listed on 2026-07-04
IT/Tech
SRE/Site Reliability, Unix/Linux
Enterprise Infrastructure Manager (Enterprise Capacity & Performance Engineering)
You will be based in Pittsburgh, PA; Cleveland, OH; Dallas, TX; Denver, CO; Birmingham, AL; or Phoenix, AZ.
The manager of Capacity Management is responsible for establishing and executing PNC’s enterprise-wide capacity, performance, and optimization strategy across infrastructure, databases, and application platforms. This role requires a highly technical, hands‑on leader who can operate across Windows, Linux, database platforms, and JVM‑based applications, with the ability to analyze system behavior, identify performance risks, and direct remediation efforts across engineering teams. The manager functions as both an enterprise strategist and technical authority, ensuring infrastructure scalability, performance stability, and cost efficiency, while guiding teams to resolve complex cross‑stack performance issues.
Key Responsibilities
- Drive end‑to‑end capacity planning and performance engineering across infrastructure and application layers
- Provide hands‑on technical leadership and escalation support across OS, database, and application stacks
- Identify capacity risks, performance bottlenecks, and system saturation trends, ensuring proactive mitigation
- Drive cost optimization (FinOps) through right‑sizing and efficient workload placement
- Lead root cause analysis and prevention strategies for recurring incidents
- Partner with Architecture, SRE, DevOps, and Application teams to influence design and scalability decisions
- Build and lead a high‑performing team, ensuring technical depth and execution excellence
- Communicate risks, insights, and recommendations to executive leadership
- Act as a cross‑stack technical authority, capable of analyzing and correlating performance issues across:
  - Operating Systems (Windows, Linux, Unix): CPU utilization vs saturation, memory pressure, paging/swapping, NUMA alignment, disk I/O bottlenecks
  - Database Platforms: Oracle (AWR, wait events, tuning), SQL/SQL Server, MongoDB (working set, sharding, query efficiency)
  - JVM‑Based Applications: heap utilization, GC tuning (G1GC, CMS), memory leaks, object churn, thread contention
- Identify issues such as resource contention, inefficient queries, and JVM misconfiguration
- Serve as a central escalation authority, determining root cause domain and routing to appropriate teams
- Provide clear, actionable technical direction for remediation without requiring direct execution
Experience across infrastructure, capacity management, performance engineering, or SRE, with 3+ years of management experience.
Proven capability as a hands‑on technical leader across multiple stacks.
Deep expertise in:
- Windows and Linux operating systems
- JVM performance analysis and tuning
- Distributed and large‑scale systems
Strong experience with:
- Observability platforms (vROps, Turbonomic, Dynatrace, Prometheus, etc.)
Familiarity with ITIL, SRE, and DevOps frameworks.
Proven leadership, stakeholder management, and executive communication skills.