Senior Manager - SRE - RunOps - PBM-IT
Listed on 2026-05-31
-
IT/Tech
Systems Engineer, IT Project Manager, IT Support, Cloud Computing
We're building a world of health around every individual - shaping a more connected, convenient and compassionate health experience. At CVS Health®, you'll be surrounded by passionate colleagues who care deeply, innovate with purpose, hold ourselves accountable and prioritize safety and quality in everything we do. Join us and be part of something bigger - helping to simplify health care one person, one family and one community at a time.
PositionSummary
PBM-IT Service Delivery - Senior Manager is a highly visible role within the Member & Client Services Engineering (MCSE) group at CVS Health supporting enterprise‑level, mission critical services. These services include applications used directly by CVS Health members or by call center agents to provide world‑class member experience, call center technology applications, services provided by 3rd party call center vendors, clinical & benefit coding applications, Clinical Prior Authorization systems, etc.
The Senior Manager leverages proven leadership skills to manage a team of technical vendor resources maintaining applications central to the operations of CVS Health. The Senior Manager leads strategic decision making, influences change, and drives results.
The Senior Service Delivery Manager is proficient in all critical functions for the applications assigned and is expected to champion SDLC & ITIL methods and best practices on the team to ensure that quality standards are met. The role includes being accountable for maintaining the production application ecosystem, working with managed services vendor closely on achieving application SLAs, stability, partnering with core infrastructure & application development teams to resolve defects, execute defect elimination plans and identify actions to ensure overall service stability.
The role includes being accountable for working closely with managed services vendor on production maintenance, incident management, problem management, and identifying opportunities for continuous improvement for the applications assigned. The Senior Manager is proficient with all the platforms assigned and is expected to be a platform leader with responsibility and accountability for the tactical and strategic directions of the production platforms.
The manager contributes to the development and design of technical solutions adhering to all established methodologies, standards and guidelines. The candidate must have a sharp focus on platform stability, quality of service, continuous improvement and risk mitigation. The ideal candidate can influence behavior to achieve the right outcomes to support the business.
Responsibilities SRE Strategy & Reliability Leadership- Execute the enterprise SRE strategy
, including SLOs, SLIs, error budgets
, and reliability roadmaps. - Establish reliability standards and practices across the applications, backend services, APIs, data platforms, and AI workloads
. - Drive a culture of reliability‑by‑design and operational excellence across engineering teams.
- Lead adoption of AIOps capabilities for proactive issue detection, alert noise reduction, and predictive failure prevention.
- Implement AI‑assisted incident triage
, automated runbooks, root‑cause analysis, and self‑healing systems. - Partner with the AI Platform team to integrate LLMs and ML models into operational workflows (log summarization, anomaly detection, remediation).
- Own enterprise observability strategy across metrics, logs, traces, and user experience monitoring
. - Standardize tooling and practices using platforms such as AppD, Dyna Trace, Datadog, Splunk, Prometheus, Grafana, Open Telemetry
. - Deliver real‑time dashboards and executive reporting on uptime, performance, latency, and error budgets.
- Lead enterprise incident response, escalation, and post‑incident learning (blameless postmortems).
- Reduce MTTR, MTTD
, and incident frequency through automation and preventive engineering. - Establish runbooks, on‑call models, and operational readiness reviews.
- Build, mentor, and lead global SRE teams & managers.
- Define…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).