Senior Manager, Software Engineering
Listed on 2026-07-02
IT/Tech
SRE/Site Reliability
Senior Manager, Application Software Engineering
United States
Oracle Health AI is building the next generation of intelligent, secure, and resilient healthcare cloud services that improve clinical and operational outcomes for healthcare providers and government agencies.
We are seeking an experienced Senior Manager, Site Reliability Engineering to lead a high‑performing engineering organization responsible for the reliability, performance, security, automation, and operational excellence of Oracle Health AI services supporting Oracle Health Federal customers.
This leader will drive the organization's transformation from traditional operations to a software‑defined, AI‑assisted, and automation‑first operating model. The ideal candidate is an engineering leader with deep experience in cloud‑native platforms, Site Reliability Engineering (SRE), DevOps, and AI‑enabled operational excellence. They will partner across engineering, product, cloud infrastructure, security, compliance, and customer operations to deliver highly available, secure, and scalable services while fostering a culture of innovation, operational excellence, and continuous improvement.
Responsibilities
- Lead and develop a team of software engineers, Site Reliability Engineers (SREs), and technical leaders responsible for the performance, availability, security, reliability, and operational excellence of Oracle Health Federal customer environments.
- Drive the organization's transformation to an SRE‑first and DevOps operating model through adoption of Infrastructure as Code (IaC), Configuration as Code, Policy as Code, GitOps, progressive delivery, automated rollback strategies, canary deployments, self‑healing infrastructure, and measurable operational toil reduction.
- Build AI‑native operational capabilities using Oracle‑approved AI technologies and secure data handling practices to accelerate software development, production support, incident response, troubleshooting, change execution, knowledge retrieval, engineering productivity, and customer operations.
- Eliminate repetitive operational work through software engineering, intelligent automation, AI agents, reusable runbooks, validation frameworks, self‑service platforms, and exception‑based operational workflows.
- Own operational excellence across the complete service lifecycle, including Day 0 platform deployment, Day 1 customer onboarding, and Day 2 production operations, reliability, maintenance, and continuous improvement.
- Establish and continuously improve operational metrics including service availability, reliability, latency, deployment frequency, change success rate, Mean Time to Detect (MTTD), Mean Time to Recover (MTTR), automation coverage, engineering productivity, and customer experience. Use these metrics to prioritize engineering investments and measure organizational progress.
- Partner closely with Product Development, Cloud Infrastructure, Security, Compliance, Customer Success, and Federal stakeholders to ensure AI‑enabled operational workflows meet Oracle security standards and regulatory requirements, including FedRAMP, Authority to Operate (ATO), Department of Defense (DoD), Department of Veterans Affairs (VA), HIPAA, and customer‑specific compliance obligations.
- Champion adoption of Oracle Cloud Infrastructure (OCI) native capabilities for observability, telemetry, monitoring, alerting, distributed tracing, deployment safety, operational analytics, resilience engineering, and continuous service improvement.
- Lead continuous improvements across incident management, problem management, change management, production readiness, service health reviews, operational governance, and customer escalation management while promoting blameless postmortems and continuous learning.
- Recruit, mentor, coach, and develop engineering managers, technical leads, senior engineers, and SREs, building organizational capabilities in cloud‑native engineering, software‑driven operations, AI‑assisted reliability engineering, and operational excellence.
- Collaborate across Oracle Health AI and Oracle Cloud organizations to establish common engineering standards, reusable automation frameworks,…