Senior Manager, Cloud & DevOps Engineering
Listed on 2026-05-18
-
IT/Tech
IT Project Manager, Systems Engineer
We were tired of hearing that healthcare is broken, so we decided to do something about it. At Nomi Health, we believe the care itself isn’t broken — it’s the business of healthcare that gets in the way. Every year, more than $1 trillion is wasted on paperwork, delays, and middle layers that drive up costs and keep people from the care they need.
We’re rebuilding the system so it works the way it should: clear prices you can trust, faster payments that keep providers focused on patients, and data that helps employers make better decisions. Our work has already touched more than 30 million lives — from local communities in Michigan to some of the largest companies in the country.
We are looking for a talented and passionate Senior Manager of Cloud and Dev Ops Engineering to join our team. You will own the day-to-day operation of our AWS and Kubernetes infrastructure across multiple business units and lead a team that delivers reliably against a roadmap set in partnership with senior technical leadership.
You will report to and partner closely with the VP of Technical Operations and Automation, who serves as the architecture lead for Dev Ops. The Senior Manager owns execution, team delivery, and operational excellence—you’ll stay close enough to the work to review a Terraform PR, debug a production issue, and coach your engineers through hard problems, while architecture direction and cross‑org technical strategy live with the VP.
Dev Ops operates as a platform team: we provide and operate the infrastructure surface the rest of the company builds on, and application and data teams own what runs on top of it. You’ll be responsible for the platform meeting its specifications—uptime, security, throughput, access—but not for the business logic of what moves through it.
How you will make an impact- Lead by example through hands‑on technical contributions (80%) while supporting team performance, mentorship, and delivery outcomes (20%).
- Run day‑to‑day operations of AWS across multiple accounts and environments—VPC, Transit Gateway, EC2, RDS, S3, IAM, EKS, ECR, ELB/NLB, Route 53, Transfer Family, and Lambda.
- Operate our Kubernetes platform in production: EKS clusters, Git Ops via ArgoCD, Helm, and supporting controllers (NGINX ingress, external‑secrets, external‑dns, Kyverno, Datadog Operator).
- Maintain and extend our infrastructure‑as‑code footprint—Terraform modules, Terraform Cloud, pipeline hygiene, and review practices that keep production safe from unintended changes.
- Operate our secure file‑transfer platform (SFTP / SFTPGo / AWS Transfer Family) to the specifications set by the business—uptime, access, encryption, and key management.
- Own observability and Fin Ops execution—Datadog monitors, dashboards, log ingestion budgets and exclusion filters, Cloud Cost Management, and AWS Cost Anomaly Detection.
- Drive release engineering and production deployment practices—go‑live runbooks, release coordination, and post‑mortem follow‑through.
- Partner with Security and Compliance to execute against SOC 2 and HITRUST audits, credential rotation, CVE monitoring and remediation, SIEM integration, pentest environment provisioning, and third‑party access (VPN, Okta/Entra, Zscaler).
- Provide and operate the infrastructure underneath internal AI and automation tooling (n8n, kagent, agent‑gateway, internal AI platform AWS account) so those teams can build on a stable surface.
- Execute infrastructure‑layer provisioning and teardown for client onboarding and termination—accounts, access, and credentials.
- Manage, mentor, and grow a team of cloud and Dev Ops engineers; own sprint planning, on‑call health, and delivery against the roadmap set with the VP of Technical Operations and Automation.
- BS / MS in Computer Science or Engineering, or equivalent hands‑on experience.
- 7+ years of infrastructure engineering experience overall, with 3+ years leading or managing a Dev Ops, SRE, or Cloud Platform team.
- A track record of reliably delivering against a roadmap—you’re excited by making the trains run on time and making your team more effective, and you’re energized by executing well within a defined architectural…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).