Manager, Cloud Platform Engineering
Listed on 2026-05-04
-
IT/Tech
Systems Engineer, Cloud Computing, SRE/Site Reliability
American Credit Acceptance is seeking a forward-thinking leader to build and operate a modern, AI-enabled Cloud Platform Engineering organization that powers every application, workload, and engineering team in the company.
This role owns the strategy and execution of our AWS cloud platform, Dev Ops, SRE, Infrastructure as Code, and Internal Developer Platform—while introducing practical, responsible use of AI to accelerate engineering, automation, operations, and reliability.
You will lead a team of AWS Architects, Dev Ops Engineers, and Platform Engineers responsible for the design, governance, automation, security, reliability, cost efficiency, and intelligent operation of our AWS ecosystem.
You are not simply maintaining infrastructure. You are:
- The owner of the AWS platform
- The governor of Infrastructure as Code and Git Ops standards
- The builder of an Internal Developer Platform that engineers utilize fully
- The bridge between Engineering, Security, Networking, and Business Operations
- The leader introducing AI-assisted engineering and operations into platform workflows
- The architect of what “next‑generation” Platform Engineering looks like
Your Mission is to
Lead & Elevate the Platform Engineering Culture
Build a culture grounded in ownership, automation, reliability, and engineering excellence.
Introduce AI-assisted development, operational automation, and intelligent troubleshooting into daily engineering practices.
Own the AWS Cloud Platform
Be accountable for the architecture, administration, governance, and operational excellence of our AWS environment including multi‑account strategy, networking, security boundaries, resiliency, observability, and compliance.
Architect the Internal Developer Platform
Create paved roads, reusable patterns, golden paths, and self‑service capabilities that improve developer productivity while enforcing standards for security, compliance, and reliability.
Champion Infrastructure as Code, Git Ops, and Intelligent Automation
Define and enforce enterprise standards for Terraform / Cloud Formation, Git Hub Actions, and Git‑driven infrastructure workflows. Embed AI into CI/CD, testing, code quality, and operational runbooks.
Introduce AI into Platform Operations
Leverage AI for:
- Log and telemetry analysis
- Automated remediation and runbook execution
- Capacity planning and cost optimization insights
- Developer assistance for pipelines, IaC, and cloud patterns
Partner with Finance and Engineering leadership to implement cost visibility, chargeback models, capacity planning, and cloud guardrails that enable fast innovation with responsible consumption.
Standardize Observability, Reliability, and Operations
Drive standards for monitoring, logging, distributed tracing, SRE practices, and operational readiness across all platforms and applications.
Bridge Engineering, Infrastructure, Security, and Networking
Ensure alignment between application engineering teams and the cloud, network, and security architecture required to support them.
Qualifications- Bachelor’s degree in Computer Science, Engineering, or related field.
- 5+ years of leadership experience managing Dev Ops, SRE, Cloud Engineering, or Platform Engineering teams.
- 3+ years of experience owning and operating AWS platforms supporting mission‑critical applications.
- Deep expertise in AWS architecture and administration, including:
- EC2, EKS, ECS, S3, RDS, Dynamo
DB, Lambda - IAM, Organizations, Control Tower, SCPs, KMS, Secrets Manager
- Cloud Watch, Cloud Trail, Config, Guard Duty, Security Hub
- Strong experience implementing and governing Infrastructure as Code using Terraform and/or Cloud Formation.
- Enterprise‑level proficiency administering Git Hub Actions, Git Hub Enterprise, and Git Ops workflows.
- Experience with Kubernetes (EKS), Docker containers, and cluster operations.
- Experience with JFrog Artifactory or similar artifact repositories.
- Strong understanding of TCP/IP, DNS, CDN, HTTP, WAF, OAuth, SAML, SCIM.
- Experience implementing observability platforms across logs, metrics, traces, and infrastructure telemetry.
- Familiarity with static and dynamic code analysis tools such as Sonar Qube, Endor Labs, Stack Hawk, or similar.
- Practical exposure to…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).