Manager, Platform Engineer
Listed on 2026-02-12
IT/Tech
Systems Engineer, Cloud Computing
Role Summary
In this role, you will serve as the Platform Engineering expert for the pro-code Agent Platform, which supports a diverse group of colleagues across the enterprise. The role requires flexing between engineering the platform itself and hands-on work building Azure solutions and pro-code AI agent applications as Customer Zero of the platform. You will be an engineering and operations leader for the Azure-based Agent Platform, including integration of Microsoft Foundry services to enable secure, scalable, and compliant pro-code AI agents and data applications.
You will design and build a reliable, transparent, interoperable, and sustainable enterprise-grade platform with security, governance, cost management, and lifecycle in mind; guide the roadmap; and mentor engineers while collaborating with architecture, security, and product teams to deliver a best-in-class developer experience. This includes using Infrastructure as Code (Terraform) to provision cloud resources in a consistent, repeatable way, keeping environments highly available and easily reproducible, and ensuring tracing, monitoring, and observability of agents.
You will also integrate Azure’s latest capabilities (including Microsoft Foundry for agentic AI applications) into the platform, helping the organization leverage agentic AI in business solutions.
- Agent Platform Solution Architecture:
Lead design, engineering, security, compliance, release management, governance, financial management, roadmap, and service/operational oversight for the enterprise pro‑code AI agent platform on Azure, including Foundry integrations.
- Engineer & Automate Cloud Infrastructure:
Architect highly available, scalable, and reproducible Azure environments using Infrastructure as Code (Terraform); implement CI/CD pipelines for platform components and configurations.
- Security & Compliance by Design:
Implement and enforce robust controls across identity (RBAC), network (VNets/NSGs), data protection (encryption at rest/in transit), and policy guardrails (Azure Policy), aligned to enterprise and regulatory standards; drive regular audits and hardening.
- Integrate Foundry & AI Services:
Operationalize Azure AI Foundry and related AI/ML services to enable agentic applications; define patterns, SDK and CLI choices, and operational run‑books for development teams.
- Reliability Engineering & Observability:
Establish comprehensive monitoring, logging, tracing, alerting, SLOs/SLAs, and incident response; proactively optimize performance, availability, and cost (e.g., autoscaling, right‑sizing, reservations).
- Multi‑Environment Operations:
Oversee global environments and self‑service tenants; coordinate change governance, lifecycle management, and knowledge management (incident notes, KB articles, run‑books, training curricula).
- Team Leadership & Mentorship:
Coach and mentor engineers; foster a culture of high performance, collaboration, and continuous improvement; provide technical oversight to geographically dispersed colleagues and partners.
- Stakeholder & Vendor Management:
Partner with senior leaders to align solutions to standard patterns; manage relationships with technology suppliers, licenses, and cloud hosting agreements.
- Required:
Bachelor’s degree with four years of relevant experience; OR Master’s degree with two years of relevant experience; OR Associate’s degree with seven years of relevant experience; OR Ph.D. with 0+ years of experience; OR nine years of relevant experience with a high school diploma or equivalent.
- Required:
3+ years in engineering/technical roles operating complex, rapidly evolving platforms/products.
- Required:
Demonstrated hands-on expertise with Microsoft Azure services and architecture: compute (VMs, AKS), storage (Blob/SQL), networking (VNet, NSGs), identity (Service Principals/RBAC), APIs (Azure API Management), web interfaces, and agentic AI (Foundry/Agent 365).
- Required:
Strong Terraform (or equivalent IaC) and CI/CD experience (Azure DevOps/GitHub Actions).
- Required:
Proven ability to implement cloud security best practices and compliance controls; incident resolution of…