Senior Azure Platform Engineer
Job in Jersey City, Hudson County, New Jersey, 07390, USA
Listed on 2026-06-13
Listing for: Proviniti
Full Time position
Job specializations:
- IT/Tech
Cloud Computing: Infrastructure & Operations, SRE/Site Reliability, IT Infrastructure, Cybersecurity
Job Description & How to Apply Below
We are placing a Senior Azure Platform Engineer with a centralized Platform Engineering team at one of the world's largest financial institutions. The team owns the internal Azure platform that application teams across the bank depend on — and they're in a critical phase, scaling toward enterprise‑wide General Availability while expanding into multi‑cloud.
You'll build, harden, and operate real production infrastructure. You'll own incidents, run root cause analysis, and improve the platform so the same failure doesn't happen twice.
What you'll do
- Own Terraform-based infrastructure across multi‑environment and multi‑subscription Azure setups - state management, drift detection, remediation, and module design
- Operate AKS clusters in production - pod and node troubleshooting, scaling, ingress issues, cluster upgrades, and incident response
- Implement and enforce Azure Policy at management group and subscription scope, including deny and audit effects and active remediation
- Design and maintain platform security controls: Managed Identity, RBAC at control and data plane, Key Vault, Entra, and secure service‑to‑service communication
- Own production incidents end‑to‑end - triage, root cause analysis, resolution, and prevention
- Build observability into the platform: logging strategy, alerting, container monitoring, and AKS diagnostic tooling
- Partner with application teams on platform onboarding, automation patterns, and best practices
- Contribute to platform hardening and standardization as the platform scales to support more teams
- Terraform - Remote state management, state locking, drift detection and remediation, multi‑environment module design, recovery and import scenarios. This is the highest‑priority bar.
- AKS / Kubernetes - Production cluster operations — pod and node troubleshooting, resource exhaustion, ingress, rollbacks, cluster upgrades. You need operational stories, not theory.
- Azure Policy & governance - Hands‑on policy enforcement (deny, audit, modify) at management group and subscription scope. Remediation tasks and compliance reporting.
- Security & identity - Managed Identity (system vs. user‑assigned), RBAC at control and data plane levels, Key Vault, Entra, JWT/OAuth, secure inter‑service communication.
- Observability & RCA - Log Analytics, Azure Monitor, Prometheus, Grafana, Splunk, or ELK. Full incident triage from symptom to resolution to prevention.
- API Management - Policy configuration: throttling, rate limiting, auth enforcement, request/response transformations, observability integration.
Position Requirements
10+ years of work experience