More jobs:
Job Description & How to Apply Below
Job Mode :
Full time with Mphasis (Client Location - UAE - Contract renewal basis)
LOOKING FOR IMMEDIATE JOINERS
Responsibilities- Ensure reliability, availability, and performance of services running across Azure/AKS and on-premises air-gapped Kubernetes (RKE2) environments, meeting strict SLAs and business requirements.
- Maintain scalable, resilient, and secure Kubernetes platforms, including ingress, storage, and stateful workloads.
- Automate operations and deployments using scripting (Python, Go, Bash), infrastructure-as-code (Terraform, Bicep, Ansible), and Git Ops with ArgoCD and Kustomize across both cloud and on-prem environments.
- Operate CI/CD pipelines Azure Dev Ops /Github Actions and manage container supply chains for both connected and air-gapped environments, including private registry mirroring and image scanning.
- Monitor system performance using Azure Monitor, Prometheus, Grafana, and Open Telemetry; proactively detect and resolve issues to prevent disruption.
- Lead incident response, perform root cause analysis, and drive post-incident reviews with permanent fixes and improvements.
- Develop, document, and enforce best practices for operations, security, and compliance across cloud and on-prem environments.
- Collaborate with development, security, and operations teams to enhance system design and support modern application platforms (Docker, Kubernetes).
- Participate in on-call rotations to respond to critical incidents across all environments.
- Use IT Service Management tools or incident, change, and problem management.
- Working knowledge of Scrum, ITIL, Agile methodologies and experience interfacing with external auditors.
- Bachelor's degree in Computer Science, Engineering, or related field.
- Minimum 10 years as a Site Reliability Engineer, with significant expertise in Azure cloud environments.
- Strong knowledge of Azure cloud services, networking, and security.
- Hands-on experience with both managed (AKS) and self-managed/air-gapped (Rancher RKE2 or equivalent) Kubernetes distributions.
- Proficiency in scripting languages (Python, Go, Bash) and infrastructure-as-code tools (Terraform, Bicep, Ansible).
- Experience with Git Ops (ArgoCD, Kustomize), CI/CD pipelines, Docker, and Kubernetes for deployment automation in connected and disconnected environments.
- Hands-on experience with monitoring tools (Azure Monitor, Prometheus, Grafana).
- Proven track record in incident management and troubleshooting.
- Excellent problem-solving, communication, and collaboration skills; attention to detail and a commitment to continuous learning.
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×