More jobs:
Senior SRE - Azure
Job in
Alpharetta, Fulton County, Georgia, 30239, USA
Listed on 2026-02-16
Listing for:
Compunnel, Inc.
Full Time
position Listed on 2026-02-16
Job specializations:
-
IT/Tech
Cloud Computing, SRE/Site Reliability
Job Description & How to Apply Below
The Senior Site Reliability Engineer (Azure) will help build, maintain, and scale cloud-native infrastructure in a fast-paced, collaborative environment.
This role works closely with development and operations teams to ensure systems are reliable, efficient, automated, and secure.
Responsibilities include designing Azure cloud environments, managing Kubernetes clusters, implementing Infrastructure-as-Code through Terraform/Terragrunt, improving CI/CD pipelines, enhancing system observability, and providing support for on-call and incident response activities.
Key Responsibilities- Design, implement, and maintain Azure cloud infrastructure using best practices for scalability and reliability.
- Manage and optimize Kubernetes clusters (preferably AKS) and containerized workloads.
- Build and maintain Infrastructure-as-Code solutions using Terraform and Terragrunt.
- Develop, maintain, and enhance CI/CD pipelines using Git Hub Workflows/Actions and ArgoCD.
- Support Databricks environments and associated cloud integrations.
- Implement and improve observability using tools such as Grafana, Prometheus, Loki, and Tempo.
- Automate operational tasks to improve efficiency, reduce manual work, and enhance reliability.
- Participate in on-call rotations, incident response, root-cause analysis, and remediation activities.
- Collaborate with developers to improve application performance, reliability, and adherence to SRE practices such as SLIs and SLOs.
- Identify opportunities for cost optimization, performance improvements, and infrastructure security enhancements.
- Minimum 4 years of experience in Site Reliability Engineering, Dev Ops, or cloud infrastructure roles.
- Strong hands‑on experience with Azure cloud services.
- Proficiency with Java and Infrastructure-as-Code tools including Terraform and Terragrunt.
- Strong experience with Kubernetes (preferably AKS) and container orchestration.
- Experience working with Databricks in production environments.
- Proficiency with CI/CD tooling, especially Git Hub Workflows/Actions and ArgoCD.
- Strong understanding of observability tooling, including Grafana (Prometheus, Loki, Tempo preferred).
- Ability to collaborate in cross‑functional environments and communicate effectively.
- Master’s degree in Computer Science or a related field.
Position Requirements
10+ Years
work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×