Site Reliability Engineer II
Listed on 2026-02-07
-
IT/Tech
Cloud Computing, Systems Engineer
Title:
Site Reliability Engineer II
Duration: 12 Months (Likely extensions)
Location: Alpharetta, GA (Hybrid: 3 days onsite per week)
Note:
Due to compliance and PII‑access requirements, the candidate must undergo in‑person verification during onboarding at one of these Spectraforce locations:
Dallas, Raleigh‑Durham, Atlanta, Chicago, Kansas City, Minneapolis, or Tampa.
Client is seeking a skilled Site Reliability Engineer to join our team and help build, maintain, and scale our cloud‑native infrastructure. You will work closely with development and operations teams to ensure our systems are reliable, scalable, and efficient. The ideal candidate is passionate about automation, observability, and infrastructure‑as‑code, and thrives in a collaborative, fast‑paced environment.
Key Responsibilities- Design, implement, and manage cloud infrastructure on Azure using Terraform and Terragrunt.
- Maintain and optimize Kubernetes clusters on Azure Kubernetes Service (AKS).
- Build and manage CI/CD pipelines using Git Hub Actions/Workflows and ArgoCD for Git Ops deployments.
- Enhance system reliability by implementing monitoring, alerting, and observability solutions with Grafana.
- Automate operational tasks to reduce toil and improve team efficiency.
- Participate in on‑call rotations, incident response, and post‑mortem analysis.
- Collaborate with development teams to improve application performance, scalability, and resilience.
- Implement and advocate for SRE best practices, including SLIs, SLOs, and error budgets.
- Continuously improve system performance, cost efficiency, and security.
Skills & Qualifications
- 3+ years of experience in an SRE, Dev Ops, or cloud infrastructure role.
- Strong experience with Azure cloud services and infrastructure.
- Hands‑on experience with java and, Terrafor,m and Terragrunt for infrastructure‑as‑code.
- Proficiency with Kubernetes (preferably AKS), Databric,ks and container orchestration.
- Experience with CI/CD tools, especially Git Hub Workflows/Actions and ArgoCD.
- Solid understanding of observability tools like Grafana (Prometheus, Loki, Tempo experience is a plus).
Bachelor's degree required, Masters preferred
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).