×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineer

Job in Deerfield, Lake County, Illinois, 60063, USA
Listing for: Tata Consultancy Services
Full Time position
Listed on 2026-05-19
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly USD 100000.00 125000.00 YEAR
Job Description & How to Apply Below

Role :
Dev Ops & Site Reliability Lead - Retail Job Description

  • Containers & Orchestration (AKS, Docker)
  • Networking (VNETs, Private Endpoints, Application Gateway, Load Balancers)
  • Proven experience designing enterprise‑grade, highly available cloud platforms
  • Strong understanding of hybrid and multi‑cloud architectures (AWS / GCP exposure preferred)
  • Advanced experience with Azure Dev Ops and CI/CD pipeline architecture
  • Git Ops concepts, branching strategies, release orchestration
  • Site Reliability Engineering (Leadership Level) Ownership of platform reliability, resiliency, and performance Definition and governance of:
  • SLIs, SLOs, SLAs
  • Error budgets and reliability metrics
  • Advanced observability strategy, designing and implementation:
  • Incident response leadership, RCA facilitation, and long‑term remediation planning Experience operating 99.9%–99.99% availability systems
Containers, APIs & Integration
  • Leadership‑level experience with AKS‑based platforms, ingress, and scaling strategies
  • Understanding of microservices, API‑led and event‑driven architectures
  • Familiarity with Azure Integration Services (Service Bus, Event Hub, API Management)
Security, Compliance & Cost
  • Secure cloud design using Key Vault, managed identities, RBAC
  • Act as Lead SRE for Retail platforms, owning reliability and stability outcomes
  • Define and enforce SRE standards, best practices, and operating models
  • Architect and govern highly available, scalable cloud platforms
  • Lead the design and implementation of CI/CD and IaC strategies
  • Establish proactive monitoring, alerting, and incident prevention mechanisms
  • Own major incident leadership, RCA execution, and corrective action tracking
  • Partner with application, security, and architecture teams to build reliability by design
  • Drive automation to reduce toil and improve operational efficiency
  • Mentor and coach SRE and Dev Ops engineers across teams
  • Influence roadmap decisions with a reliability, scalability, and cost lens
#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary