×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineer

Job in Deerfield, Lake County, Illinois, 60063, USA
Listing for: Tata Consultancy Service Limited
Full Time position
Listed on 2026-06-05
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing, IT Support, SRE/Site Reliability
Salary/Wage Range or Industry Benchmark: 120000 - 140000 USD Yearly USD 120000.00 140000.00 YEAR
Job Description & How to Apply Below
Must Have Technical/Functional Skills
  • 7+ years of experience in SRE, platform engineering, or cloud infrastructure engineering in large-scale enterprise environments (10,000+ employees or equivalent complexity).
  • Deep, hands-on expertise with Microsoft Azure minimum 4 years in a primary Azure cloud engineering role.
  • Expert-level proficiency with AKS: cluster lifecycle management, RBAC, network policies, pod security standards, cluster autoscaler, and Workload Identity.
  • Strong infrastructure-as-code skills:
    Terraform (required) and/or Bicep; experience managing Azure Landing Zones or Enterprise-Scale architecture.
  • Proficiency in at least one systems programming/scripting language:
    Python (preferred), Go, or Power Shell.
  • Experience designing and operating enterprise observability platforms using Azure Monitor, Log Analytics and Application Insights at scale.
  • Demonstrable track record of owning SLOs/SLIs and delivering measurable reliability improvements in production.
  • Strong knowledge of enterprise networking in Azure:
    Hub-and-Spoke/Virtual WAN, Express Route, Azure Firewall, NSGs, Private Endpoints, and DNS Private Zones.
Required/Preferred Certifications:
  • AZ-104 | AZ-305 (Preferred) | AZ-400 (Preferred) | CKA | ITIL v4 Foundation
Roles & ResponsibilitiesReliability & Availability Engineering
  • Define, own, and enforce enterprise-wide SLOs, SLIs, and Error Budgets across all Tier-0 and Tier-1 Azure-hosted services; report SLA compliance to executive stakeholders monthly.
  • Lead architectural reviews for new services and ensure relia bility non-functionals (availability targets, RTO/RPO) are embedded from design through to production.
  • Champion and implement chaos engineering practices using Azure Chaos Studio and custom fault injection frameworks to proactively surface reliability risks.
  • Drive Disaster Recovery (DR) design and conduct quarterly DR drills across Azure paired regions. Incident Management & On-Call
  • Serve as Incident Commander for P1/P2 major incidents, own end-to-end incident lifecycle from detection through resolution and Post-Incident Review (PIR).
  • Participate in a structured On-Call rotation with follow-the-sun global coverage; maintain response SLAs of
  • Drive blameless post-mortem culture and ensure all action items from PIRs are tracked and delivered within agreed SLA.
Observability & Platform Engineering
  • Design and operate the enterprise observability stack:
    Azure Monitor, Log Analytics Work spaces, Application Insights, and Azure Managed Grafana; ensure full MELT (Metrics, Events, Logs, Traces) coverage.
  • Build and maintain alerting frameworks using Azure Monitor Alert Rules and Azure Action Groups integrated with Pager Duty and Service Now.
  • Develop and operate platform automation, runbooks, and self-healing capabilities using Azure Automation, Logic Apps, and Python/Power Shell scripting.
CI/CD & Infrastructure Reliability
  • Collaborate with Dev Ops and development teams to embed reliability gates into Azure Dev Ops pipelines ; automated performance testing, synthetic monitoring, and progressive deployment (canary/blue-green) strategies.
  • Manage reliability of AKS clusters across multiple Azure regions, own node pool scaling, upgrade strategy and cluster hardening in alignment with CIS Benchmarks.
  • Contribute to infrastructure-as-code reliability reviews using Terraform/Bicep to enforce standards across Azure Landing Zones.
TCS Employee Benefits Summary:
  • Discretionary Annual Incentive.
  • Comprehensive Medical Coverage:
    Medical & Health, Dental & Vision, Disability Planning & Insurance, Pet Insurance Plans.
  • Family Support:
    Maternal & Parental Leaves.
  • Insurance Options:
    Auto & Home Insurance, Identity Theft Protection.
  • Convenience & Professional Growth:
    Commuter Benefits & Certification & Training Reimbursement.
  • Time Off:
    Vacation, Time Off, Sick Leave & Holidays.
  • Legal & Financial Assistance:
    Legal Assistance, 401K Plan, Performance Bonus, College Fund, Student Loan Refinancing.
#LI-RJ2 Salary Range-$120000-$140,000 a year
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary