×
Register Here to Apply for Jobs or Post Jobs. X

Head – Site Reliability Engineering; SRE

Job in 400001, Mumbai, Maharashtra, India
Listing for: TSS Consultancy Pvt Ltd
Full Time position
Listed on 2026-02-14
Job specializations:
  • IT/Tech
    Cloud Computing, SRE/Site Reliability
Job Description & How to Apply Below
Position: Head – Site Reliability Engineering (SRE)
We are seeking a highly experienced Head of Site Reliability Engineering (SRE) to lead the reliability, scalability, performance and availability of our production systems. This role blends deep software engineering expertise with infrastructure and operations leadership, focusing on automation, observability, incident management and continuous improvement.

You will build and lead a world-class SRE and Platform Engineering organization, ensuring exceptional platform reliability while enabling rapid product innovation in a B2B SaaS environment.

Key Responsibilities

Strategic Leadership & Team Scaling

- Lead and scale a high-performing SRE and Platform Engineering organization.
- Manage managers and senior engineers, fostering a culture of technical excellence and ownership.
- Define and execute the long-term SRE roadmap aligned with business growth and enterprise client needs.
- Champion a blameless post-mortem culture, driving systemic improvements over individual fault.

Reliability & Performance Engineering

- Establish, monitor, and enforce SLIs, SLOs, and SLAs aligned with enterprise B2B expectations.
- Implement and manage error budgets to balance platform stability with release velocity.
- Act as a key stakeholder in architecture reviews to ensure high availability and horizontal scalability.

Outage & Incident Management

- Design and oversee a robust 24/7 Incident Command System.
- Lead major incident responses and ensure effective crisis management.
- Conduct deep root-cause analysis (RCA) and post-incident reviews.
- Drive corrective and preventive actions to continuously improve reliability.
- Communicate clearly with executive leadership and client-facing teams during critical incidents.

Observability & Infrastructure

- Evolve monitoring into a full observability platform (metrics, logs, tracing).
- Ensure 100% infrastructure automation using Infrastructure as Code (IaC) tools.
- Manage production environments across cloud, hybrid, and on-prem setups.
- Support containerized platforms using Docker and Kubernetes.
- Collaborate closely with Dev Ops, Platform, and Infrastructure teams.

Security & Compliance

- Embed security and reliability best practices across systems.
- Support audits and regulatory compliance requirements (ISO 27001, SOC, SEBI, RBI, PCI-DSS, etc., as applicable).
- Implement strong access controls, logging, and audit trails.

Required Qualifications

- 10+ years of experience in Infrastructure, Dev Ops, or SRE roles.
- 3+ years in a senior leadership role managing large engineering teams.
- Proven experience in a B2B SaaS environment with contractual uptime commitments.
- Strong expertise in Kubernetes and container orchestration at scale.
- Deep knowledge of cloud platforms (AWS, Azure, or GCP).
- Solid background in distributed systems, networking, and load balancing (DNS, TLS, BGP).
- Proficiency in Python or Java for automation and tooling.
- Hands-on experience with monitoring and observability tools (Prometheus, Grafana, ELK, Datadog, etc.).
- Strong understanding of databases, caching, and distributed system concepts.
- Demonstrated ability to lead teams calmly and effectively during high-pressure incidents.

Preferred Certifications

- Cloud Certifications (AWS / Azure / GCP)
- Kubernetes Certifications (CKA / CKAD)
- SRE / Dev Ops Certifications
- ITIL Foundation
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary