×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineering Lead

Job in Nottingham, Nottinghamshire, NG1, England, UK
Listing for: Experian
Full Time position
Listed on 2026-02-15
Job specializations:
  • IT/Tech
    Cloud Computing, SRE/Site Reliability
Salary/Wage Range or Industry Benchmark: 100000 - 125000 GBP Yearly GBP 100000.00 125000.00 YEAR
Job Description & How to Apply Below

Company Description

Experian is a global data and technology company, powering opportunities for people and businesses around the world. We help to redefine lending practices, uncover and prevent fraud, simplify healthcare, create marketing solutions, and gain deeper insights into the automotive market, all using our unique combination of data, analytics and software. We also assist millions of people to realize their financial goals and help them save time and money.

We operate across a range of markets, from financial services to healthcare, automotive, agribusiness, insurance, and many more industry segments.

We invest in people and new advanced technologies to unlock the power of data. As a FTSE 100 Index company listed on the London Stock Exchange (EXPN), we have a team of 22,500 people across 32 countries. Our corporate headquarters are in Dublin, Ireland. Learn more at

Job Description

We are looking for an enthusiastic SRE Lead to work in Project Spring at the forefront of our cloud modernisation, within our Credit & Verification Services. This is a hybrid role requiring travelling to Hyderabad office 40% times per month.

Background

This is an incredibly exciting time for the Experian UKI Region, as we look to build our presence in the UK and Hyderabad and work on a technology transformation to meet our aspiration to significantly scale our business over the next five years. This an opportunity to join Credit & Verification Services on this journey and be part of a collaborative team that uses Agile Dev Sec Ops  principles to deliver business value.

The

Domain

As a member of the Project Spring team within Credit and Verification Services, you’ll be part of a forward-thinking delivery group at the forefront of transforming how credit information is accessed in the UK. We’re leading the charge in moving the Experian UK credit report ecosystem to the cloud—modernizing legacy systems and unlocking new possibilities for data-driven insights.

Project Spring team thrives on collaboration, curiosity, and a shared passion for solving complex problems with elegant, scalable technology. If you’re excited by the idea of shaping the future of financial data in the cloud, you’ll feel right at home here.

Role Context

As the SRE Lead, you will own the reliability strategy for mission-critical systems and lead a team of engineers to ensure high availability, scalability, and performance. You will combine technical expertise with leadership skills to drive operational excellence and foster a culture of reliability across engineering teams.

Key Responsibilities
  • Leadership & Strategy
    • Define and implement SRE best practices across the organization.
    • Proven expertise in production support, resilience engineering, disaster recovery (DCR), automation, and cloud operations
    • Mentor and guide a team of SREs, fostering growth and technical excellence.
    • Collaborate with senior stakeholders to align reliability goals with business objectives.
  • Reliability & Performance
    • Establish SLIs, SLOs, and SLAs for critical services and ensure adherence.
    • Drive initiatives to improve system resilience and reduce operational toil.
    • Excellent in designing systems that detect and remediate issues without manual intervention – Self Healing systems, Runbook automation
    • Exposure to tools like Gremlin, Chaos Monkey, AWS FIS to simulate outages and improve fault tolerance
  • Incident Management
    • Act as the primary point of escalation for critical production issues and lead major incident response, root cause analysis, and postmortems.
    • Perform detailed post-incident investigations to identify underlying causes. Document findings and share learnings to prevent recurrence.
    • Implement preventive measures and continuous improvement processes.
  • Observability
    • Champion monitoring, logging, and alerting strategies using tools like Prometheus, Grafana, ELK, and AWS Cloud Watch.
    • Build real-time dashboards to visualize system health and reliability metrics.
    • Configure intelligent alerting based on anomaly detection and thresholds.
    • Combine metrics, logs, and traces to enable root cause analysis and reduce Mean Time to Resolution (MTTR).
    • Knowledge of AIOps or ML-based anomaly detection fo…
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary