Senior Site Reliability Engineer; SRE
Listed on 2026-05-15
-
IT/Tech
SRE/Site Reliability, Cloud Computing, Systems Engineer, IT Support
Company Description
Experian is a global data and technology company, powering opportunities for people and businesses around the world. We help to redefine lending practices, uncover and prevent fraud, simplify healthcare, create marketing solutions, and gain deeper insights into the automotive market, all using our unique combination of data, analytics and software. We also assist millions of people to realize their financial goals and help them save time and money.
Job DescriptionWe are looking for a Site Reliability Engineer (SRE) to improve the reliability, and performance of business‑critical systems. You will focus on AWS cloud infrastructure, Dev Ops tooling, and core SRE practices within a distributed, production environment. Reporting to our Lead, you will work with development, platform, and operations teams to ensure systems are stable, scalable, well‑monitored and meet defined reliability targets.
LocationThis is a hybrid remote/in‑office role.
Main Responsibilities Reliability and Operations- Support high availability, scalability and performance of production systems
- Work with defined SLIs, SLOs and SLAs, ensuring services meet agreed reliability targets
- Identify and reduce operational toil through automation and process improvement
- Contribute to the design and implementation of fault‑tolerant and resilient systems
- Participate in resilience and failure testing activities to validate system behaviour under fault conditions and improve recovery
- Manage and operate systems hosted on AWS (EC2, EKS/ECS, RDS, S3, Lambda, Cloud Watch, IAM, and VPC)
- Support cloud deployments and infrastructure changes following best practices
- Help with backup, disaster recovery and resiliency planning
- Work with CI/CD pipelines and Dev Ops practices to ensure reliable and repeatable deployments, including build, test and release automation processes
- Use Infrastructure as Code tools such as Terraform or Cloud Formation to manage and provision infrastructure
- Develop automation using scripting languages (Python, Bash or similar) to reduce operational toil and improve efficiency
- Participate in production incident response, troubleshooting, and service restoration
- Perform root cause analysis (RCA) and contribute to post‑incident reviews
- Help implement preventive actions to avoid incident recurrence
- Configure and maintain monitoring, logging, and alerting using tools like Cloud Watch, Prometheus, Grafana, Splunk, or Dynatrace
- Develop dashboards to track system and platform health and reliability metrics across the user journey
- Improve alert quality to reduce noise and improve response times
- Work with application and engineering teams to embed reliability into system design
- Collaborate within a globally distributed team, using clear handovers to ensure continuity
- Share knowledge and contribute to team‑wide best practices
- Communicate with all kinds of stakeholders, influencing decisions through reliability‑focused insights
- Experience in production support, Dev Ops, SRE, cloud operations, or systems engineering
Cloud Expertise - Hands‑on experience with AWS cloud services, including compute, container and serverless workloads
- Practical experience with CI/CD pipelines and Dev Ops practices, including Git‑based version control, pull request workflows, code reviews, and deployment automation
- Experience with SRE principles, monitoring, and reliability engineering practices
- Proficiency in scripting (Python, Bash, or similar) for automation and operational tooling
- Experience with Linux systems and troubleshooting production issues
- Exposure to data platforms and data pipelines
- Understanding of data reliability concepts
- Experience supporting or operating complex distributed systems
- Hybrid working
- Great compensation and discretionary bonus
- Core benefits include pension, Bupa healthcare, Sharesave scheme and more
- 25 days annual leave with 8 bank holidays and 3 volunteering days. You can purchase additional annual leave.
Experian is proud to be an Equal Opportunity and affirmative action employer. We are committed to creating a diverse workforce. If you have a disability or special need that requires accommodation, please let us know at the earliest opportunity.
#J-18808-LjbffrTo Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search: