×
Register Here to Apply for Jobs or Post Jobs. X

Lead Site Reliability Engineer

Job in Southampton, Hampshire County, SO15, England, UK
Listing for: NICE
Full Time position
Listed on 2026-05-19
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing, IT Support
Salary/Wage Range or Industry Benchmark: 100000 - 125000 GBP Yearly GBP 100000.00 125000.00 YEAR
Job Description & How to Apply Below

At NiCE, we don’t limit our challenges. We challenge our limits. Always. We’re ambitious. We’re game changers. And we play to win. We set the highest standards and execute beyond them. And if you’re like us, we can offer you the ultimate career opportunity that will light a fire within you.

So, what's the role all about?

Here at NICE Public Safety , we provide state of the art solutions for the Public Safety & Justice market, providing software as a service for multi-media evidence management and Emergency Contact Centres to a worldwide customer base.

We are currently expanding our Cloud Platform Engineering team to ensure we continue to offer exemplary service to our customers. This is a very hands‑on role. You will be involved in ensuring our cloud platforms are observable, measurable, reliable, scalable, and maintainable. It’s likely that the successful candidate will have significant experience in a Dev Ops, SRE, Cloud Engineer, or Cloud Development role.

How

will you make an impact?
  • Act as part of a team of SREs that act as the ‘gatekeepers’ of production and actively manage the work backlog and develop reliability improvements.
  • Lead investigations into root cause outages, performance, and cost issues.
  • Lead initiatives to develop the automation of low-value tasks balanced against project delivery demands.
  • You will provide technical leadership and to wider Cloud Operations and Support teams along with providing oversight to the products and services they support.
  • Collaborate with Dev Ops and engineering teams to establish and enforce SLOs, SLAs, and error budgets
  • Develop and configure monitoring dashboards and alerts in tools like Grafana and Azure Monitor.
  • Installation and configuration of Observability Platform including tools like Grafana, Prometheus, Azure Monitor, Open telemetry etc.
  • Developing bicep modules for monitoring infrastructure and deploy it.
  • Optimize system performance, cost, and security through regular reviews and tuning.
Do you have what it takes?
  • Must have 6+ years of experience in Site Reliability Engineering
  • Excellent technical, analytical and troubleshooting skills
  • Experience and in-depth knowledge of databases and data handling (MS-SQL, Elasticsearch, YML, JSON, XML)
  • Experience with Azure cloud
  • Significant experience in programming or advanced scripting (Python, Power Shell, C# etc.)
  • Experience with infrastructure/configuration as code and version control (ARM, BICEP, Git)
  • Strong Experience managing monitoring, alerting and dashboarding platforms (Azure Monitor, Prometheus, Grafana, Elasticsearch)
  • Demonstrable experience of supporting live cloud services and platforms
  • Expert in developing queries for dashboards and alerting for microservices.
  • Collaborate with Dev Ops and engineering teams to establish and enforce SLOs, SLAs, and error budgets.
  • Production experience with Kubernetes and containerization (AKS)
  • Exposure to Azure Dev Ops pipelines is desirable (CI/CD)
  • Strong experience in infrastructure as a code, design and implementation strategies.
  • Experience with AI (tools) to automate and accelerate is a plus.
  • Efficient, effective, and respectful communication skills both with customers and within internal departments. Including,
    • Good listener, able to identify and validate assumptions.
    • Able to use effective questioning to confirm understanding of a customer problem and then provide help to solve it.
    • Methodical troubleshooting, technical skill and attention to detail used in diagnosing problems and reproducing issues in a local environment.
    • Multi-tasking and time-management to prioritise and switch between varied tasks.
  • Significant experience in platform engineering, observability, and provisioning.
  • Proven ability to develop and implement a strategic vision for platform services, observability, and provisioning.
  • Strong understanding of cyber security principles, governance, and compliance frameworks.
  • Strong understanding and experience of cloud platforms, containerisation, and microservices architecture.
  • Broad background across information technology with the ability to communicate clearly with non-security technical SMEs at a comfortable level.
  • Strong proficiency in technical scoping, architecture…
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary