×
Register Here to Apply for Jobs or Post Jobs. X

Lead Cloud Operations Engineer

Job in Wrexham, Wrexham County, LL13, Wales, UK
Listing for: NICE
Full Time position
Listed on 2026-07-03
Job specializations:
  • IT/Tech
    AWS, Cloud Computing: Infrastructure & Operations, SRE/Site Reliability, Unix/Linux
Job Description & How to Apply Below

At NiCE, we don’t limit our challenges. We challenge our limits. Always. We’re ambitious. We’re game changers. And we play to win. We set the highest standards and execute beyond them. And if you’re like us, we can offer you the ultimate career opportunity that will light a fire within you.

About the Role

We are seeking a highly skilled Cloud Operations Engineer to join our team. The ideal candidate will possess deep technical expertise in Linux systems, AWS cloud infrastructure, container orchestration platforms, and database administration.

In this role, you will be responsible for designing, implementing, operating, and optimizing complex cloud environments on AWS. You will manage containerized workloads running on Amazon EKS and ECS, maintain production‑grade Linux systems and databases, and contribute to the reliability, scalability, and security of our cloud platforms.

How You Will Make an Impact
  • Design, implement, and operate scalable, secure, and highly available AWS cloud infrastructure leveraging services such as EC2, EKS, ECS, RDS, S3, VPC, Lambda, and IAM.
  • Drive the reliability and performance of containerized applications by managing Amazon EKS and ECS environments, including cluster operations, networking, scaling, and troubleshooting.
  • Ensure the stability, security, and efficiency of production Linux environments through system administration, performance tuning, storage management, networking, and incident resolution.
  • Maintain and optimize relational databases (PostgreSQL, MySQL, Aurora) and No

    SQL platforms (DynamoDB, Redis), ensuring high availability, performance, and disaster recovery readiness.
  • Strengthen the organization’s cloud security posture through effective management of IAM, network security controls, secrets management, and compliance best practices.
  • Enhance platform observability and operational excellence by implementing and improving monitoring, logging, alerting, and performance analytics using Cloud Watch, Prometheus, and Grafana.
  • Take ownership of production incidents by participating in on‑call rotations, leading troubleshooting efforts, performing root cause analysis, and driving continuous improvement initiatives.
  • Partner closely with software engineering, Dev Ops, and platform teams to improve deployment processes, application reliability, and operational efficiency.
  • Identify and implement cloud cost optimization opportunities through resource right‑sizing, capacity planning, automation, and governance best practices.
Qualifications
  • 4–5 years in a cloud operation, infrastructure engineering, or SRE role with a strong hands‑on technical focus.
  • Deep hands‑on experience with core AWS services: EC2, EKS, ECS, RDS/Aurora, S3, VPC, IAM, Lambda, Cloud Watch, Route 53, and ALB/NLB.
  • Proven ability to design and troubleshoot complex AWS networking topologies (VPCs, subnets, transit gateways, security groups).
  • Solid understanding of AWS IAM—roles, policies, permission boundaries, and cross‑account access.
  • Hands‑on production experience managing workloads on Amazon EKS and ECS—cluster lifecycle, node group management, networking (CNI, service mesh basics), and autoscaling.
  • Strong Docker fundamentals: image builds, registries (ECR), multi‑stage builds, and container security.
  • Strong Linux administration skills:
    Bash/Python scripting, process and memory management, file system and storage operations, kernel parameters, and network diagnostics.
  • Experience managing and hardening Linux servers in production environments (RHEL, Ubuntu, or Amazon Linux).
  • Proficient in Terraform—module design, state management, remote backends, and workspace strategies.
  • Hands‑on experience with Puppet for configuration management, node classification, and enforcing system state at scale.
  • Hands‑on experience with relational databases:
    PostgreSQL, MySQL, or AWS RDS/Aurora—schema management, query optimisation, replication, backups, and failover.
  • Familiarity with No

    SQL databases:
    DynamoDB, Redis, or MongoDB—data modelling, performance tuning, and operational monitoring.
  • Familiarity with CI/CD pipelines (Git Hub Actions, Jenkins, or AWS Code Pipeline).
  • Experience with observability tooling:
    Cloud Watch, Datadog,…
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary