
Site Reliability Engineer

Job in Riyadh, Riyadh Region, Saudi Arabia
Listing for: Lucidya | لوسيديا
Full Time position
Listed on 2026-03-27
Job specializations:
  • IT/Tech
    SRE/Site Reliability, Cloud Computing
Salary/Wage Range or Industry Benchmark: 200000 - 300000 SAR Yearly
Job Description & How to Apply Below

About Lucidya

Lucidya is an AI-native platform for customer experience (CX) intelligence that manages entire customer life cycles autonomously, from initial engagement through retention and growth. Unlike platforms that only surface insights and leave the action to you, Lucidya closes the loop with proprietary NLU technology built in-house and trained on millions of multilingual conversations. This enables marketing, support, CX, and research teams to deliver personalized experiences that drive measurable improvements in customer satisfaction, retention, and lifetime value.

As we continue scaling globally, the reliability, performance, and resilience of our infrastructure become mission‑critical to everything we do.

Why this role matters

At Lucidya, our platform processes massive volumes of real‑time customer data. Any downtime, latency, or instability directly impacts our customers’ ability to make decisions and serve their own users. This role exists to make sure that doesn't happen.

What You'll Do

You’ll be responsible for outcomes, not just tasks. Here’s what success looks like in this role:

You’ll make reliability the default
  • Design and maintain infrastructure that is highly available, fault‑tolerant, and scalable.
  • Proactively identify and eliminate single points of failure before they become incidents.
  • Ensure our production systems remain stable, even under increasing scale and load.
You’ll own and optimize our cloud environments
  • Manage and continuously improve workloads across AWS, GCP, or Azure.
  • Use Infrastructure as Code (Terraform) to standardize and scale infrastructure.
  • Optimize resource usage to balance performance and cost.
You’ll run and improve Kubernetes in production
  • Operate and scale Kubernetes clusters (EKS, GKE, etc.) with confidence.
  • Troubleshoot issues quickly and ensure smooth deployments and upgrades.
  • Ensure our containerized workloads perform reliably at scale.
You’ll build strong observability and respond to incidents
  • Implement and refine monitoring systems using tools like Prometheus, Grafana, Datadog, or ELK.
  • Define alerting that is meaningful, not noisy.
  • Respond to incidents, lead root cause analysis, and ensure we learn from every failure.
You’ll automate everything that shouldn't be manual
  • Write scripts and build tooling to eliminate repetitive operational work.
  • Continuously improve infrastructure efficiency through automation.
  • Promote a culture where manual work is a temporary state, not the norm.
You’ll collaborate to improve the entire system
  • Work closely with DevOps and engineering teams to solve performance bottlenecks.
  • Contribute to CI/CD improvements and deployment reliability.
  • Help shape reliability best practices across the organization.
What success looks like (First 90 Days)
First 30 days
  • Built a strong understanding of our infrastructure, systems, and workflows.
  • Contributing to day‑to‑day operations with support from the team.
  • Started identifying areas for improvement in automation and reliability.
By 90 days
  • Independently managing infrastructure tasks and troubleshooting issues.
  • Actively contributing to reliability and scalability improvements.
  • Taking ownership of parts of our infrastructure and improving them.
Requirements

Who You Are

This is what will make you successful in this role:

  • Spent 3+ years working in SRE, DevOps, or infrastructure engineering, and have seen what breaks at scale.
  • Comfortable working in cloud environments like AWS, GCP, or Azure—and understand how distributed systems behave.
  • Hands‑on with Kubernetes in production and know how to troubleshoot it when things go wrong.
  • Don’t just fix issues - ask why they happened and make sure they don’t happen again.
Technically, you likely
  • Use Terraform (or similar IaC tools) to manage infrastructure.
  • Work confidently with Docker and Kubernetes.
  • Write scripts in Python, Bash, or similar to automate workflows.
  • Understand CI/CD pipelines (Jenkins, GitHub Actions, Bitbucket, etc.).
  • Have a solid grasp of networking, load balancing, and high‑availability design.
When it comes to monitoring
  • Have implemented tools like Prometheus, Grafana, Datadog, or ELK.
  • Know the difference between useful alerts and noise.
  • Focus on signals that actually drive action.
What sets you apart
  • Take ownership—you don't wait to be told something is broken.
  • Calm under pressure and methodical during incidents.
  • Simplify complexity instead of adding to it.
  • Communicate clearly, even when explaining deeply technical issues.
  • Care about building systems that make other engineers more effective.
Nice To Have (but Not Required)
  • Experience with RabbitMQ or Redis in production.
  • Familiarity with Ansible or AWX.
  • Exposure to multi‑cloud or hybrid environments.
  • Cloud certifications (AWS, GCP) or Linux certifications.
  • Background from ITI (Information Technology Institute).
What The Hiring Process Will Look Like
  • Screening Interview - Talent Acquisition.
  • Technical Interview - SRE Lead.
  • Technical Task.
  • Final Interview - SRE Lead & Cloud DevOps Director.