Senior Site Reliability Engineer
Dundee, Dundee City Area, DD1, Scotland, UK
Listed on 2026-02-16
-
IT/Tech
Cloud Computing, Systems Engineer
About Airalo
Alo! Airalo is the world’s first eSIM store that helps people connect in over 200+ countries and regions across the globe. We are building the next digital service that revolutionizes the telecom industry. We are a travel-tech company and an equal-opportunity environment that values and executes diversity, inclusion, and equity. Our team is spread across 50+ countries and six continents. What glues us together is our commitment to changing the way you connect.
Check out more information about Airalo in our Public Handbook: (Use the "Apply for this Job" box below).-handbook
About youWe hope that you care deeply about the quality of your work, the intrinsic worth of tasks, and the success of your team. You are self-disciplined and do not require micromanagement in terms of your skillset and work ethic. You do your best to flourish as an individual every day while working hard to foster a collaborative team environment. You believe in the importance of being — and staying — authentic, honest, positive, and kind.
You are a good interlocutor with clear and concise communication. You are able to manage multiple projects, have an analytical mind, pay keen attention to detail, and love to get your hands dirty. You are cognizant, tolerant, and welcoming of vulnerabilities and cultural differences.
Position: Full-time / Employee
Location: Remote-first
Benefits: Health Insurance, work-from-anywhere stipend, annual wellness & learning credits, annual all-expenses-paid company retreat in a gorgeous destination & other benefits
On-CallParticipating in our on-call rotation is a core expectation of this role. It's essential for maintaining 24/7 service reliability across our global operations, ensuring our systems remain resilient and our customers experience uninterrupted service, regardless of time zone or geography.
- Paid Rotation:
We offer standby fees + overtime pay. - Delayed Start:
No on-call duties for your first 6 months. - Rest & Recovery:
Guaranteed rest periods and flexible hours following night incidents. - Shared Load:
Rotations are split (Weekdays vs. Weekends) to minimize fatigue.
Please refer to the On-Call Policy in the Airalo Handbook for full details: -policy
We are looking for an Senior Site Relability Engineer to join our growing engineering team.
We are a company that values SRE principles and practices. We believe in empowering our SREs to make data-driven decisions, automate operational tasks, and continuously improve the reliability of our systems. We foster a blameless culture where everyone is encouraged to learn from mistakes and share knowledge. If you are passionate about building and maintaining highly reliable systems, we would love to hear from you!
Whatyou'll do:
- Lead the design of scalable, fault-tolerant and self-healing systems in a multi-region AWS environment.
- Define and track Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to drive architectural decisions and error budget policies.
- Conduct blameless post-incident reviews to uncover systemic root causes and implement long-term preventive measures.
- Identify patterns of manual work and lead the development of internal tools/automation to permanently eliminate them.
- Develop and maintain automated runbooks and playbooks for common operational tasks and complex incident response.
- Shift from simple monitoring to deep observability, ensuring high cardinality data leads to proactive actionable insights.
- Proactively identify and mitigate operational risks through chaos engineering and architecture reviews.
- Work with software engineers to design systems for reliability, scalability, and maintainability from the early stages of the SDLC.
- Continuously evaluate and optimize system performance, capacity, and cost efficiency.
- Beyond just participating, you will refine the on-call experience to reduce alert fatigue, improve MTTR, and ensure sustainable rotation health.
- Bachelor’s degree in Computer Engineering or a similar discipline.
- 5+ years of experience as a Site Reliability Engineer or in a similar role.
- 3+ years of experience…
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search: