Site Reliability Engineer; Hybrid). LilyLifestyle
Listed on 2026-06-06
-
IT/Tech
SRE/Site Reliability, Cloud Computing, IT Support, Systems Engineer
Site Reliability Engineer (SRE)
London (Hybrid - 2-8 days per month in office)
£50,000 per annum
Clear progression to Mid-Level SRE within 18 months
We're working with a growing, engineering‑led organisation looking to hire a Site Reliability Engineer who enjoys solving real platform problems through automation—not just firefighting tickets.
The OpportunityThis role blends hands‑on engineering with platform ownership. You'll spend your time split between:
- Supporting developers with broken builds and deployments (40%)
- Designing and building automation, CI/CD pipelines, and Terraform infrastructure (60%)
You'll act as the automation backbone of the platform—reducing manual effort, improving reliability, and enabling engineering teams to move faster.
Key Responsibilities Developer Support & Troubleshooting (40%)- Debug failing builds, deployments, and CI/CD pipelines
- Provide Tier 2/3 support via Slack, tickets, and pairing sessions
- Take ownership of incidents, ensuring reliable and timely resolution
- Design, build, and optimise CI/CD pipelines (Git Hub Actions, Jenkins, Git Lab CI)
- Develop and maintain Terraform modules for infrastructure‑as‑code
- Build automation tools (CLI tools, scripts, Git Hub Apps, self‑service tooling)
- Own observability: dashboards, alerts, monitoring, and runbooks
- Continuously improve platform processes and reduce operational toil
- 2‑3 years in Dev Ops, SRE, or Platform Engineering
- Strong Linux troubleshooting and systems knowledge
- Proven experience with Terraform (module design, not just usage)
- CI/CD experience (Git Hub Actions, Git Lab CI, Jenkins)
- Ability to write production‑quality code in Python or Bash
- Solid networking fundamentals (DNS, load balancers, CDNs)
- Experience with observability tools (New Relic, Datadog, Prometheus, Grafana)
- Comfortable participating in on‑call rotations
- Experience using AI tools (e.g. ChatGPT, Copilot, Cursor) to enhance productivity
- Go, Ansible, or configuration management experience
- Experience working with multiple CDNs (Cloud Front, Fastly, Cloudflare)
- You're a proactive problem‑solver who automates rather than repeats
- You communicate clearly with both technical and non‑technical stakeholders
- You stay calm under pressure and take ownership during incidents
- You care about clean, maintainable, production‑quality code
- You actively use AI tools to improve how you build and debug systems
- £50,000 salary
- Genuine ownership of CI/CD and platform automation
- Direct collaboration with the Head of Technology
- Clear progression to mid‑level SRE within 18 months
- Learning budget and dedicated development time
This is not a ticket‑driven support role. You'll be a key technical contributor shaping how the platform operates—working alongside engineers who code and influencing real infrastructure and tooling decisions.
#J-18808-LjbffrTo Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search: