×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineer

Job in Llanelli, Carmarthenshire, SA15, Wales, UK
Listing for: Fanvue LLC
Full Time position
Listed on 2026-02-16
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer
Job Description & How to Apply Below

Join us in redefining the creator economy with AI

Fanvue is the fastest-growing creator monetisation platform in the creator economy
. We are the leading AI-powered creator-first platform, designed to empower creators worldwide to directly monetise their audience. We’re on a mission to redefine the creator economy by empowering creators to connect, share, and earn more efficiently.

🎯
The Role
We are hiring a Site Reliability Engineer (SRE) to elevate the reliability, scalability, and performance of the core platform that powers Fanvue. You will be the technical specialist who ensures our infrastructure is predictable, resilient, and capable of supporting rapid product development across multiple teams.

This role sits at the heart of the platform: improving the health of our Aurora Postgre

SQL estate, developing robust AWS infrastructure, enabling engineering teams with deep technical expertise, and driving the reliability culture required to support a fast-scaling product.

🚀
What You’ll Do

  • Own and optimise Aurora Postgre

    SQL (Serverless

    V2) clusters that power Fanvue’s core systems, ensuring performance, availability, and scalability
  • Oversee the reliability of AWS-managed data infrastructure across Aurora, Elasti Cache Redis, Dynamo

    DB, and RDS
  • Develop and maintain Infrastructure as Code using AWS CDK (Type Script), establishing automated, reusable patterns and best practices
  • Reduce operational toil through automation and build self-service tooling that empowers engineering teams
  • Implement and maintain robust monitoring, observability, and alerting using AWS Cloud Watch
  • Ensure CI/CD pipelines are reliable, safe, and performant, enabling frequent and high-confidence deployments
  • Act as the escalation point for complex infrastructure and database issues, supporting teams when deep expertise is required
  • Lead incident response, run post-mortems, and deliver actionable improvements to avoid repeat failures
  • Partner closely with stream teams to understand their infrastructure needs and provide technical guidance without slowing their velocity
  • Mentor engineers across the Platform team, raising reliability standards and improving operational maturity

👀
Who You Are
A highly experienced reliability engineer with deep hands‑on expertise in AWS‑managed database systems, distributed systems, and infrastructure automation. You bring:

  • Extensive experience operating, scaling, and tuning Aurora PostgreSQL (preferably Serverless

    V2)
  • Strong proficiency across AWS database services:
    Aurora Postgre

    SQL, Elasti Cache Redis, Dynamo

    DB, and RDS
  • Expertise with Infrastructure as Code
    , especially AWS CDK (Type Script)
  • Proven ability to identify, measure, and eliminate toil through automation
  • Experience applying SRE principles: SLIs, SLOs, error budgets, gradual rollouts, and reliability‑focused system design
  • Strong architectural thinking, with the ability to design fault‑tolerant, scalable infrastructure
  • Deep expertise with monitoring, observability, and performance tuning using AWS Cloud Watch
  • Excellent communication skills and the ability to guide teams without creating bottlenecks
  • A high‑ownership mindset aligned with Amazon Leadership Principles:
    Ownership, Dive Deep, Think Big, Deliver Results

Nice‑to‑haves

  • Experience supporting ECS Fargate workloads or containerised environments
  • Background in building internal platform tools or developer enablement systems
  • Familiarity with microservice vs centralised architecture trade‑offs


You’ll Thrive Here If

  • You enjoy being the deep technical expert teams rely on
  • You love optimising systems for performance and reliability
  • You are motivated by solving hard technical problems and making infrastructure invisible, stable, and scalable
  • You take pride in raising engineering standards and creating leverage for others

⚠️
You’ll Struggle Here If

  • You prefer reactive operations over proactive engineering
  • You are uncomfortable owning large technical surfaces with autonomy
  • You avoid hands‑on investigation, deep dives, or operational responsibility

🌍
Why Join Fanvue?

  • Own and strengthen the most mission‑critical systems at one of the fastest‑growing creator platforms
  • Competitive salary, equity, and benefits package
  • A culture that values innovation, ownership, transparency, and speed
  • Unlimited holiday
  • Remote working
  • Flexible hours to support how you perform best
  • Budget for growth and wellbeing


Fanvue is for Everyone
We know that diverse teams build better products. Even if you do not meet every single requirement, we encourage you to apply. Many great people grow into parts of a role, and we value potential just as much as experience.

#J-18808-Ljbffr
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary