×
Register Here to Apply for Jobs or Post Jobs. X

Senior Platform Engineer

Job in Greater London, London, Greater London, W1B, England, UK
Listing for: Attio Ltd
Full Time position
Listed on 2026-06-12
Job specializations:
  • IT/Tech
    SRE/Site Reliability, Cloud Computing: Infrastructure & Operations, Systems Engineer
Salary/Wage Range or Industry Benchmark: 95000 - 125000 GBP Yearly GBP 95000.00 125000.00 YEAR
Job Description & How to Apply Below
Location: Greater London

About the Role

We are seeking highly skilled and experienced Platform Product Engineers to join our Security, Infrastructure and Performance team. This is a crucial, dual‑faced role that combines high‑level engineering strategy with hands‑on operational excellence. The successful candidates will be responsible for building, operating, and continuously enhancing the internal technology platform, fundamentally treating this platform as a product with all development teams as its primary customers.

What

you'll do

The core responsibility is to implement, maintain, and continuously improve the foundational platform infrastructure that powers all engineering services. This necessitates a relentless focus on ensuring high reliability, exceptional scalability, and optimal performance across the entire stack.

Platform Infrastructure: Build and maintain platform infrastructure using declarative IaC tools (e.g., Terraform, Pulumi), ensuring all environments are reproducible, version‑controlled, and auditable. Proactively manage the capacity of the infrastructure to consistently meet or exceed Service Level Objectives for latency, error rates, and availability.

Incident Response and Post‑Mortems: Act as first‑line responders for critical system incidents. Triage, diagnose, and resolve complex production issues rapidly. Drive a culture of blameless post‑mortems, ensuring root causes are identified and long‑term preventative measures are implemented as code.

Tooling & Automation: Own the stack of supporting tools necessary for operational excellence and developer enablement, including:

  • Continuous Integration and Continuous Delivery Pipelines: Implement, maintain, and evolve fully automated CI and CD pipelines. Establish best practices for fast, reliable, and secure build, test, and deployment processes.
  • Observability: Implement and manage robust systems for monitoring (metrics), logging (centralised log aggregation), and distributed tracing to provide deep insights into application and infrastructure health.
What you'll bring

Applied Dev Ops and SRE Principles:

  • Demonstrable, hands‑on experience applying core Dev Ops and Site Reliability Engineering (SRE) principles to manage, monitor, and scale production systems.
  • Deep understanding of the SRE mindset, including SLO/SLA creation and monitoring, error budget management, toil reduction, and post‑incident review (blameless postmortems).
  • Proven ability to drive cultural and process change that fosters collaboration between development and operations teams.

Cloud Infrastructure and Containerisation Expertise:

  • Expertise in one or more major public cloud providers (AWS, GCP, or Azure), encompassing network configuration, security best practices (IAM, security groups, etc.), compute services (EC2, GKE, ECS, etc.), and managed services (databases, queues, serverless functions).
  • In‑depth knowledge of container technologies, specifically Docker, and extensive experience orchestrating them at scale using Kubernetes (K8s). Includes designing, deploying, and managing Kubernetes clusters, understanding networking (CNI), storage (CSI), and security configurations within the Kubernetes ecosystem.

Automation and Programming

Skills:

  • Proficiency in one or more modern software languages (e.g., Typescript, Go, Python, Rust) and associated frameworks used for building high‑performance, resilient production systems.
  • Proven experience developing robust, maintainable, and well‑tested automation scripts, services and pipelines to manage infrastructure, deployments, and operational tasks.

Operational Tooling and Observability Management:

  • Experience owning, managing, and maintaining mission‑critical operational tooling.
  • Desirable:
    Proven background in implementing and managing centralised logging solutions or similar platforms (e.g., Splunk, Data Dog).
  • Desirable:
    Familiarity with distributed tracing tools (e.g., Jaeger, Zipkin) and Application Performance Monitoring (APM) solutions.
What we offer
  • Competitive salary of £95,000 to £125,000
  • Equity in an early‑stage tech company on an incredible trajectory
  • Private medical insurance through AXA
  • Pension contribution through Hargreaves Lansdown
  • Enhanced family leave
  • Team off‑site in fun places! (We've been to Barcelona, Lisbon, Malta, and Split so far)
#J-18808-Ljbffr
Position Requirements
10+ Years work experience
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary