Senior Platform Engineer
Listed on 2026-02-13
-
IT/Tech
Systems Engineer, Cloud Computing, SRE/Site Reliability
Location
- Remote, but requirement to meet up once a month for in‑person collaboration day
- UK based
Akixi is a fast‑growing SaaS analytics company delivering real‑time and historical insights for cloud‑based communication platforms, including Microsoft Teams, Cisco Broad Works, and Webex Calling. Our analytics help businesses enhance customer experience, optimise productivity, and drive performance across voice and collaboration channels.
Job SummaryWe are seeking a Senior Platform Engineer with a deep understanding of AWS cloud infrastructure, Infrastructure‑as‑Code (IaC) tooling such as Terraform, and configuration management using Ansible. The ideal candidate will be a self‑starter, passionate about Site Reliability Engineering (SRE) principles, and thrive in collaborative environments.
You will play a pivotal role in automating infrastructure, improving reliability and scalability, and ensuring smooth CI/CD pipelines across multiple environments. You'll work closely with software engineering, and security teams to drive platform excellence.
Key Responsibilities- Design, build, and manage scalable, secure, and resilient infrastructure on AWS using Terraform (modularised, reusable components).
- Implement configuration management solutions using Ansible, including playbook development, inventory structuring, and role‑based automation.
- Manage secrets securely using services such as AWS Secrets Manager or Hashi Corp Vault.
- Implement robust monitoring, alerting, and observability tooling (e.g., Cloud Watch, Prometheus, Grafana, Datadog).
- Participate in incident response, root cause analysis, and resilience improvements.
- Maintain and evolve CI/CD pipelines using tools such as Git Hub Actions, Bitbucket Pipelines, or Jenkins.
- Automate deployments for container‑based workloads on ECS (Fargate), or Lambda, and manage supporting infrastructure.
- Collaborate with development teams to optimise build/deploy cycles and reduce lead time for changes.
- Ensure security best practices are embedded into infrastructure provisioning and pipeline execution.
- Support compliance and auditing by implementing guardrails and controls as code (e.g., AWS Config, SCPs, IAM policy management).
- Some OOH work is required to maintain the production systems and also will be part of OOH critical ticket rota.
- 5+ years in Dev Ops, SRE, or Cloud Engineering roles.
- Expertise in AWS core services: EC2, IAM, VPC, ECS/Fargate, Cloud Formation, Cloud Watch, RDS, Dynamo
DB, S3, Lambda. - Strong proficiency in Terraform (HCL) – including work spaces, modules, and Terraform Cloud or similar.
- Ansible experience – developing roles, dynamic inventories, managing remote configurations.
- Strong scripting knowledge (Bash, Python, or Go).
- Experience with container orchestration and deployment (Docker, ECS, or Kubernetes).
- Proficient with Git Ops or IaC‑based workflows.
- Familiarity with Google SRE practices, particularly around reliability, observability, and operational excellence.
- Understanding of systems reliability metrics and associated tooling
- Self‑driven with a bias toward action and ownership.
- Excellent communicator, able to collaborate across disciplines and levels of technical understanding.
- Experience working as part of a cross‑functional team.
- Comfortable working in agile environments (Scrum/Kanban).
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search: