Engineering Manager, MLOps, Marketplace, Ecommerce Users | Remote H
London, Greater London, EC1A, England, UK
Listed on 2025-12-21
-
IT/Tech
Cloud Computing, AI Engineer, Data Science Manager, Machine Learning/ ML Engineer
Engineering Manager, MLOps – Marketplace, Ecommerce
Our client is a well‑known digital marketplace focused on sustainable e‑commerce with over 35 million active users worldwide. They are redefining how people buy and sell second‑hand fashion, aiming to make the future of style circular and accessible.
We are looking for an experienced MLOps Engineering Manager to lead and scale the MLOps function. You will shape how machine learning is developed, deployed, and scaled across the organisation.
As a high‑impact role you will lead a talented team of 6‑8 engineers, set strategic direction for ML infrastructure, and ensure the business delivers reliable, scalable, and high‑performing ML systems that drive real‑world impact.
Key Responsibilities- Manage and develop a team of 8 MLOps engineers, fostering collaboration, high performance, and personal growth.
- Define and deliver the MLOps roadmap, aligning closely with the wider engineering and data strategy.
- Provide guidance on architecture, tooling, and best practices for ML pipelines, deployment, monitoring, and incident management.
- Partner with data science, ML, and product teams to ensure infrastructure supports innovation and business needs.
- Oversee system reliability, cost optimisation, and vendor relationships to keep infrastructure scalable and efficient.
- Take ownership of critical ML/infra incidents, ensuring swift resolution and continuous learning.
- Deliver clear progress, risk, and priority updates to leadership in a concise and actionable way.
- Proven experience leading an MLOps, ML Engineering, or Platform Engineering team.
- Solid background in applied machine learning and a passion for platform disciplines.
- Hands‑on experience with cloud platforms (AWS, GCP, or Azure), including large‑scale ML infrastructure management.
- Knowledge of GPU computing for model training and serving.
- Experience managing containerised workloads (Docker, Kubernetes, Kubeflow, etc.) and integrating with CI/CD tools (Jenkins, Git Hub Actions, Git Lab CI).
- Familiarity with distributed computing frameworks (Spark, Ray, Tensor Flow Distributed, PyTorch Distributed).
- Strong understanding of monitoring, logging, and observability for large‑scale ML systems.
- Experience in cost optimisation for compute/GPU workloads.
- Excellent people leadership and communication skills, able to influence technical and non‑technical stakeholders.
- Comfortable working in a fast‑paced, collaborative environment with strategic and operational responsibilities.
- Experience with vendor management and contract oversight.
- Familiarity with tools such as Databricks, Tecton (or Feast), Seldon, or Sage Maker.
- Private health and mental wellbeing coverage, including counselling and coaching.
- Salary up to £140,000 + bonus and benefits.
- 25 days annual leave plus additional company‑wide rest days and volunteer leave.
- Flexible hybrid working; option to work abroad for limited periods.
- Generous parental, IVF, and carer leave policies.
- Learning and development budgets for conferences, mentorship, and skills growth.
- Pension matching, life insurance, and recognition for service milestones.
If you are interested, to Owen Thomas and we will review your application.
#J-18808-LjbffrTo Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search: