DevOps Engineer
Listed on 2026-05-31
-
IT/Tech
Cloud Computing
About Vumedi:
Vumedi is the largest video education platform for doctors worldwide, dedicated to advancing medical education through innovative video-based learning. Our mission is to empower healthcare professionals by providing them with access to the latest clinical knowledge and surgical techniques from experts around the globe. We curate a vast library of high-quality educational content, enabling users to enhance their skills, stay informed about industry trends, and improve patient outcomes.
We are headquartered in Oakland, CA
, and have additional offices in Minneapolis, MN, and Zagreb, Croatia.
- Build technology that matters in a fast-scaling Silicon Valley digital healthcare company
:
Your work directly impacts how doctors across the world learn and make decisions that save lives. - Grow as we grow: Be part of a company in an accelerated growth phase, where expanding teams, products, and markets create real opportunities for ownership, leadership, and career progression.
- Build with AI
:
Work on applied LLM systems – from intelligent search to AI-driven content agents – and shape how AI transforms medical knowledge delivery. - Own your craft end-to-end
:
Take full responsibility for building systems that scale globally and power mission‑critical workflows. - Collaborate globally: Join a world‑class team of passionate engineers on modern tech stack which will further drive your career development.
- Have real product impact
:
Influence the direction of product development by collaborating closely with product and leadership teams.
We are looking for a Dev Ops Engineer to join our engineering team and take ownership of our infrastructure, deployment processes, and overall platform reliability. You will work closely with backend and data teams to support a growing video and data platform used by millions of healthcare professionals worldwide.
In this role, you will focus on improving our CI/CD pipelines, system reliability, and developer experience, while helping scale our cloud infrastructure in a secure and cost‑efficient way. You will work extensively with AWS services (compute, storage, networking, IAM, monitoring) and help ensure our systems are reliable, observable, and well‑architected.
You’ll also support and enable emerging AI/ML and LLM‑powered systems used for large‑scale medical content processing, helping build and operate the infrastructure required for these workloads. This includes improving data pipelines, optimizing resource usage, and ensuring production‑grade reliability of AI‑driven services.
This is a high‑impact role with a broad scope—from supporting production systems and data pipelines to driving long‑term improvements in how we build, deploy, and operate our platform, with strong ownership and autonomy in shaping Dev Ops practices.
What you will do:- Own and improve our infrastructure, CI/CD pipelines, and deployment processes across multiple environments
- Work with AWS services (compute, storage, networking, IAM, monitoring) to ensure scalable, secure, and reliable systems
- Collaborate closely with backend and data teams to support production systems, data pipelines, and overall platform reliability
- Continuously improve developer experience by streamlining workflows, reducing friction, and enabling faster, safer deployments
- Contribute to improving security practices, access control, and compliance of our infrastructure
- Automate infrastructure and workflows using Python
- Improve observability by implementing and maintaining monitoring, logging, and alerting systems
- Troubleshoot production issues, participate in incident response, and implement long‑term fixes to improve system stability
- Identify and drive improvements in performance, scalability, and cost efficiency across the platform
- Support and scale AI/ML and LLM‑based systems, ensuring reliable infrastructure for data processing and content classification workloads
- You have 5+ years of experience in Dev Ops, SRE, or infrastructure engineering, with a strong focus on cloud‑native environments (preferably AWS)
- You have managed cloud infrastructure (networking, IAM, compute, storage) with a…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).