Site Reliability Engineer
Listed on 2026-03-11
IT/Tech
Cloud Computing, Systems Engineer
Sapio Sciences is on a mission to accelerate scientific drug discovery and high-throughput clinical and diagnostic services for our clients and partners. The Sapio team consists of expert, highly collaborative scientists, software developers, and professionals passionate about delivering a best-in-class lab informatics platform and industry-specific solutions.
Sapio is one of the few software providers to offer a truly unified and highly configurable lab informatics platform and a broad suite of purpose-built solutions. The Sapio platform makes it easy for scientists, laboratory professionals, and bioinformatics professionals to streamline and manage their end-to-end laboratory operations, from instrument data integration to workflow and experiment setup to sample and material management, data management, and scientific data analysis and reporting.
Working at Sapio
At Sapio, we’re not just building lab informatics solutions. We’re creating tools for scientists who are motivated to make the world a better place. We understand that lab informatics is about more than managing data or connecting workflows. It’s about making life easier for scientists and accelerating scientific progress for everyone. Our platform delivers the levels of configurability, usability, and insight that scientists have only ever dreamed of.
We’re a team of scientists, developers, and innovators who question convention and stay focused on what matters most: advancing drug discovery science. We challenge the status quo and respond to the needs at the heart of science with powerful solutions that are simple to use, effortless to evolve and downright easy to love. As part of the Sapio team, you’ll work in a collaborative and forward-thinking environment where your ideas are valued, your growth matters, and your work makes a difference.
We’re proud to partner with leading labs around the world, from ambitious start-ups to global organisations, who trust Sapio to support discovery, development and diagnostics with industry-first science-aware solutions.
The Position
We are seeking a Site Reliability Engineer (SRE) to maintain, improve, and scale the reliability, performance, and availability of Sapio’s SaaS platform. This role combines cloud infrastructure operations with automation engineering to ensure stable, secure, and scalable environments across customer and internal systems.
You will partner closely with Product, Support, and DevOps teams to embed reliability best practices into system architecture, deployment processes, and operational workflows. This role contributes to service resilience, observability maturity, and continuous improvement initiatives that directly support customer SLAs and platform growth.
- Operate and enhance our AWS-based environment (EC2, ECS, Terraform, custom tooling)
- Action customer-driven requests such as deployments and upgrades
- Troubleshoot and resolve outages, incidents, and performance issues
- Continuously improve monitoring, alerting, and observability for early detection and reduced noise
- Collaborate with engineering teams to design, build, and improve infrastructure solutions
- Participate in the on‑call rotation to maintain high availability
- Contribute to the design and delivery of new infrastructure features
- Drive automation, reliability, and efficiency improvements across existing systems
- Degree or equivalent in Computer Science, Engineering, Data Science, Life Sciences, or related field, or equivalent practical experience.
- Proven experience working as an SRE, DevOps, or Cloud Infrastructure Engineer in production Linux/cloud environments.
- Background in B2B SaaS businesses (scaling multi‑tenant systems, customer‑driven SLAs).
- Scripting/coding with Python or Ruby.
- Experience running and supporting Java-based applications.
- Strong track record of operational excellence and infrastructure improvement.
- Linux systems administration (deep troubleshooting and performance tuning)
- AWS (EC2, ECS, networking, IAM, security best practices)
- Infrastructure‑as‑code with Terraform
- Configuration management (Chef or equivalent)
- Monitoring & observability (Datadog or…