Lead DevOps Engineer
Listed on 2026-05-09
-
IT/Tech
Systems Engineer, Cloud Computing, SRE/Site Reliability, IT Project Manager
Gabb gives families a better choice for their kids’ first technology. Our product portfolio spans purpose-built hardware alongside digital services. Everything we build is designed to let families introduce technology in steps, giving kids connection without the harms of a fully open smartphone. We’re a fast-growing, mission-driven company headquartered in Lehi, Utah, with a team that genuinely loves what we do and why we do it.
OverviewGabb is seeking a hands-on Dev Ops leader to help scale and operate our infrastructure. This is a player/coach role where you’ll spend most of your time building, improving, and supporting our platform while leading a small team of engineers. We’re looking for someone who can quickly assess our environment and drive practical improvements.
You will own infrastructure reliability, CI/CD, observability, incident response, and cost management. This role is deeply hands-on and includes participating in on-call and actively debugging production issues. You’ll also help guide a small Dev Ops team and improve how we operate day-to-day. You’ll partner closely with engineering and product teams to support business needs, focusing on execution, reliability, and continuous improvement — not a purely strategic or management role.
WhatYou’ll Do
- INFRASTRUCTURE & RELIABILITY — Own infrastructure reliability and uptime across development, staging, and production environments
- Design and implement scalable, resilient systems on AWS that support continued customer and device growth
- Manage multi-regional or highly available infrastructure with clear SLAs, failover patterns, and capacity planning
- Identify and remediate security risks in infrastructure and deployment pipelines in partnership with engineering and security teams
- CI/CD & DEVELOPER PLATFORM — Build and maintain CI/CD systems and Git Ops workflows that enable product teams to deploy with speed and confidence
- Improve developer tooling, deployment pipelines, and environment management to reduce operational burden on product engineering teams
- Manage Kubernetes and containerized workloads; drive infrastructure-as-code practices using Terraform, Helm, and related tooling
- OBSERVABILITY & INCIDENT RESPONSE — Build and enhance observability practices—monitoring, logging, tracing, and alerting—that provide visibility into system health, performance, and risk
- Establish and lead incident response practices including on-call structures, root cause analysis, and post-incident improvement processes
- Provide leadership with regular reporting and dashboards on infrastructure health, performance trends, and cost
- SYSTEMS, PROCESSES & OPERATIONAL MATURITY — Define and execute the infrastructure and platform roadmap aligned with company growth and product needs
- Establish operational processes, runbooks, and documentation that increase reliability, consistency, and knowledge transfer across the team
- Drive infrastructure cost management — build visibility into cloud spend, identify waste, and align resource usage with business goals
- Support new product initiatives with scalable platform solutions, ensuring Dev Ops considerations are embedded early in architecture decisions
- TEAM LEADER SHIP — Lead, mentor, and grow a high-performing Dev Ops team — foster a culture of ownership, collaboration, and operational excellence
- Set team direction, define roles and responsibilities, and build clear accountability structures
- Collaborate with engineering, product, and executive leadership to translate business needs into technical initiatives and priorities
- 6+ years of experience in Dev Ops, SRE, or infrastructure engineering, with at least 2+ years leading teams
- Demonstrated track record of building or maturing a Dev Ops function — not just operating within an established one
- Experience operating and scaling systems to millions of users or connected devices
- Strong experience with AWS and cloud-based infrastructure at production scale
- Experience with Kubernetes and containerized workloads in production environments
- Expertise in infrastructure as code (Terraform, Helm) and CI/CD pipeline design (Git Ops workflows preferred)
- Strong…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).