Lead DevOps Engineer Job, Lehi area, Utah, USA, IT/Tech

Gabb gives families a better choice for their kids’ first technology. Our product portfolio spans purpose-built hardware alongside digital services. Everything we build is designed to let families introduce technology in steps, giving kids connection without the harms of a fully open smartphone. We’re a fast-growing, mission-driven company headquartered in Lehi, Utah, with a team that genuinely loves what we do and why we do it.

Overview

Gabb is seeking a hands-on DevOps leader to help scale and operate our infrastructure. This is a player/coach role where you’ll spend most of your time building, improving, and supporting our platform while leading a small team of engineers. We’re looking for someone who can quickly assess our environment and drive practical improvements.

You will own infrastructure reliability, CI/CD, observability, incident response, and cost management. This role is deeply hands-on and includes participating in on-call and actively debugging production issues. You’ll also help guide a small DevOps team and improve how we operate day-to-day. You’ll partner closely with engineering and product teams to support business needs, focusing on execution, reliability, and continuous improvement. This is not a purely strategic or management role.

What You’ll Do

INFRASTRUCTURE & RELIABILITY
Own infrastructure reliability and uptime across development, staging, and production environments
Design and implement scalable, resilient systems on AWS that support continued customer and device growth
Manage multi-regional or highly available infrastructure with clear SLAs, failover patterns, and capacity planning
Identify and remediate security risks in infrastructure and deployment pipelines in partnership with engineering and security teams

CI/CD & DEVELOPER PLATFORM
Build and maintain CI/CD systems and GitOps workflows that enable product teams to deploy with speed and confidence
Improve developer tooling, deployment pipelines, and environment management to reduce operational burden on product engineering teams
Manage Kubernetes and containerized workloads; drive infrastructure-as-code practices using Terraform, Helm, and related tooling

OBSERVABILITY & INCIDENT RESPONSE
Build and enhance observability practices (monitoring, logging, tracing, and alerting) that provide visibility into system health, performance, and risk
Establish and lead incident response practices including on-call structures, root cause analysis, and post-incident improvement processes
Provide leadership with regular reporting and dashboards on infrastructure health, performance trends, and cost

SYSTEMS, PROCESSES & OPERATIONAL MATURITY
Define and execute the infrastructure and platform roadmap aligned with company growth and product needs
Establish operational processes, runbooks, and documentation that increase reliability, consistency, and knowledge transfer across the team
Drive infrastructure cost management: build visibility into cloud spend, identify waste, and align resource usage with business goals
Support new product initiatives with scalable platform solutions, ensuring DevOps considerations are embedded early in architecture decisions

TEAM LEADERSHIP
Lead, mentor, and grow a high-performing DevOps team, fostering a culture of ownership, collaboration, and operational excellence
Set team direction, define roles and responsibilities, and build clear accountability structures
Collaborate with engineering, product, and executive leadership to translate business needs into technical initiatives and priorities

What We’re Looking For

EXPERIENCE & SKILLS

6+ years of experience in DevOps, SRE, or infrastructure engineering, with at least 2 years leading teams
Demonstrated track record of building or maturing a DevOps function, not just operating within an established one
Experience operating and scaling systems to millions of users or connected devices
Strong experience with AWS and cloud-based infrastructure at production scale
Experience with Kubernetes and containerized workloads in production environments
Expertise in infrastructure as code (Terraform, Helm) and CI/CD pipeline design (GitOps workflows preferred)
Strong…