Sr. DevOps Manager
Listed on 2026-02-16
-
IT/Tech
SRE/Site Reliability, IT Project Manager, Systems Engineer, Cloud Computing
* This is a 100% remote role for someone based in either Boston or San Francisco*
Zeitview is the leading intelligent aerial imaging company for high-value infrastructure, providing businesses with actionable, real-time insights to recover revenue, reduce risk and improve build quality. We serve customers in the solar, wind, insurance, construction, real estate, and critical infrastructure industries. Trusted by the largest enterprises in the world, Zeitview is active in over 70 countries. Our mission is to accelerate the global transition to renewable energy and sustainable infrastructure through advanced inspection solutions.
Take a look at our latest achievements here!
We are seeking a Senior Dev Ops Engineering Manager who combines strong people leadership with deep hands‑on technical expertise. This role is intentionally split: approximately 50% people and delivery leadership, and 50% hands‑on Dev Ops engineering. You will lead and grow a Dev Ops/SRE organization while actively designing, building, and operating the infrastructure, platforms, and tooling that power our engineering teams.
This is a critical role for someone who enjoys staying close to the technology while scaling teams, systems, and operational excellence.
- Lead, mentor, and grow a team of Dev Ops and SRE engineers, including performance management, coaching, career development, and hiring.
- Establish clear goals, expectations, and success metrics for the team; own and track team OKRs.
- Foster a culture of operational excellence, ownership, reliability, and continuous improvement.
- Partner closely with Engineering, Platform, Security, IT, and Product leaders to align infrastructure strategy with business and product goals.
- Perform capacity planning, workload prioritization, and on‑call rotations to ensure sustainable team operations.
- Act as an escalation point for major incidents and drive actionable postmortems and systemic improvements.
- Actively design, build, and maintain cloud infrastructure (e.g., AWS/GCP/Azure) using Infrastructure as Code (Terraform, Cloud Formation, Pulumi, etc.).
- Architect and operate CI/CD pipelines to enable fast, safe, and reliable software delivery.
- Lead hands‑on efforts in reliability, scalability, performance, security, and cost optimization.
- Drive improvements in observability (logging, metrics, tracing), alerting, and incident response.
- Work directly in production systems, including participating in on‑call rotations when necessary.
- Partner with application and platform teams to improve developer experience, deployment patterns, and operational readiness.
- Evaluate, introduce, and standardize Dev Ops tools and practices across the organization.
- Cloud infrastructure and networking
- Kubernetes and container orchestration
- CI/CD systems and release automation
- Monitoring, alerting, and observability platforms
- Security best practices (secrets management, access controls, compliance)
- Reliability engineering and incident management
- Cost management and cloud efficiency
- 10+ years of experience in software, Dev Ops, or infrastructure engineering.
- 4+ years of experience managing and growing engineering teams.
- Experience writing IaC 3+ years
- Well versed in the AWS services and resources
- Strong hands‑on experience with modern cloud platforms and Dev Ops tooling.
- Proven ability to balance strategic leadership with deep technical execution.
- Experience supporting high‑availability, production‑grade systems.
- Strong communication skills and ability to influence across teams and leadership levels.
- Experience leading Dev Ops/SRE teams in a scaling or high‑growth environment.
- Experience with the following technologies
- Terraform (Terragrunt)
- Gitops (ArgoCD)
- Atlantis
- Kubernetes
- Docker
- Grafana
- Prometheus
- PostgreSQL
- Elastic Search
- Background working with global or distributed teams.
- Prior experience owning production on‑call and incident response at scale.
- ML/Computer Vision experience
- A high‑performing Dev Ops team with clear ownership, strong morale, and measurable impact.
- Reliable,…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).