Senior DevOps Engineer
Listed on 2025-12-02
IT/Tech
Cloud Computing, Systems Engineer, SRE/Site Reliability, IT Support
At Hiro, we build developer tools that bring Web3 to Bitcoin. Our suite of tools unlocks the full potential of Bitcoin through smart contracts, digital assets, and decentralized applications. With Hiro tools, developers can test and deploy smart contracts, spin up nodes and other server-side resources for scaling, and more. Building on Bitcoin is hard. Hiro's developer tools make it easier.
We’re very proud to say that Hiro has consistently been recognized on Built In’s Best Places to Work list, including 2025’s Best Remote Startups to Work For, Best Startups to Work For in New York City, 50 Best U.S. Startup Companies to Work For, and 100 Best Remote Places to Work.
Hiro is funded and backed by more than $75 million from Union Square Ventures, Y Combinator, Lux Capital, Winklevoss Capital, Naval Ravikant, and others.
About The Opportunity
Hiro is seeking a DevOps Engineer to help scale, secure, and optimize the infrastructure powering the Stacks ecosystem. You’ll be responsible for Kubernetes-based systems that underpin the Stacks Public Testnet, API Gateway, Chainhook-2.0, and Platform services, ensuring that our developer community and partners can build on reliable and observable infrastructure. DevOps engineers at Hiro are hybrid systems and software engineers with a two-pronged mission:
- Architect and operate Kubernetes clusters across production, staging, and testnet environments.
- Build and manage API Gateway infrastructure (Kong, Istio) for high-scale external APIs.
- Deploy and monitor Chainhook-2.0 within Platform, ensuring resilient predicate registration and event streaming.
- Own Stacks Public Testnet reliability, including alerting, metrics, and incident response.
- Implement IaC (Terraform) for all infrastructure resources.
- Improve CI/CD pipelines for internal services deployed into private Kubernetes clusters.
- Enhance observability (Grafana, Prometheus, Loki) with SLO-based alerts and distributed tracing.
- Partner with blockchain and platform teams to ensure smooth rollouts and incident preparedness.
- Improve the tooling and automation for building, testing, and deploying software and services
- Collaborate with engineering teams on building and launching new products and features
- Evangelize and implement industry best practices to improve the security and ease-of-use of our production environment
- Diagnose problems from all sides and quickly narrow down potential solutions
- Debug production issues across services and multiple levels of the stack
- Improve operational standards, tooling, and processes
- Engineer solutions that automate and streamline monitoring and incident escalation, and that improve resiliency and uptime
- 8+ years in DevOps/SRE roles operating high-availability distributed systems.
- Deep Kubernetes expertise (StatefulSets, Operators, service meshes, scaling strategies).
- Strong background in API infrastructure: load balancing, API Gateways (Kong, Nginx), rate limiting, caching.
- Proficiency with container technologies like Docker or containerd
- Skilled in IaC (Terraform, Pulumi) and CI/CD (GitHub Actions, ArgoCD).
- Experience with observability stacks: Prometheus, Grafana, Loki, Jaeger/OTEL.
- Strong networking knowledge (L4/L7 load balancers, TLS termination, DNS, ingress controllers).
- Proficiency with scripting languages (Bash, Python, Go, or Node.js).
- Cloud experience (AWS/GCP/Azure), ideally multi-cloud.
- Excellent communication skills and comfort working with a diverse team across time zones
- Great ownership, humility, and bias to action
- Able to see a problem from all sides and quickly narrow down potential solutions
- Experience deploying, scaling, and troubleshooting production services
- Experience running blockchain nodes in production.
- Familiarity with high-volume APIs and event-driven systems.
- Security hardening of Kubernetes (RBAC, PSP/OPA, auditing, secrets management).
- Performance tuning of Postgres/Redis for high-scale API workloads.
- Experience…