Senior Site Reliability Engineer
Listed on 2026-02-16
IT/Tech
Systems Engineer, SRE/Site Reliability
We’re building Anuma.ai at ZetaChain: an ambitious privacy AI product designed to solve hard problems and deliver real value. Backed by top-tier investors, our team is pushing the boundaries of what AI can do in the real world. If you’re excited by meaningful challenges and building products from the ground up, this is the place for you.
About the Role
We are looking for a Senior Site Reliability Engineer (SRE) to ensure the reliability, scalability, and security of Anuma.ai's production infrastructure.
This role is highly hands‑on and execution‑focused. You will operate critical blockchain and AI‑adjacent infrastructure, build automation to reduce operational overhead, and partner closely with protocol, platform, and AI teams to design systems that are reliable by default.
What You’ll Do
- Operate and maintain production blockchain infrastructure, including validators, RPC services, indexers, and supporting services
- Ensure high availability and performance for AI‑enabled developer platforms and internal tooling
- Build and maintain monitoring, alerting, and dashboards for protocol, infrastructure, and application health
- Write high‑quality automation and infrastructure code to reduce toil and improve reliability
- Participate in on‑call rotations, incident response, and post‑incident reviews
- Partner with engineering teams to embed reliability, scalability, and security best practices into system design
- Improve Kubernetes reliability across cloud and bare‑metal environments
- Continuously refine deployment, rollback, and recovery strategies
Our ideal candidate description is a wish list, not a checklist. We don’t expect every applicant to meet every requirement.
- 4+ years of experience in Site Reliability Engineering, Infrastructure Engineering, or Platform Engineering
- Strong software engineering background with production experience in Go and/or Python
- Proven experience running Kubernetes at scale
- Experience supporting high‑availability distributed systems
- Comfortable working in fast‑moving startup environments
- Strong security mindset, especially for infrastructure running on public or adversarial networks
- Excellent collaboration skills
- Languages: Go, Python, Bash, Terraform, Ansible
- Platforms: AWS, GCP, bare metal
- Blockchain Stack: Cosmos SDK, Tendermint / CometBFT, Ethereum, Bitcoin
- Exposure to AI‑powered infrastructure, observability, or developer tooling
- Experience operating blockchain nodes or validator infrastructure
- Familiarity with Cosmos‑based chains or EVM clients
- Experience with DevOps, DevSecOps, or GitOps methodologies
- Contributions to open‑source software
We believe that collaboration is supercharged when we share space together. Many members of our team work hybrid from our San Francisco office, and we aim for 3 in-office days per week. We know life happens, whether it’s travel, appointments, or family needs, and we’re flexible when the schedule needs to shift. But generally, we value showing up, building together, and keeping the energy high. The company is a mix of remote and local team members.
Base Salary: $140,000 – $190,000
This range reflects base salaries for roles in the San Francisco market. For candidates in other locations, compensation is adjusted to remain competitive within their local market.
In addition to the base salary, all full-time team members receive an additional 10% to 25% in liquid benefits with upside based on role, experience, and impact. We believe in building together and sharing in the long-term success of the network. Compensation packages are designed to be competitive and aligned with the growth of both the team and the ecosystem.