Senior Site Reliability Engineer Job San Francisco area,California USA,IT/Tech

We’re building

Anuma.ai at Zeta Chain: an ambitious privacy AI product designed to solve hard problems and deliver real value. Backed by top-tier investors, our team is pushing the boundaries of what AI can do in the real world. If you’re excited by meaningful challenges and building products from the ground up, this is the place for you. About the Role

We are looking for a
Senior Site Reliability Engineer (SRE) to ensure the reliability, scalability, and security of Anuma.ai's production infrastructure.

This role is highly hands‑on and execution‑focused. You will operate critical blockchain and AI‑adjacent infrastructure, build automation to reduce operational overhead, and partner closely with protocol, platform, and AI teams to design systems that are reliable by default.

What You’ll Do

Operate and maintain
production blockchain infrastructure
, including validators, RPC services, indexers, and supporting services
Ensure high availability and performance for
AI‑enabled developer platforms and internal tooling
Build and maintain
monitoring, alerting, and dashboards
for protocol, infrastructure, and application health
Write high‑quality
automation and infrastructure code
to reduce toil and improve reliability
Participate in
on‑call rotations
, incident response, and post‑incident reviews
Partner with engineering teams to embed
reliability, scalability, and security best practices
into system design
Improve Kubernetes reliability across
cloud and bare‑metal environments
Continuously refine deployment, rollback, and recovery strategies

Minimum Qualifications

Our ideal candidate description is a wish list, not a checklist. We don’t expect every applicant to meet every requirement.

4+ years of experience in
Site Reliability Engineering, Infrastructure Engineering, or Platform Engineering
Strong software engineering background with production experience in
Go and/or Python
Proven experience running
Kubernetes at scale
Experience supporting
high‑availability distributed systems
Comfortable working in
fast‑moving startup environments
Strong
security mindset
, especially for infrastructure running on public or adversarial networks
Excellent collaboration skills
Languages:Go, Python, Bash, Terraform, Ansible
Platforms:AWS, GCP, bare metal
Blockchain Stack:Cosmos SDK, Tender mint / Comet

BFT, Ethereum, Bitcoin

Bonus Points

Exposure to
AI‑powered infrastructure, observability, or developer tooling
Experience operating
blockchain nodes or validator infrastructure
Familiarity with
Cosmos‑based chains
or EVM clients
Experience with
Dev Ops, Dev Sec Ops , or Git Ops
methodologies
Contributions to
open‑source software

In-Office Culture

We believe that collaboration is supercharged when we share space together. Many members of our team work
hybrid from our San Francisco office
, and we aim for3 in-office days per week
. We know life happens, whether it’s travel, appointments, or family needs and we’re flexible when the schedule needs to shift. But generally, we value showing up, building together, and keeping the energy high. The company is a mix of remote and local team members.

Compensation

Base Salary:$140,000 – $190,000
This range reflects base salaries for roles in the San Francisco market. For candidates in other locations, compensation is adjusted to remain competitive within their local market.

In addition to the base salary, all full-time team members receive an additional 10% to 25% in liquid benefits with upside based on role, experience, and impact. We believe in building together and sharing in the long-term success of the network. Compensation packages are designed to be competitive and aligned with the growth of both the team and the ecosystem.

#J-18808-Ljbffr


Increase/decrease your Search Radius (miles)



Job Posting Language