Lead Site Reliability Engineer
New York, New York County, New York, 10261, USA
Listed on 2026-05-21
Software Development
About the team
Alloy’s Infrastructure Team is a small team (6 engineers) responsible for a large and growing infrastructure footprint: 15+ Kubernetes clusters, 100+ databases, dozens of services, and complex data organization.
Our challenge isn’t just scale—it’s making that scale reliable, secure, and operable with less manual work.
We’re looking for engineers who enjoy turning complex, fragile systems into automated, self‑service platforms with strong safety guarantees.
What you’ll be doing
Reporting to the Engineering Manager of Infrastructure, you’ll:
- Design and build systems to automate infrastructure management at scale (provisioning, upgrades, migrations)
- Reduce operational toil by turning manual processes into reliable, repeatable workflows
- Build internal tooling and platforms that enable safe self‑service changes for other engineers
- Improve the reliability and resilience of our infrastructure (Kubernetes, databases, services)
- Implement and evolve systems for deploying and running applications in Kubernetes
- Contribute to architecture decisions across infrastructure, reliability, and security
- Write and review production‑quality code
- Participate in on‑call rotations—but focus on building systems that prevent incidents, not just respond to them
What we’re looking for
- 5+ years of experience in infrastructure, SRE, or software engineering roles
- Strong software engineering skills—you build systems, not just scripts
- Experience managing production infrastructure at scale (cloud + containerized systems)
- Experience with Infrastructure as Code (e.g., Terraform)
- Experience running and troubleshooting distributed systems (Docker/Kubernetes)
- Experience with observability and debugging tools (Datadog, CloudWatch, ELK/EFK, etc.)
- Proficiency in at least one programming language (Python, Go, JavaScript, etc.)
- Experience participating in on‑call rotations and improving systems based on incidents
- Strong communication and collaboration skills
You might be a great fit if you
- Default to automation over manual processes
- See repetitive work and immediately want to eliminate it
- Think in terms of systems, failure modes, and long‑term scalability
- Care about building infrastructure that other engineers can use safely and confidently
- Enjoy working in a small team with high ownership and impact
Nice to have
- Experience running Kubernetes in production at scale
- Deep familiarity with AWS
- Experience building internal platforms or developer tooling
- Background in distributed systems or large‑scale data systems
We're a lean team, so your impact will be felt immediately, and opportunities will grow as the company scales up.
Compensation
Alloy is committed to fair and equitable compensation practices. The anticipated starting base compensation range for this role is $151,000 to $191,000. In addition to a competitive base salary, this position is eligible for equity awards in the form of stock options (ISOs) as well as a competitive total benefits package.
Benefits and Perks
- Unlimited PTO and flexible work policy
- Employee stock options
- Medical, dental, vision plans with HSA (monthly employer contribution) and FSA options
- 401(k) with 100% match up to 4% of annual employee compensation
- Eligible new parents receive 16 weeks of paid parental leave
- Home office stipend for new employees
- Annual Learning & Development stipend
- Well‑being benefits include access to ClassPass, One Medical, UrbanSitter, and Spring Health
- Hybrid work environment: employees are expected to work Tuesdays through Thursdays from our HQ in Union Square, Manhattan. Tasty lunches catered from a variety of local restaurants and frequent employee‑organized cultural events contribute to our positive office energy. On Mondays and Fridays, most employees work from home over Zoom, while some take advantage of the quieter office.
Alloy is proud to be an equal‑opportunity workplace and employer. We’re committed to equal opportunity regardless of race, color, ancestry, religion, gender, gender identity, parental or pregnancy status, national origin, sexual orientation, age, citizenship, marital status, disability, or veteran status. We are committed to an inclusive interview experience and provide reasonable accommodations to applicants with visible and invisible disabilities. We encourage applicants to share needed accommodations with our recruiters.