Member of Technical Staff; SRE
Listed on 2026-01-01
-
Software Development
Cloud Engineer - Software, DevOps
Location: New York
Member of Technical Staff (SRE)
Join to apply for the Member of Technical Staff (SRE) role at Cockroach Labs.
OverviewCategory‑defining tech. Career‑defining work. Lots of tech companies disrupt, but many fail when they try to scale. We're different. Cockroach
DB makes it easier for companies to build and scale apps. We tackle problems head‑on and focus on solutions that create lasting impact because when our customers win, we all win.
Cockroach
DB provides the backbone of storing data on a global scale. The core mission of the SRE team is to operate at scale a secure and reliable Cockroach Cloud product. We provide consultation, planning, architectural oversight, concrete designs, development, and implementation that improve the resilience, efficiency, performance, and availability of our Cloud service. We also take pride in being good on‑call engineers, and we believe that reflection on the on‑call experience can contribute to short‑, medium‑ and long‑term improvements of the core product, including to CRDB itself.
Will
- Manage the infrastructure for cloud services, including running internal production systems and hosting Cockroach
DB for our external customers. - Design, write and deliver software and systems to increase product reliability and operational efficiency.
- Develop custom tools as necessary.
- Keep a complex system running and solve problems relating to mission‑critical services.
- Design, implement, operate, and troubleshoot the automation and monitoring of production clusters to maximize performance and availability.
- Drive the company through disaster recovery tests, where we manually turn down pieces of Cockroach
DB to test its overall resilience to failures. - Participate in an on‑call rotation for our production systems and hosted services.
In your first 30 days, you will onboard and be exposed to our current internal and customer‑facing production systems. Working with our existing SRE and engineering teams, you will pair on production operations and build out runbooks for the operation of different systems. After 3 months, you'll be fully integrated into the team, developing and owning tooling for reliability, automation, and other issues related to Cockroach Cloud’s stability and scalability.
YouHave
- Expertise in analyzing, monitoring, and troubleshooting large‑scale distributed systems.
- Experience in software development using one or more of the following:
Go, C, C++, Python, Java. - Proficiency working with algorithms, data structures, and production troubleshooting.
- Expertise in working with major cloud providers (AWS, Azure, GCP, etc.) and Cloud APIs.
- Debugged and optimized code and automated routine tasks.
- Working knowledge of web and network protocols and standards (HTTP, TLS, DNS, etc.)
- Prior on‑call experience, exhibiting sense of ownership, attention to detail, and urgency.
- Experience building collaborative relationships with colleagues, enjoying the code review process and partnering on challenging problems.
We are a group of software engineers first & foremost. We use software engineering as a means to achieve our mission; this is the SRE way. The SRE team is currently distributed across North America (5) and India (4).
Reporting toTom Schmidt – Director, Production Engineering. Tom recently joined Cockroach Labs as manager of Site Reliability Engineering and has taken responsibility for Cockroach Cloud’s production operations. He has 15 years at IBM and extensive experience in quality, automation, SRE advocacy, and leadership.
Jordan Lewis – VP, EngineeringJordan is the Head of Engineering for Cockroach Labs, responsible for the teams that build, maintain, and keep Cockroach
DB reliably serving the needs of the most demanding customer base. He joined in 2016 when the company was 25 people.
- Stock Options
- Medical Insurance
- Vision Insurance
- Dental Insurance
- Life and Disability Insurance
- Professional Development Funds
- Flexible Time Off
- Paid Holidays
- Paid Sick Days
- Paid Parental Leave
- Retirement Benefits
- Mental Wellbeing Benefits
- And more!
The annual anticipated base salary range for U.S. candidates for this role is $154,000—$203,950 USD.
Equal Opportunity…(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).