Site Reliability Engineer - Platform
Listed on 2025-12-27
IT/Tech
Systems Engineer, Cloud Computing, SRE/Site Reliability
Ready to be pushed beyond what you think you’re capable of?
At Coinbase, our mission is to increase economic freedom in the world. It’s a massive, ambitious opportunity that demands the best of us, every day, as we build the emerging onchain platform — and with it, the future global financial system.
To achieve our mission, we’re seeking a very specific candidate. We want someone who is passionate about our mission and who believes in the power of crypto and blockchain technology to update the financial system. We want someone who is eager to leave their mark on the world, who relishes the pressure and privilege of working with high caliber colleagues, and who actively seeks feedback to keep leveling up.
We want someone who will run towards, not away from, solving the company’s hardest problems.
Our work culture is intense and isn’t for everyone. But if you want to build the future alongside others who excel in their disciplines and expect the same from you, there’s no better place to be.
While many roles at Coinbase are remote‑first, we are not remote‑only. In‑person participation is required throughout the year. Team and company‑wide offsites are held multiple times annually to foster collaboration, connection, and alignment. Attendance is expected and fully supported.
The Reliability Engineering team helps realize our vision by supporting Coinbase engineering teams to build software that is world‑class in terms of its reliability. As a core service team, Coinbase Reliability Engineers work closely with the rest of engineering. We proactively seek out and gather state‑of‑the‑art best practices from the industry. Through education and advocacy, we seek to ensure that reliability is a core value of our engineering culture.
We level up other engineers by sharing deep knowledge, performing proactive analysis and improving processes, tools, and automation. Ultimately, Reliability Engineering succeeds when all engineering teams are able to build reliable software on their own.
Our Reliability Engineering team highly values people with intellectual curiosity and openness. We collaborate across the organization, helping our engineers think big and take risks while building a culture of diversity, positive energy and blameless truth‑seeking. We encourage self‑starting on high‑impact projects within the context of strong support and mentorship.
What we look for in you (i.e. job requirements)
- Improve observability, reliability and availability by defining and measuring key metrics
- Build automation and improve systems to eliminate toil and operations work.
- Collaborate with our core infrastructure team to performance tune and optimize our cloud deployments. (Think Docker, Terraform, Kubernetes, EC2, etc.)
- Collaborate with Coinbase product teams to reduce service disruptions and automate incident response
- Proactively find and analyze reliability problems across our business units and stack, then design and implement software to create step‑function improvements.
- Educate, mentor and hold accountable the engineering team to improve the reliability of our systems and make reliability a core value of the Coinbase engineering culture.
- Write high quality, well tested code to meet the needs of your customers.
- Debug extremely difficult technical problems, and make systems and products work better and easier to deploy, own, operate and diagnose.
- Review all feature designs within your product area and across the company for cross‑cutting projects.
- Be an owner of the security, safety, scale, operational integrity, and architectural clarity of these designs.
- Build pipelines to integrate with 3rd party vendors
- Participate in an on‑call support rotation to provide timely troubleshooting and resolution of urgent issues.
- Experience designing and building reliable systems capable of handling high throughput and low latency
- Experience with observability and monitoring systems such as Kibana, Datadog, etc.
- Familiarity with working in rapid growth environments
- Experience in Ruby, Go, and Terraform
- Experience with AWS, GCP, Azure, or another cloud environment
- Experience designing and building reliable systems
- Experience…