Principal Site Reliability Engineer
Listed on 2026-02-21
-
IT/Tech
Systems Engineer, Cloud Computing, IT Support, Cybersecurity
Since being founded in 2018, Copper has been building the standard for institutional digital asset infrastructure with a focus on custody, collateral management, and prime services.
Led by Amar Kuchinad, Copper's Global CEO, the firm provides a comprehensive suite of custody, trading and settlement solutions that reduce counter party risk and bring greater capital and operational efficiency to digital asset markets. At the heart of Copper's offering is Multi‑Party Computation (MPC) technology – the gold standard in secure custody. Copper’s multi‑award winning custody system is unique in that it can be connected to centralised exchanges, DeFi applications and even staking pools without the assets leaving the custody.
Built on top of this state‑of‑the‑art custody, Clear Loop is the first solution in the market that overcomes a growing industry challenge; counter party risk with exchanges. This solution underpins a full prime services offering, connecting global exchanges, and enabling customers to trade and settle directly from the safety of their MPC‑secured wallets. By reducing settlement time for transfers to a few milliseconds (without blockchain network dependency) and offering enhanced security measures, Clear Loop is rapidly reshaping the way asset managers trade and manage capital.
In addition to industry‑leading security certifications, Copper has one of the strongest insurance coverages in the industry from an A+ rated insurer, positioning the firm as the partner of choice for institutions seeking to safeguard their assets.
Department OverviewThe Engineering department is the backbone of Copper; entrusted with the critical responsibility of building and securing the infrastructure that safeguards billions of dollars in digital assets. We operate at the intersection of traditional institutional finance and decentralized blockchain technology.
Team PurposeWe believe the best infrastructure comes from curious minds working openly together. Our engineering organisation brings together people with diverse backgrounds and experiences who genuinely enjoy solving hard problems. We foster a blame‑free, intellectually honest environment where you're encouraged to take thoughtful risks and think big. You'll have the autonomy to own meaningful projects while gaining the support you need to grow your skills and advance your career.
RolePurpose
We're looking for a site reliability engineer at the intersection of software development and systems engineering. In this role, your focus will be delivering the reliability, performance, and uptime our clients depend on, whilst continuously improving how our systems operate.
Day‑to‑day, you'll tackle complex infrastructure challenges through a combination of thoughtful engineering and considered automation. Rather than fighting fires, you'll work to prevent them. You'll analyse system capacity and performance, identify bottlenecks, and build solutions that scale across our department, not just one team.
You will collaborate closely with the Dev Ops team, but focusing primarily on reliability engineering, operational excellence, and production readiness across the broader engineering org.
Key Responsibilities- Shape SRE;
Define how we think about reliability, observability, and operational excellence. Drive the adoption of SRE principles across the organization while building the systems and processes that make those principles measurable – think SLIs, SLOs and error budgets. - Scale Through Automation;
Champion architectural improvements that enhance both system reliability and deployment velocity. Provide consultation on system architecture, building reusable platforms and frameworks, planning capacity needs, and conducting production readiness reviews to ensure services launch and operate successfully. - Drive Technical Excellence;
Engage in and improve the lifecycle of microservices, from inception through deployment, operation, observability, and continuous refinement. - Lead Through Influence;
Partner with engineering and product leadership to embed reliability into our product development lifecycle. Conduct blameless post‑mortems and drive systemic improvements in…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).