Platform Engineer/SRE
Listed on 2025-12-12
-
IT/Tech
Cloud Computing, SRE/Site Reliability, Systems Engineer, IT Support
Location: Zürich
About Crypto Finance
Crypto Finance Group, part of Deutsche Börse Group, provides professional digital asset solutions to institutional clients. The Group comprises Crypto Finance AG, regulated by FINMA in Switzerland, offering trading, custody, and wallet services, as well as Crypto Finance (Deutschland) GmbH, regulated by BaFin in Germany, offering trading and custody services. As of 25 January 2025, Crypto Finance secured a MiCAR licence for the European market as one of the first providers in the EU.
Crypto Finance AG is a SIX‑approved crypto custodian for ETP issuers.
For more information, please visit our website at About us – Crypto Finance.
About the roleFrom our office in the Prime Tower, Zurich, we build cloud‑native systems that enable institutional clients to access, trade, and manage digital assets securely and efficiently.
We are currently looking for an SRE & Platform Engineer focused on application reliability, platform enablement, and the engineering experience of our internal teams. You will ensure that the backend services powering our Trading, Custody, Settlement, Staking, and Pledging platforms run reliably, scalably, and efficiently.
Our systems run on Google Cloud Platform (GCP), orchestrated with Kubernetes, and are composed of ~25 microservices designed for high availability and multi‑region capability.
This role is a blend of :- Application‑level SRE: reliability, performance, observability
- Platform engineering: improving the platform used by internal development teams
- Tooling: building internal tools that make engineering faster, safer, and more productive
If you enjoy solving reliability challenges, simplifying developer workflows, and designing cloud‑native systems that scale, this role is the perfect fit.
Responsibilities Application Reliability & Operations- Ensure reliability, performance, and the correct behavior of ~25 microservices in production
- Investigate and resolve application‑level issues, from rollout problems to distributed system quirks
- Define, monitor, and improve SLIs, SLOs, and operational health indicators
- Lead incident response and post‑mortems with a blameless culture
- Improve our application platform on GCP + Kubernetes, focusing on developer experience and reliability
- Support high availability and multi‑region deployments
- Work on service templates, automation, and platform‑wide improvements that help all teams ship faster and more safely
- Collaborate with Backend and Security teams to design systems for reliability from the start
- Build and maintain internal tools that simplify workflows for engineering teams
- Improve local development workflows and service onboarding
- Reduce friction across CI / CD, testing, and deployment processes
(examples: CLI tools, deployment helpers, scaffolding generators, debugging aids, automated checks)
CI / CD & Git Ops- Operate and improve our Git Ops pipelines with ArgoCD
- Design safer deployment strategies (progressive rollouts, canary, blue / green)
- Ensure reliable and repeatable delivery processes
- Improve observability across logging, metrics, and tracing
- Reduce alert noise and improve actionable insights
- Build dashboards and tooling that help teams understand system health at a glance
- Degree in Computer Science or equivalent experience
- 3‑5 years in SRE, Dev Ops, platform engineering, or application‑focused operations
- Strong understanding of distributed systems and modern backend architectures
- Hands‑on experience operating microservices in GCP and Kubernetes
- Comfortable with cloud‑native patterns (Autoscaling, RBAC, Helm / Kustomize)
- Strong understanding of reliability concepts (SLI / SLO / SLA, error budgets, resilience patterns)
- Experience with observability stacks (metrics, logs, tracing)
- Experience with ArgoCD or other Git Ops tooling
- Familiarity with REST, Web Socket, FIX; SWIFT is a plus
- Experience building internal tools or automation for engineering teams
- Ability to debug complex issues across multiple services
- Interest in digital assets and / or blockchain technologies
- Strong sense of operational quality, security, and reliability
- Professional proficiency in English (German is a plus)
- Elig…
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search: