Sr/Infrastructure/Site Reliability Engineer; SRE
Listed on 2026-02-16
IT/Tech
Systems Engineer, Cloud Computing
Sr./Staff - Infrastructure/Site Reliability Engineer (SRE)
Shape the future of trust in the age of AI
At Oscilar, we're building the most advanced AI Risk Decisioning™ Platform. Banks, fintechs, and digitally native organizations rely on us to manage their fraud, credit, and compliance risk with the power of AI. If you're passionate about solving complex problems and making the internet safer for everyone, this is your place.
Why Join Us
- Mission-driven teams: Work alongside industry veterans from Meta, Uber, Citi, and Confluent, all united by a shared goal to make the digital world safer.
- Ownership and impact: We believe in extreme ownership. You'll be empowered to take responsibility, move fast, and make decisions that drive our mission forward.
- Innovate at the cutting edge: Your work will shape how modern finance detects fraud and manages risk.
The Role
Oscilar is growing fast, and so is the complexity of our systems. We’re looking for an experienced SRE to take ownership of reliability across our multi-region, cloud-native platform. You’ll have the mandate and autonomy to design, implement, and evolve systems that stay performant and resilient through traffic spikes, dependency failures, and global deployments. You’ll shape how we scale, how we build observability, and how we run infrastructure that supports billions of events and large-scale data pipelines.
What You’ll Own
- Architect and operate resilient cloud infrastructure (AWS, Pulumi, Kubernetes).
- Lead initiatives to improve availability, latency, and performance at scale.
- Design and evolve our CI/CD pipelines to optimize for speed, safety, and repeatability.
- Define the metrics, alerts, and runbooks that form our observability backbone.
- Run chaos experiments and failure simulations to harden the platform.
- Mentor engineers and set best practices for SRE across the company.
Requirements
- Proven track record as a senior SRE, DevOps, or infrastructure engineer in high-scale environments.
- Expert-level skills in AWS and Infrastructure as Code (Pulumi, Terraform).
- Strong programming ability in Go and Java.
- Deep understanding of distributed systems (Kafka, ClickHouse) and microservices architecture.
- Mastery of container orchestration (Kubernetes) and production debugging.
- Strong sense of ownership, and the judgment to balance velocity with reliability.
- Seniority level: Mid-Senior level
- Employment type: Full-time
- Job function: Engineering and Information Technology
- Industries: Technology, Information and Internet