Staff Engineer - Site Reliability
Listed on 2025-11-09
-
IT/Tech
Systems Engineer, Cloud Computing
Staff Engineer - Site Reliability (d/f/m)
Why Scout
24?
Join Scout
24, Germany's leading real estate platform, Immo Scout
24, and be part of a diverse and inclusive team of over 1,000 colleagues from 58 nationalities. We're revolutionizing the real estate market and building a digital ecosystem that connects homeowners, seekers, and agents. At Scout
24, you'll have the opportunity to grow, innovate, and make a real impact.
- Competitive salary and bonus
- Flexible hybrid working model and autonomous time management
- 30 days of vacation
- Work from abroad opportunities
- Relocation support
- Immo Scout
24 Plus membership for employees - Dedicated learning time, online courses, and individual career paths
- Family service for childcare support
- Dog-friendly office (upon approval)
- Subsidized public transport or Job Bikes
- Commitment to sustainability and DEI
- Company pension scheme
- Innovative, barrier-free office with gym, napping room, and more
- Health check-ups and life situation coaching
- Latest technical tools and hardware
The Opportunity:
Staff Engineer, Site Reliability
We're seeking a passionate Staff Engineer, Site Reliability to lead our reliability program and drive a culture where reliability is embedded in our engineering DNA. You'll be a technical leader with company‑wide influence, shaping our engineering culture and driving initiatives at the organizational level.
Responsibilities:
- Design and implement a comprehensive reliability strategy aligned with business outcomes.
- Lead the definition of reliability targets (SLOs) for data‑driven decisions.
- Guide the technical roadmap for reliability and observability.
- Improve incident handling processes and automation.
- Define and implement our disaster recovery framework in the cloud.
- Partner with Platform teams to build observability and reliability tooling.
- Provide technical leadership on reliability to engineers, managers, and product managers.
- Foster cross‑departmental collaboration for a holistic approach to reliability.
- Mentor teams on operating resilient systems.
What you need to succeed:
- Significant experience in software engineering with scalable distributed systems (8+ years preferred).
- Strong SRE background with deep understanding of SLIs, SLOs, error budgets, and observability.
- Platform engineering/Dev Ops expertise and passion for monitoring distributed systems.
- Experience with incident management, post‑incident analysis, and organizational change.
- Experience leveraging AI tools to drive problem‑solving and productivity.
- Technical leadership experience.
- Growth mindset and strong collaboration abilities.
- Excellent written and spoken English.
We are committed to diversity and inclusion and support the professional development of people with disabilities. Please let us know if you need assistance during the application process.
How to ApplyInterested in this position? Please submit your resume and cover letter through the application portal.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).