×
Hier anmelden um sich kostenlos auf Stellen zu bewerben oder Stellenanzeigen aufzugeben. X

Senior SRE Engineer Cloud Operations

in 10115, Berlin, Berlin, Deutschland
Unternehmen: Qdrant
Vollzeit position
Verfasst am 2026-01-15
Berufliche Spezialisierung:
  • IT/Informationstechnik
    Cloud Computing, Site Reliability Ingenieur/in
Stellenbeschreibung

Qdrant is a cutting-edge vector database company on a mission to revolutionize how organizations manage and query unstructured data. Our open-source engine and managed cloud solutions power AI-driven search recommendation and data discovery  are a remote-first company building a global team of passionate engineers to push the boundaries of database infrastructure.

As a Senior Dev Ops / SRE Engineer on the Cloud Operations team you will focus on keeping Qdrant Cloud reliable observable and secure as usage and infrastructure complexity grow. Your primary responsibility is operational excellence : stability incident response and continuous improvement of production systems.

This role is operations-heavy ideal for engineers who thrive in owning reliability and reducing operational risk at scale.

Tasks
  • Operate and maintain production cloud infrastructure at scale
  • Own Kubernetes infrastructure networking and deployment pipelines
  • Improve monitoring logging alerting and operational visibility
  • Lead incident response root cause analysis and follow-up actions
  • Reduce operational toil through automation and better tooling
  • Improve reliability security and performance of production systems
  • Collaborate closely with Platform and Regions & Clusters teams
  • Maintain and evolve runbooks operational procedures and alerts
  • Participate in on-call rotations and continuous reliability improvements
Requirements Must have
  • 5 years of experience in Dev Ops SRE or infrastructure operations roles
  • Strong hands-on experience operating Kubernetes in production
  • Solid knowledge of Linux systems networking and cloud infrastructure
  • Experience working with AWS GCP or Azure
  • Strong understanding of monitoring alerting and incident management
  • Experience with infrastructure-as-code and automation tooling
  • Comfortable owning on-call responsibilities and production incidents
  • Strong operational mindset and clear communication skills
Nice to have
  • Experience with Terraform or similar IaC tools
  • Familiarity with Prometheus Grafana Loki or Open Telemetry
  • Exposure to security compliance or hardening initiatives
  • Scripting experience in Python Bash or Go
  • Experience in SaaS cloud or data infrastructure environments
Benefits
  • Competitive salary equity and benefits
  • Fully remote setup with flexible working hours
  • Clear ownership of reliability and operational excellence
  • Opportunity to work on mission-critical customer-facing infrastructure
  • Strong collaboration with platform and engineering teams

If you enjoy keeping complex systems reliable and improving operations through automation and discipline wed love to hear from you.

Recruiting Agencies and Headhunters please only via 𝙝𝙞𝙧𝙚

#J-18808-Ljbffr
Stellen-Anforderungen
10+ Jahre Berufserfahrung
Bitte beachten Sie, dass derzeit keine Bewerbungen aus Ihrem Zuständigkeitsbereich für diese Stelle über diese Jobseite akzeptiert werden. Die Präferenzen der Kandidaten liegen im Ermessen des Arbeitgebers oder des Personalvermittlers und werden ausschließlich von diesen bestimmt.
Um nach Stellen zu suchen, sie anzusehen und sich zu bewerben, die Bewerbungen aus Ihrem Standort oder Land akzeptieren, klicken Sie hier, um eine Suche zu starten:
 
 
 
Suchen Sie hier nach weiteren Stellen:
(nach Beruf, Fähigkeit)
Standort
Increase search radius (miles)

Sprache der Stellenausschreibung
Lebenslauf-Kategorie
Bildungsgrad
Filter
Mindest-Bildungsgrad für die Stelle
Mindest-Berufserfahrung für die Stelle
Veröffentlicht in den letzten:
Gehalt