Senior Software Engineer, Platform & Infrastructure
Deutschland
Verfasst am 2026-07-03
-
IT/Informationstechnik
AWS
Your role
We are a leading SaaS provider in our field and operate a large number of customer environments in our cloud. To ensure that these environments run smoothly, we are looking for you! Together with your colleagues on the Platform & Infrastructure team, you will be responsible for the scalability, security, and efficiency of our cloud infrastructure — covering AWS, Kubernetes, observability, and the internal developer platform that our product teams build on.
We ship continuously — on average 150 deployments to production every month — and your work is what keeps that pace fast, safe, and sustainable. You will actively contribute to shaping our technical roadmap, mentor engineers around you, and help cultivate a progressive and learning environment.
- Own and evolve our Kubernetes platform across multiple clusters: manage Helm chart deployments via an OCI registry hosting 40+ charts, enforce policy-as-code with Kyverno, and operate Git Ops workflows through Argo CD Application Sets with progressive delivery orchestrated by Kargo
- Drive technical designs for platform initiatives: scope the problem, propose multiple solutions, assess their trade-offs, and defend your recommendation — then see it through to production
- Harden the platform's security posture: workload identity via OIDC, runtime security, image scanning and secrets management
- Write and maintain custom Kubernetes operators and internal tooling in Go and Python that multiply the team's leverage across clusters — we run Zalando postgres-operator alongside our own operators
- Maintain and improve our observability stack (Prometheus, Grafana, Thanos, Open Search): build dashboards and alerts that give product teams real visibility into their services
- Keep Git Lab CI/CD pipelines fast and reliable — they power ~150 production deployments per month; execute cross-team changes (rollouts, database migrations, certificate rotations) with care and clear communication
- Operate and evolve AWS infrastructure (EKS, VPC, IAM, RDS, S3) including dedicated customer environments in their own AWS accounts; drive cost-efficiency initiatives tracked via Open Cost
- Own incidents end-to-end: from alert to fix to postmortem
- Raise the team's bar through thorough code and architecture reviews, mentor less experienced engineers, and help us assess technical candidates in interviews
- Be the infrastructure partner product teams come to when things are unclear or broken
- Participate in a shared on‑call rotation
Must-have
- 5+ years of professional engineering experience, including 3+ years in infrastructure, platform, or site reliability engineering
- Deep hands‑on experience with Kubernetes
: cluster operations, workload management, troubleshooting at scale - Helm chart authoring
: writing, packaging, and maintaining charts — not just consuming them - Git Ops experience with Argo CD or an equivalent tool (Flux, etc.)
- Working knowledge of AWS (EKS and supporting services such as IAM, VPC, RDS, S3)
- Experience with Infrastructure as Code (
Terraform or equivalent) - Proficiency in Go or Python — we write custom operators and internal tooling in both
- Experience owning production incidents end-to-end — response, mitigation, and postmortem
- Strong English communication skills, with the ability to explain technical decisions to both engineers and non‑technical stakeholders
- Kubernetes operator development experience (
Kubebuilder or similar) - Progressive delivery tooling (
Kargo
, Argo Rollouts, or equivalent) - Policy‑as‑code experience (
Kyverno or OPA) - Runtime security tooling (
Falco or equivalent) - Familiarity with observability tooling (
Grafana
, Open Search/Elasticsearch
, or equivalent) - Experience with CI/CD pipelines (
Git Lab CI preferred) - Experience mentoring engineers or participating in technical hiring
- German language skills (helpful for customer site interactions)
You will participate in an on‑call rotation. Incidents require fast response and clear communication. On‑call burden is tracked and addressed through rotation and capacity planning.
Perks & BenefitsJoin our team and simply be yourself – we celebrate diversity! Our entrepreneurial culture is driven by talented and passionate team. We offer full flexibility to suit your needs, whether you prefer working remotely, in the office, or in a hybrid setup. And if you’d like to work from abroad, you can do so for up to 180 days. We also provide a competitive compensation package and 30 days of paid vacation to help you recharge.
With a generous tech budget, you can choose the hardware and software that suits you best.
- Flexible working arrangements (remote, office, or hybrid)
- Modern office in the heart of Hanover for hybrid work
- Up to 180 days (6 months) of remote work from abroad
- Competitive compensation with benefits offering
- 30 days (6 weeks) of paid vacation
- Modern hardware and software solutions
Um nach Stellen zu suchen, sie anzusehen und sich zu bewerben, die Bewerbungen aus Ihrem Standort oder Land akzeptieren, klicken Sie hier, um eine Suche zu starten: