Location: Germany
Who we are:
With several hundred thousand servers in operation, Hetzner is one of the largest web hosting providers and data center operators in Europe. We provide our customers with high-tech products we develop in house, and with reliable infrastructure at fair prices. More than 500 Hetzner employees work every day to help shape the digital future and provide our customers with simple, efficient solutions to complex problems.
You want to ensure systems run reliably — and you enjoy finding weaknesses long before they become real problems?
Join us as a Dev Ops Engineer (m/f/d) on the Observability team at our locations in Falkenstein/Vogtland or Nuremberg! With cutting‑edge tools like Open Search, Thanos, and Open Telemetry, you’ll help us evolve our monitoring and logging infrastructure and support development teams in running complex systems with transparency and stability.
Your responsibilities:- Be responsible for the design, implementation, and operation of our observability tools
- Manage and further develop distributed systems for logging, metrics, and tracing using Open Search, Thanos, and Open Telemetry
- Ensure the scalability, performance, and high availability of our critical observability infrastructure
- Collaborate with developer teams to define and implement observability requirements across monitoring, logging, and tracing systems
- Troubleshoot and resolve complex technical issues in production environments
- Improve and maintain Kubernetes operators (including contributions to open source)
- Expand and optimize our internal tooling ecosystem
- A degree in computer science, a comparable qualification, or relevant professional experience
- Strong expertise in Dev Ops methodologies and tools (e.g. CI/CD; containerization with Docker and Kubernetes; infrastructure as code with Terraform, Ansible, or Puppet)
- Knowledge in the field of observability — especially with Open Search (or Elasticsearch), Thanos, Jaeger, and Open Telemetry
- Hands‑on experience in administering and scaling Open Search clusters, including index management, sharding, and performance tuning
- Confident use of Prometheus and Thanos for cross‑cluster metric collection and aggregation
- Solid understanding of Open Telemetry and its use in standardizing traces, metrics, and logs
- Strong communication skills in both English and German, and a collaborative, team‑oriented mindset
- Experience debugging and maintaining Thanos is a plus
At Hetzner Online, your tasks will be exciting, challenging, and varied. Our corporate philosophy emphasizes personal interaction, efficient decision making processes, and a strong DIY mentality. In addition to numerous social benefits, flexible working hours (where feasible), and opportunities for both professional and personal development, we can offer you an attractive salary.
Contact:Do you have any questions? Feel free to reach out to us at or call our contact person for this position.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).