About the Role
As a Staff Platform Engineer, you are the owner of the mission‑critical cloud infrastructure that powers the Wunder Graph Cosmo platform for our enterprise customers. Your primary responsibility is ensuring the reliability, performance, and scalability of this core platform by defining and meeting stringent SLOs. This role blends deep operational leadership with product‑focused infrastructure engineering in Go. You will architect our internal systems for scale, build and operate key product infrastructure—including our customer‑facing telemetry pipeline built on Open Telemetry and Click House, and the AI pipeline that powers our products.
We seek a hands‑on technical leader who thrives on solving ambiguous, big‑scale problems.
- You align with the Head of Engineering.
- You collaborate closely with the engineering team and customers.
- Enable our engineering teams to ship features for Wunder Graph Cosmo fast, reliably, and with confidence through a world‑class Internal Developer Platform (IDP).
- Take full ownership of our core platform infrastructure and services—and own them completely, from architecture to operation.
- Drive the architectural vision for our platform, making key decisions on technologies such as Kubernetes, Infrastructure as Code, and our observability stack.
- Bring deep platform expertise to the table, leveling up the entire team through mentorship, architectural guidance, and championing best practices.
- Grow with Wunder Graph as we scale, expanding your influence across product and organization while helping build a world‑class engineering team.
- Architect, build, and operate the core cloud‑native infrastructure for Wunder Graph Cosmo and Hub, primarily using Go and Kubernetes.
- Own and evolve our observability stack (Open Telemetry, Prometheus, Click House) and the infrastructure supporting our AI‑driven features to ensure deep, actionable insights into our systems.
- Build and optimize CI/CD pipelines to improve build times, automate quality and security gates, and create a seamless path to production for our engineers.
- Champion and implement Infrastructure as Code (IaC) best practices using tools like Terraform, building reusable and maintainable modules for our teams.
- Embed security best practices into the platform by designing and implementing network policies, RBAC, and automated checks to meet enterprise and SOC 2 compliance standards.
- Mentor other engineers, provide insightful code and design reviews, and document platform features and architectural decisions to foster a culture of collaboration and knowledge sharing.
- Proven experience architecting and operating scalable, highly available, and secure cloud‑native platforms in production, with strong proficiency in Go and deep expertise in Kubernetes
. - You thrive in the dynamic environment of a scaling, remote‑first company that has navigated strategic pivots and is on a rapid growth trajectory.
- Deep expertise in a major cloud provider (AWS, GCP, Azure) and Infrastructure as Code tools (e.g.,
Terraform
, Pulumi). - A strong understanding of system architecture, distributed systems, and the challenges of running high‑performance API gateways. Familiarity with Graph
QL Federation is a significant plus. - Experience building or managing modern observability stacks (e.g., Open Telemetry, Prometheus, Grafana, Click House).
- A self‑starter attitude and a leader’s mindset: you are comfortable with ambiguity, can identify and solve ill‑defined problems, and don’t need hand‑holding.
- Excellent written and verbal communication skills, with the ability to articulate complex technical concepts clearly in design documents, RFCs, and asynchronous discussions.
- Wunder Graph’s engineering teams are highly productive, shipping features faster and with more confidence because the internal platform you’ve built is reliable, self‑service, and provides an exceptional developer experience (DX).
- Our platform infrastructure scales seamlessly and reliably to meet the demands of our largest enterprise customers, like eBay, solidifying Cosmo’s reputation for performance and stability.
- You…
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search: