Principal Software Engineer - Storage Cache
Listed on 2026-05-02
Software Development
Software Engineer, Cloud Engineer - Software, DevOps, Backend Developer
Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators.
At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there.
A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.
Roblox’s Cache Team
Roblox's Cache team is building a next-generation caching solution designed to deliver sub-millisecond average latency, horizontal scalability, and high efficiency, all at a drastically lower cost. Our ultimate vision is to shape a caching infrastructure capable of supporting 1 billion Daily Active Users while reducing costs by 90%. We are turning hours of onboarding and capacity expansion into seconds, freeing service owners entirely from managing cluster life cycles.
As a Principal Engineer on the Cache team (part of the Infra Storage org), you will innovate and operate large-scale, in-house distributed systems to solve Roblox's ever-growing caching challenges. You will report directly to the Engineering Manager for the Cache team.
Check out our recent engineering blog post to learn more about the team's latest work!
You Will:
- Lead the architectural transition to a next-generation, multitenant caching service built on Valkey, ensuring strict data, resource, and failure isolation for all tenants.
- Drive systemic optimizations to mitigate head‑of‑line blocking, manage hot keys, and maximize CPU and memory utilization across physical machine clusters.
- Design and build robust frameworks to automate development, chaos testing (fault/latency injection), and monitoring for 24x7 mission‑critical services, targeting 99.99%+ availability and elastic scalability.
- Champion engineering best practices by leading design reviews, performance benchmarking, failure drills, and blameless post‑incident retrospectives.
- Mentor and empower engineers, fostering a culture of deep domain expertise and seamless knowledge sharing across the Storage, Platform, and Product teams.
- Experience & Education: A BS degree in Computer Science (or equivalent professional experience) with 8+ years of hands‑on software engineering experience.
- Distributed Systems Expertise: Deep domain knowledge in building and operating large‑scale distributed systems.
- Infrastructure Chops: A strong builder mindset with proven experience running Active/Active distributed systems on container orchestrators like Kubernetes or Nomad.
- Programming Proficiency: Strong, hands‑on programming experience in Go and C++.
- Problem‑Solving Track Record: Proven success in resolving massive‑scale bottlenecks, such as overcoming the limitations of decentralized Gossip protocols or mitigating partial failures in distributed systems.
- Observability Skills: Hands‑on experience with modern telemetry and observability stacks (e.g., Prometheus, Grafana, Alertmanager, Kibana).
- [Bonus] Open Source Contributions: A track record of contributing to or maintaining major open‑source caching projects such as Redis, Valkey, or Memcached.
- [Bonus] Advanced Cache Internals: Experience extending cache functionality (e.g., writing custom Redis modules in C/Rust, complex Lua scripting) or deep‑tuning underlying memory allocators like jemalloc.
- [Bonus] Caching Proxies & Topologies: Experience with caching proxies (e.g., Twemproxy, Envoy Redis filter) and designing complex, multi‑tiered caching architectures.
For roles that are based at our headquarters in San Mateo, CA:
The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job‑related factors such as professional background, training, work experience, location, business needs and…