×
Register Here to Apply for Jobs or Post Jobs. X

Principal Cache Engineer Scale Distributed Systems

Job in San Francisco, San Francisco County, California, 94199, USA
Listing for: Roblox
Full Time position
Listed on 2026-05-29
Job specializations:
  • Software Development
    Software Engineer, DevOps, Cloud Engineer - Software
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below
Position: Principal Cache Engineer - High-Scale Distributed Systems

Requirements

  • Experience &

    Education:

    A BS degree in Computer Science (or equivalent professional experience) with at least 8+ years of hands-on software engineering experience
  • ,
  • Distributed Systems Expertise:
    Deep domain knowledge in building and operating large-scale distributed systems
  • ,
  • Infrastructure Chops: A strong builder mindset with proven experience running Active/Active distributed systems on container orchestrators like Kubernetes or Nomad
  • ,
  • Programming Proficiency:
    Strong, hands-on programming experience in Go and C++
  • ,
  • Problem-Solving Track Record:
    Proven success in resolving massive-scale bottlenecks, such as overcoming the limitations of decentralized Gossip protocols or mitigating partial failures in distributed systems
  • ,
  • Observability

    Skills:

    Hands-on experience with modern telemetry and observability stacks (e.g., Prometheus, Grafana, Alert Manager, Kibana)
  • ,
  • [Bonus] Open Source Contributions: A track record of contributing to or maintaining major open-source caching projects such as Redis, Val Key, or Memcached
  • ,
  • [Bonus] Advanced Cache Internals:
    Experience extending cache functionality (e.g., writing custom Redis modules in C/Rust, complex Lua scripting) or deep-tuning underlying memory allocators like jemalloc
  • ,
  • [Bonus] Caching Proxies & Topologies:
    Experience with caching proxies (e.g., Twemproxy, Envoy Redis filter) and designing complex, multi-tiered caching architectures
What the job involves
  • As a Principal Engineer on the Cache team (part of the Infra Storage org), you will innovate and operate large-scale, in-house distributed systems to solve Roblox's ever-growing caching challenges
  • ,
  • You will report directly to the Engineering Manager for the Cache team
  • ,
  • Lead the architectural transition to a next-generation, multitenant caching service built on Val Key, ensuring strict data, resource, and failure isolation for all tenants
  • ,
  • Drive systemic optimizations to mitigate head-of-line blocking, manage hot keys, and maximize CPU and memory utilization across physical machine clusters
  • ,
  • Design and build robust frameworks to automate development, chaos testing (fault/latency injection), and monitoring for 24x7 mission-critical services, targeting 99.99%+ availability and elastic scalability
  • ,
  • Champion engineering best practices by leading design reviews, performance benchmarking, failure drills, and blameless post-incident retrospectives
  • ,
  • Mentor and empower engineers, fostering a culture of deep domain expertise and seamless knowledge sharing across the Storage, Platform, and Product teams
#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary