Product Manager - Cloud Storage
Bellevue, King County, Washington, 98009, USA
Listed on 2025-12-10
IT/Tech
Systems Engineer, Cloud Computing
Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. Our customers range from AI researchers to enterprises and hyperscalers. Lambda's mission is to make compute as ubiquitous as electricity and give everyone the power of superintelligence. One person, one GPU.
If you'd like to build the world's best AI cloud, join us.
Note: This position requires presence in our San Francisco or Bellevue office location 4 days per week; Lambda’s designated work-from-home day is currently Tuesday.
The Product Manager, Cloud Storage Platform is a senior technical leader responsible for setting vision, strategy, and architecture for Lambda’s storage infrastructure across cloud and hybrid environments. You will own the complete lifecycle of our storage platform — from ultra–high-performance block and file systems to petabyte- and exabyte-scale object storage — ensuring it delivers unmatched performance, durability, scalability, and cost efficiency for the most demanding AI workloads in the world.
This role demands deep expertise across both software-defined and cloud-native storage architectures, along with the ability to unify them into a seamless, high-performance platform. You will define how storage is delivered, managed, and scaled globally, influencing multi-billion-dollar infrastructure investments and guiding world-class engineering teams to deliver storage capabilities that set a new industry benchmark for AI infrastructure.
Key Responsibilities
Define and execute the long-term vision and strategic roadmap for Lambda’s storage platform across cloud and hybrid environments, ensuring it delivers uncompromising performance, scalability, durability, and cost efficiency for the world’s largest AI workloads.
Lead the evaluation, selection, and seamless integration of advanced storage technologies — spanning block, file, and object architectures — using rigorous benchmarking to optimize IOPS, throughput, latency, and total cost of ownership.
Translate complex infrastructure capabilities into clear product requirements, precise service-level objectives (SLOs), and measurable performance benchmarks that align with demanding AI and HPC use cases.
Architect and implement intelligent data tiering strategies (hot, warm, cold) to maximize performance where it matters and drive significant cost savings at scale.
Collaborate with infrastructure and operations leaders to forecast multi-year capacity growth, design for petabyte-to-exabyte scalability, and ensure consistent performance under peak workloads.
Define and enforce lifecycle management, replication, and disaster recovery policies that guarantee data integrity, compliance, and near-zero downtime.
Own the observability and optimization roadmap for the storage platform, deploying advanced telemetry, monitoring, and analytics to proactively detect and remediate bottlenecks before they impact customers.
Partner closely with engineering to drive continuous performance tuning, eliminate systemic inefficiencies, and ensure the platform remains ahead of industry benchmarks.
Bachelor’s degree or foreign equivalent in Computer Science, Electrical Engineering, Computer Engineering, or a closely related technical field.
Seven (7) years of progressive, post-baccalaureate experience in product management, including at least four (4) years focused specifically on cloud-scale storage or infrastructure platforms.
Proven expertise in the following areas, demonstrated within the required seven (7) years of experience:
Designing and delivering large-scale storage platforms, including block, file, and object architectures, for performance-critical workloads.
Evaluating and selecting storage technologies through benchmarking of throughput, IOPS, latency, durability, and total cost of ownership.
Architecting and managing storage solutions for petabyte- to exabyte-scale datasets, including intelligent tiering strategies.
Defining lifecycle management, replication, and disaster recovery strategies to ensure data durability and high availability.
Integrating storage services across hybrid and multi-cloud…