Data Engineer, Notifications
San Francisco, San Francisco County, California, 94199, USA
Listed on 2026-04-23
IT/Tech
Data Engineering, Data Analyst
Join the Future of Commerce with Whatnot!
Whatnot is the largest livestream shopping platform in North America and Europe to buy, sell, and discover the things you love. Whether it's trading cards, fashion, electronics, or live plants, our sellers are building real businesses across hundreds of categories. We're building live commerce at a scale that's never been done in the West, and there's no playbook to copy. The people here are shaping how an entirely new industry develops.
Whatnot is inspired by our values and anchored in hubs across the US, UK, Ireland, Poland, Germany, and Australia. We move fast, stay close to our users, and focus on the work that drives the most impact.
We're one of the fastest growing marketplaces and were recently named the #1 Best Startup Employer in America by Forbes. Check out the latest Whatnot updates on our news and engineering blogs and join us as we enable anyone to turn their passion into a business and bring people together through commerce.
Role
At Whatnot, data engineers build foundational data systems that power product development, experimentation, machine learning, and operational decision-making across the company. As a Data Engineer on the Notifications Platform, you will play a critical role in owning and evolving one of the highest-volume and highest-impact data domains at Whatnot.
Notifications generate hundreds of millions of events per day, and the underlying data systems support everything from ML iteration and experimentation to ad hoc engineering debugging. In this role, you will define the technical direction for the Notifications data mart, harden it to production-grade reliability, and set new patterns for domain-owned data engineering.
Working in a highly cross-functional role, you will collaborate closely with Product Engineers, Data Scientists, ML Engineers, and the Analytics Platform team. You’ll make key architectural decisions around data modeling, reliability, latency, and cost — and then make them real.
On any given day, you will:
Own data architecture end-to-end. Define how we capture, model, and serve critical business data—then implement it in production. You’ll make architectural decisions around storage formats, compute patterns, and SLAs that balance cost, scalability, and consistency.
Build mission-critical pipelines. Develop and operate batch data workflows that process high-volume events related to notifications with tight guarantees for latency, completeness, and accuracy.
Design and implement canonical models. Create domain-oriented data models that serve as the source of truth for analytics, ML, and production applications. Establish and enforce modeling standards, ownership boundaries, and data contracts across teams.
Enforce data quality. Build tests, lineage, monitoring, and reconciliation systems that make every dataset observable and every anomaly actionable.
Automate operational workflows. Partner with business systems and platform teams to eliminate manual data handoffs and reconcile data across services, warehouses, and external systems.
Enable insights and experimentation. Support analytics, ML, and product engineering teams by exposing high-quality, self-healing, maintainable data assets.
We offer flexibility to work from home or from one of our global office hubs, and we value in-person time for planning, problem-solving, and connection. Team members in this role must live within commuting distance of one of our New York, Seattle, Los Angeles, or San Francisco hubs.
Curious about who thrives at Whatnot? We’ve found that embodying a low ego, growth mindset, and high-impact drive goes a long way here.
As our next Data Engineer, you bring deep experience designing reliable, scalable data systems and are excited to take true ownership of a complex product domain.
You should have 5+ years of experience in the data or software engineering domain, plus:
Strong experience building and maintaining production-grade data pipelines with clear SLAs, monitoring, and alerting
Deep expertise in SQL, including complex model graphs, dependency management, and performance optimization
Comfort writing production-grade code in Python or SQL…