More jobs:
Senior Experimentation Engineer
Job in
San Francisco, San Francisco County, California, 94199, USA
Listed on 2026-06-15
Listing for:
Confidential
Full Time
position Listed on 2026-06-15
Job specializations:
-
IT/Tech
Data Analyst, AI Engineer (Applied/Software), Data Engineering, Machine Learning/ ML Engineer
Job Description & How to Apply Below
Responsibilities
- Design and build the core experimentation platform infrastructure: experiment assignment service, randomization and traffic splitting, multi-layer experiment conflict detection, experiment configuration management (low-code, ≤30 min setup), and full lifecycle tooling (creation, monitoring, graduation, rollback) — supporting App, Web, and backend surfaces simultaneously
- Build automated metric pipelines that connect experiment assignments to full-funnel business outcomes — Signup CVR, eFTD CVR, eFTT CVR, deposit amount, trading volume — with sub-day latency; solve the cross-device identity bridging problem (device user s the registration boundary) and implement "time-to-convert" metrics (e.g., eFTD within N minutes of page exposure) that are currently unavailable
- Implement advanced statistical methods: CUPED and stratified variance reduction to improve experiment sensitivity without increasing sample size, sequential testing for early stopping (mSPRT / always-valid inference), network interference correction for referral and social experiments, and pre-experiment SRM (Sample Ratio Mismatch) checks; design and execute conversion lift studies to quantify the causal impact of product changes on business metrics
- Build self-serve experiment creation and analysis tooling for product managers, growth marketers, and data scientists — including experiment design wizards, power calculators, automated significance reporting, and decision support dashboards; reduce the experiment launch cycle from "requires 2 days of engineering" to "self-serve in under 30 minutes" for all three platforms (App, Web, backend)
- Establish experiment quality standards: event instrumentation requirements, guardrail metric monitoring, automated anomaly detection to prevent shipping regressions, and an experiment knowledge base that documents winning patterns, failed hypotheses, and domain-specific learnings — ensuring teams learn from each other rather than rediscovering the same findings
- Partner with Growth Product, Personalization (千人千面), Trade
GPT, ByX Community, and Asia-Pacific data engineering teams to design and analyze high-impact experiments across personalization, campaign optimization, user journey, push notifications, community feed, and AI product features; serve as the internal authority on causal inference and experiment design across all business units - Build the experimentation platform as the feedback engine for Bybit's AI strategy: design automated attribution systems and causal inference pipelines that deliver real-time feedback signals to AI models (Trade
GPT response ranking, personalization algorithms, push notification optimization) — enabling AI models to self-iterate based on causal experiment results rather than correlation-only metrics; build experiment bloodline tracking that traces how each AI model version performs across user segments, and end-to-end observability for recommendation and growth systems that accelerates the iteration cycle from research to production deployment - Define engineering standards, conduct design reviews, and mentor junior engineers; drive cross-team adoption of experimentation best practices as the US team grows
- 5+ years of industry experience in experimentation engineering, data engineering, or growth engineering at a consumer-scale internet company
- Proven track record building and operating large-scale A/B testing infrastructure: experiment assignment, metric pipelines, statistical analysis, and self-serve tooling serving hundreds of experiments simultaneously
- Deep expertise in causal inference and experimental statistics: hypothesis testing, power analysis, CUPED/variance reduction, sequential testing (mSPRT, always-valid inference), network effects and interference correction, conversion lift studies, and treatment effect estimation; ability to apply statistical test theories to optimize user experience and validate business decisions
- Strong proficiency in Python and SQL; hands‑on experience with real-time data processing frameworks — Flink Streaming or Spark Structured Streaming — as well as data warehouses (Hive, Click House, or equivalent),…
Position Requirements
10+ Years
work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×