growth-experiment
name: growth-experiment
description: "Identify the true conversion bottleneck and design an experiment that addresses root cause — the difference between a 2% button-color lift and a 10x funnel fix."
/growth-experiment
Growth teams that run button color tests are not running growth experiments — they're running UI experiments. The bottleneck is almost never the button. It's the moment 60% of your activated users hit a friction point and quietly leave, or the acquisition channel flooding your funnel with users who never had purchase intent. This skill forces a structured bottleneck diagnosis before any experiment is designed — so the experiment addresses the real drop-off, not the loudest hypothesis in the room.
The Leaky Bucket — Find the Worst Stage
Map your funnel with actual numbers at each stage:
- Acquisition → Activation → Habit → Retention → Revenue → Referral
- Where is the largest absolute drop? (Not the largest percentage drop — the stage where the most people fall off in raw numbers)
- Where has drop-off worsened in the last 90 days?
- Which stage, if fixed, would most move your north-star metric?
- This is your bottleneck stage. Everything below is downstream; everything above is upstream. Work here first.
Which Cohort Is Worst
- Segment the bottleneck stage by acquisition channel: which source produces users who convert at half the average rate?
- Segment by product version, plan tier, or user persona if applicable
- Segment by time: is drop-off worse for users acquired in the last 30 days, or is it consistent across cohorts?
- Find the cohort that is 2x worse than your best-performing cohort — that gap is your experiment opportunity
The Behavior Signal
- What do users in the worst cohort actually do before they drop off? (Session replay, click maps, event logs)
- What do users who convert do differently? (Even one behavioral difference is a hypothesis)
- What's the last thing users in the drop-off cohort interact with before they leave?
- Have you surveyed the drop-offs? (Exit surveys with a single question often surface root cause faster than analytics)
Root Cause Hypothesis — Pick One
- State it as a falsifiable claim: "Users drop at the [step] because [mechanism], which we know because [evidence]"
- Example: "Users drop at the team-invite step because the value of the product is only visible after 3+ colleagues join, and solo users hit a dead end — evidenced by 72% of 7-day churners having zero connected teammates"
- What's the alternative hypothesis you're not pursuing? Why?
- Is this a motivation problem (users don't understand why to do it), a friction problem (they want to but can't easily), or a fit problem (wrong user in the funnel)?
Experiment Design
- What's the minimum change that tests the hypothesis? (Not a redesign — one targeted intervention)
- What is the control group experiencing today?
- What exactly changes for the treatment group?
- What's the primary metric — conversion through the bottleneck stage, not a vanity metric
- What's the minimum detectable effect at your current traffic volume? (Use your actual weekly users — don't run an underpowered test)
- How many weeks do you need to reach statistical significance?
Realistic Impact Estimate
- If the hypothesis is correct and the experiment works, what does the funnel metric improve to?
- What does that improvement translate to in revenue or growth rate over 12 months?
- Is this a one-time lift or does it compound?
- What's the downside if it doesn't work — is the variant clearly worse, or just neutral?
Rules
- The bottleneck stage must be identified with actual funnel data before an experiment is proposed — no skipping to ideation
- One hypothesis per experiment — testing multiple changes simultaneously is not an experiment
- The root cause must be classified as motivation/friction/fit — this determines the type of intervention
- You must have enough weekly traffic to reach significance in under 6 weeks — if not, redesign the scope
- The primary metric must be a conversion rate, not an engagement metric
This output is a single experiment brief tied to the real bottleneck — with a root cause hypothesis, a specific change, a traffic calculation, and an impact estimate that justifies running it.