Growth Experiments

Growth Experimentation Examples: 12 That Worked

12 publicly documented growth experiments that worked, the one trait the winners share, and how to turn each pattern into a scored experiment of your own.

Most "growth experiment examples" lists are decoration: a gallery of wins with no through-line you can use. This one has a through-line. Across these 12 publicly documented experiments, the winners share one trait, and it is not creativity. Each one removed a specific friction or tested a specific belief, then measured a single number. That is the whole game.

A note on the numbers: these are publicly documented A/B test case studies, with results as reported by the companies and their testing platforms (VWO, Optimizely, and company write-ups). Treat them as patterns, not promises. The same change on your product will not produce the same percentage; the transferable part is the move, not the number.

What separates a real experiment from a tweak

A growth experiment succeeds when it starts from a falsifiable hypothesis rooted in a real understanding of the customer, not a surface-level guess. Random testing wastes resources. The systematic version identifies a bottleneck, forms a hypothesis, tests the smallest version, and scales the winner.

Critical success factors include:

• Deep customer understanding before testing

• Clear metrics tied to business goals

• Rapid iteration cycles

• Documentation of learnings

• Cross-functional team alignment

Many growth experiments consist of combining existing systems in novel ways to create value, take for example the early-days Airbnb example of posting their listings on Craigslist. Innovation often comes from creative connections, not only new inventions.

12 Growth Experiments That Delivered Real Results

PayU Checkout Simplification

By simply eliminating the email address field from the form placed on their Checkout page, the company was able to register an improvement of 5.8% in conversions as compared to the control. One less field. Significant revenue impact.

The lesson? Friction kills conversions. Every form field represents a decision point where users can abandon. PayU tested this hypothesis and proved that asking for mobile numbers alone was sufficient for their checkout process.

EA's SimCity Pre-Order Experiment

Electronic Arts challenged conventional wisdom about discount incentives. Ditching the discount didn't just work, it worked wonders, boosting sales by over 40 percent compared to the control. The clean, straightforward offer outperformed the 20% discount promise.

SimCity enthusiasts weren't in it for the savings, they just wanted to play the latest SimCity and probably weren't interested in buying other games. Understanding true customer motivation beats applying generic best practices.

Vestiaire's Influencer Content Testing

Vestiaire's strategic use of A/B testing led to over 4,000 new app installs and halved the cost per install. The fashion marketplace allowed influencers creative freedom, then amplified top-performing content through paid advertising.

This two-stage approach, creating diverse content then testing performance before scaling spend, demonstrates smart resource allocation. Test cheap, scale winners.

Going's CTA Language Optimization

A simple word change produced dramatic results. The new "Trial for free" CTA led to a 104% increase in trial starts month-over-month. This significant uplift not only improved conversion rates through paid channels but also surpassed the performance of organic traffic for the first time.

Language matters. "Trial for free" frames the offer as a benefit-first action, while generic CTAs create ambiguity about next steps.

Campaign Monitor's Direct-to-Trial Strategy

The variant employing DTR demonstrated an enormous 31.4% increase in conversions, which in this case meant signing up for a trial of Campaign Monitor's software. Removing unnecessary steps between interest and product experience accelerated decision-making.

The experiment ran for 77 days with 1,274 visits, proving that meaningful results don't always require massive traffic. Statistical significance matters more than volume.

Bannersnack's CTA Button Redesign

Bannersnack discovered that a larger, higher-contrast call-to-action button made a huge difference. The company used heatmap data to identify that visitors weren't clicking their original CTA, then tested design variations until finding the winning combination.

Multiple testing rounds, each informed by behavioral data, created incremental improvements that compounded into significant conversion gains.

Zalora's Button Consistency Test

By simply bringing uniformity to Zalora's call to action button, the eCommerce giant saw an increase of 12.3% in its checkout rate. Inconsistent design creates cognitive friction. Consistency builds trust and reduces hesitation.

This experiment highlights how small UX details impact major business metrics. A unified button style across the checkout flow removed subtle barriers to completion.

Intertop's Checkout Optimization

Intertop's conversion rate increased by 54.68% in the test variant. When they officially rolled out the changes, the average revenue per user (ARPU) grew by 11.46%, and the checkout bounce rate decreased by 13.35%.

The compound effect of multiple optimizations, tested systematically, transformed their checkout experience. Revenue per user growth demonstrates that better UX doesn't just convert more visitors, it encourages higher-value purchases.

Google Workspace Trial Period Adjustment

Google's team found that a 30-day trial was too long, and most users had made a call by day three. If reducing your trial period proves to not hurt your conversion rate, this change can also help your team accelerate testing cycles.

Shorter trials mean faster feedback loops and quicker revenue recognition. The counterintuitive finding that less time converts better challenges industry assumptions about trial lengths.

ShopClues Navigation Optimization

ShopClues found that the main navigation bar links on the homepage were getting a lot of clicks, especially "Wholesale," while the others were not. The company decided to replace the "Wholesale" section with other marketing categories such as "super saver" bazaar, etc. It also moved the 'Wholesale' section to the left side of the site.

Analyzing which navigation elements attracted attention, then reorganizing based on user behavior rather than internal preferences, sent better-qualified traffic to category pages.

AdonisClothing's Model Variation Test

The bearded model variation outperformed the control by 49.73%, leading to a 33% increase in orders. The fashion retailer discovered that women shopping for men preferred product images featuring bearded models.

The key insight is that clients are buying into an image. Understanding your audience's preferences and aligning product presentation with consumer psychology can significantly boost conversions.

Remote Company's Email Subject Line Testing

By testing different subject line versions, she was able to boost open rates by an average of 10% on tested campaigns compared to those that ran without testing. Consistent A/B testing of email subject lines created reliable performance improvements across campaigns.

Small percentage gains across thousands of sends translate to hundreds or thousands of additional opens, clicks, and conversions. Compounding matters.

Why Most Companies Fail at Experimentation

Many organizations resist experimentation because of short-term pressures. Teams feel the need to deliver immediate results, making tests seem risky or unnecessary. This mindset kills innovation before it starts.

The data reveals a stark reality. If the average success rate of the growth experiments is 10-20%, how many wins should you produce to impress your boss, teammates or yourself with the experiment's results? Low testing volume guarantees failure.

Common experimentation killers:

• Fear of negative results

• Lack of executive buy-in

• Insufficient traffic for significance

• Poor documentation systems

• Siloed teams competing internally

• Overthinking instead of shipping

Most experiments run were failures, but we learned a lot and it meant that the successful experiments felt that much sweeter. It's clear that HubSpot has a very robust experimentation program. It might seem heavy and most startups don't need that level of rigor. Instead, you should be focused on getting experiments live and learning. Don't let documentation and process be a barrier to learning and progress.

Building Your Experimentation Framework

Three pillars are critical to success: culture, process, and skills. Without all three, experimentation efforts stall or produce unreliable results.

Culture means celebrating learning over being right. Failed experiments provide valuable insights. The most successful marketers encourage curiosity, reward learning (and failing), and give employees the freedom to experiment without fear of failure.

Process creates consistency. ICE scoring (Impact, Confidence, Ease) is easiest and fastest to score because the score names are easy to understand, anyone on the team can answer the questions and it's relatively short. Prioritization frameworks prevent teams from testing random ideas.

Skills enable execution. Growth teams need to combine a wide array of skill sets, from marketing, copy-writing and psychology to analytics and engineering. Cross-functional collaboration accelerates testing velocity.

Measuring What Matters

Success might be defined by improving a metric by a certain percentage or getting the answer to a question. Clear success criteria prevent goal-shifting mid-experiment.

Leading indicators matter more than vanity metrics. Track conversion rates, activation rates, retention curves, and revenue per user. These metrics connect directly to business outcomes.

Companies that systematically test ideas and make decisions based on real data are able to gather better insights, make better decisions, and ultimately, grow faster. Data-driven beats opinion-driven every time.

Scaling Your Testing Velocity

Best-in-class companies, such as Amazon, Netflix, and Meta, run thousands of experiments every single day. If we open the Uber App now, each of us will likely see different versions of it as a result of different experimentation buckets.

Volume creates advantage. More tests mean more learnings, more winners, and faster iteration cycles. Start with 3-5 experiments weekly, then scale systematically.

TSheets scaled to 20 A/B tests per month by building an experimentation-forward culture. This required tools, training, and executive support to maintain momentum.

Velocity accelerators include:

• Dedicated experimentation tools

• Pre-built test templates

• Automated result tracking

• Weekly review meetings

• Shared learning repositories

Key Takeaways

• Test systematically, not randomly – Base experiments on customer insights and clear hypotheses

• Volume beats perfection – Run more experiments to find more winners faster

• Small changes compound – 5-10% improvements across multiple touchpoints create exponential growth

• Document everything – Failed tests provide insights for future experiments

• Speed matters – Faster iteration cycles accelerate learning and growth

• Culture enables execution – Teams need permission to fail and resources to succeed

Frequently Asked Questions

How many experiments should we run each month?

Start with 3-5 experiments weekly if you have sufficient traffic. Booking.com runs over 25,000 tests annually. Airbnb ramped up from 100 to over 700 tests per week in just two years. Scale based on team capacity and traffic volume.

What if we don't have enough traffic for statistical significance?

If you operate with a small data set, it could prevent you from running statistically significant tests. But you can still experiment by turning to qualitative research. Some intelligence is always better than no intelligence. Use heatmaps, user interviews, and session recordings.

How long should experiments run?

Run tests until reaching statistical significance or predetermined time limits. The A/B test ran for 77 days, during which the landing pages accumulated a total of 1,274 visits. Lower traffic sites need longer test periods.

What's the typical success rate for growth experiments?

The average success rate of growth experiments is 10-20%. This means most tests fail to beat the control. High testing volume becomes critical for finding winners.

Should we test multiple changes simultaneously?

Test one variable at a time for clear attribution. Multivariate testing and A/B testing are different methods. The first one tests multiple versions and changes at the same time, while A/B tests focus on the impact of a single change. Start simple, then advance to multivariate testing with sufficient traffic.

How do we prioritize which experiments to run first?

Use ICE scoring (Impact, Confidence, Ease) because it's easiest and fastest to score, the score names are easy to understand, anyone on the team can answer the questions and it's relatively short. Focus on high-impact, high-confidence, easy-to-execute tests first.

What tools do we need to start experimenting?

Begin with Google Analytics for tracking, Hotjar for behavioral insights, and basic A/B testing tools like Google Optimize or Optimizely. Bannersnack turned to Hotjar click heatmaps to investigate how users interacted with the page. With this tool, the company could visualize the areas with the most clicks and see spots website visitors ignored.

Your Next Move

Growth experimentation separates companies that scale from those that stagnate. The examples above prove that systematic testing, even with modest traffic, produces measurable results.

Start with one experiment this week. Form a clear hypothesis, define success metrics, and run the test. Document your learnings regardless of outcome. Repeat weekly.

What experiment will you run first?

How to turn these examples into your own experiments

A pattern list only matters if it changes what you test next. Every winner above began as a falsifiable hypothesis that moved one metric. To do the same: write each idea as a structured hypothesis, score the candidates with ICE and ROTI so the fast, cheap tests run before the slow builds, and start from a scored growth experiment template instead of a blank doc. The growth experimentation guide covers the full program, and the experiment database keeps the backlog ranked as you run it.