What is minimum detectable effect (MDE)?

MDE is the smallest difference between variants that you want to reliably detect. For example, a 2% MDE means you want to detect at least a 2 percentage point improvement. Smaller MDE requires larger sample sizes.

What confidence level should I use?

95% is the industry standard for email tests. It means there's only a 5% chance the observed difference is due to random chance. High-stakes tests (like pricing changes) may warrant 99% confidence.

What is statistical power?

Power (typically 80%) is the probability of detecting a real effect when one exists. 80% power means you have an 80% chance of correctly identifying a true winner. Higher power requires larger samples.

My list is too small for the required sample. What should I do?

Increase your MDE (look for bigger effects), lower your confidence level to 90%, or accumulate results across multiple sends. Testing subject lines (which produce larger effects) is easier with small lists.

Can I stop a test early if one variant is clearly winning?

Only if you use sequential testing methods designed for early stopping. Standard tests require the full sample to be collected. Peeking at results and stopping early inflates false positive rates significantly.

How long should I wait after sending to measure results?

Wait at least 24–48 hours for open rate tests and 48–72 hours for click rate tests. For conversion-based tests, wait through your full attribution window (typically 7 days) before analyzing.

Email A/B Test Calculator

Calculate the required sample size for statistically significant email A/B tests. Ensure reliable experiment results.

Metric Being Tested

Baseline Open Rate

Min Detectable Effect (MDE)

Confidence Level

Statistical Power

Number of Variants

List Size

Daily Email Sends

Sample Per Variant

7,552

Minimum recipients per version

Total Sample Size

15,104

2 variants combined

List Feasibility

Sufficient

Need 30.21% of your 50,000 list

Est. Days to Collect

8 days

At 2,000 sends/day

Detecting

2pp change

From 25.0% to 27.0% open rate

False Positive Risk

At 95% confidence level

List Coverage Required

30.2% of list

0%100% of list (50,000)

MDE Sensitivity Analysis

MDE (pp)	Per Variant	Total Sample	Days Needed	Feasible?
0.5pp	118,549	237,098	119 days	✗ No
1pp	29,830	59,660	30 days	✗ No
1.5pp	13,342	26,684	14 days	✓ Yes
2pp ←	7,552	15,104	8 days	✓ Yes
3pp	3,398	6,796	4 days	✓ Yes
5pp	1,252	2,504	2 days	✓ Yes

Power Level Comparison

Power	Per Variant	Total Sample	Risk of Missing Real Effect
70%	5,936	11,872	30%
80% ←	7,552	15,104	20%
90%	10,110	20,220	10%
95%	12,500	25,000	5%

Planning notes, formulas, and examples

About the Email A/B Test Calculator

The Email A/B Test Calculator determines the minimum sample size needed for a statistically significant email test. Running A/B tests with too few recipients produces unreliable results, leading to wrong conclusions and suboptimal decisions.

The required sample size depends on your baseline metric (e.g., current open rate), the minimum detectable effect (MDE) you care about, and your desired statistical confidence level. A smaller MDE or higher confidence requires a larger sample.

This calculator uses the standard two-proportion z-test formula to compute per-variant sample sizes. It helps you decide whether your list is large enough to test effectively and how long you may need to accumulate sufficient data.

By calculating this metric accurately, digital marketers gain actionable insights that inform content strategy, audience targeting, and campaign optimization across all channels. Understanding this metric in precise terms allows marketing professionals to set realistic goals, track progress effectively, and refine their approach based on real performance data.

When This Page Helps

Without proper sample size calculation, you might declare a winner based on random variation. This calculator prevents premature decisions by telling you exactly how many subscribers each test variant needs for reliable results.

How to Use the Inputs

Enter your baseline conversion rate (open rate, click rate, etc.).
Enter the minimum detectable effect (smallest difference worth detecting).
Select your desired confidence level (95% is standard).
Select your desired statistical power (80% is standard).
View the required sample size per variant.
Multiply by 2 for total sample size (both variants combined).

Formula used

n = (Z² × 2 × p̅(1 − p̅)) ÷ MDE²
Where Z = Z-score for confidence level, p̅ = pooled proportion, MDE = minimum detectable effect

Example Calculation

Result: 3,589 per variant

To detect a 2 percentage point improvement from a 25% baseline open rate at 95% confidence and 80% power, you need approximately 3,589 subscribers per variant (7,178 total). If your list is smaller, increase MDE or accept lower confidence.

Tips & Best Practices

Test one variable at a time for clean, interpretable results.
Let tests run for the full calculated sample size before declaring a winner.
A 95% confidence level means a 5% chance your result is a false positive.
Larger MDE requires smaller samples—focus on detecting meaningful differences.
If your list is small, test high-impact variables (subject lines) that produce larger effects.
Consider running tests over time if a single send doesn't reach required sample size.

Why Sample Size Matters in Email Testing

Email A/B tests without proper sample sizes produce unreliable results. With too few subscribers per variant, random variation can easily masquerade as a real difference, leading you to adopt inferior tactics.

Understanding the Formula

The sample size formula balances statistical confidence, power, baseline rate, and minimum detectable effect. Each parameter trades off against the others—higher confidence or smaller MDE means larger required samples.

Practical Testing with Limited Lists

If your list is under 10,000, focus on testing variables with large expected effects (subject lines, offers) where a 3–5% MDE is acceptable. Save subtle tests (button color, footer layout) for lists large enough to detect small differences.

Building a Testing Culture

The most successful email programs test continuously. Run one test per campaign, document results, and build a knowledge base over time. Even on small lists, consistent testing with appropriate sample sizes yields valuable insights.

Sources & Methodology

Last updated: February 8, 2026

Frequently Asked Questions

MDE is the smallest difference between variants that you want to reliably detect. For example, a 2% MDE means you want to detect at least a 2 percentage point improvement. Smaller MDE requires larger sample sizes.