All tools
Marketing tools

A/B Test Calculator

Calculate statistical significance between two variants instantly — enter visitors and conversions, get p-value, confidence level and a clear verdict.

p-value 95% / 99% confidence Chi-squared test Relative uplift Sample size guide Instant verdict
Get started free Sign in

Free · No credit card · 50 credits/day

Confidence level reference

Confidence p-value Meaning Use when
80% p < 0.20 Weak — 1 in 5 chance it's noise Exploratory tests, low-stakes decisions
90% p < 0.10 Moderate — 1 in 10 chance it's noise Low-traffic sites needing faster decisions
95% p < 0.05 Standard — 1 in 20 chance it's noise Most A/B tests, industry default
99% p < 0.01 Strong — 1 in 100 chance it's noise High-stakes changes, irreversible decisions

Frequently asked questions

What is statistical significance in A/B testing?

Statistical significance tells you the probability that the difference between control and variant is real, not random. 95% confidence means only a 5% chance (p < 0.05) the difference is noise. 99% confidence is used for high-stakes decisions. Running tests for too little time or traffic produces misleadingly significant results.

How long should I run an A/B test?

At least one full business cycle (1–2 weeks) to account for day-of-week effects. Never stop when you first reach 95% significance — the "peeking problem" inflates false positive rates. Calculate required sample size before starting, then run until you hit it regardless of interim results.

What is p-value and how do I interpret it?

p-value is the probability of seeing a difference this large if no real difference exists. p=0.05 means 5% chance of a false positive (95% confidence). Lower p-value = stronger evidence. A significant p-value does NOT indicate practical importance — a 0.1% lift may be statistically significant but not worth acting on.

What is the minimum sample size for an A/B test?

Depends on: (1) baseline conversion rate, (2) minimum detectable effect (MDE — the smallest lift worth acting on), (3) desired confidence and power. At 3% baseline and 20% MDE, you need ~5,000 visitors per variant. At 10% MDE: ~20,000 per variant. Underpowered tests produce unreliable results.

Related marketing tools

More tools for conversion optimisation.

Conversion Rate Calculator

Calculate and compare conversion rates across periods.

Landing Page Analyzer

Analyze headlines, CTAs and structure of any landing page.

Ad Copy Generator

Generate copy variations to A/B test in your campaigns.

Stop guessing — start testing

Free account. 50 credits per day. Access to 75+ tools instantly.

Create free account →