Cohen's d Calculator

Calculate Cohen's d, Hedges' g, and Glass's Δ effect sizes with CLES, overlap metrics, confidence intervals, and benchmark interpretation tables.

Cohen's d
0.5000
Interpretation: Medium
95% CI
-0.014 to 1.014
SE = 0.2622
CLES (Prob. of Superiority)
63.8%
Probability Group 2 > Group 1
Overlap
80.3%
% overlap between distributions
U₃ (Non-overlap)
69.1%
% of Group 2 above Group 1 mean
Pooled SD
10.0000
df = 58

Effect Size Conversions

MeasureValueFormula
Cohen\'s d0.5000(M₂ − M₁) / S_pooled
Hedges\' g0.4935d × (1 − 3/(4df − 1))
Glass\'s Δ0.5000(M₂ − M₁) / S₁
Point-Biserial r0.2425d / √(d² + 4)
Eta-squared (η²)0.0588d² / (d² + 4)

Cohen\'s Benchmarks

Categoryd RangeCLESYour d
Negligible0.00.252.8%
Small0.20.559.8%
Medium0.50.867.7%◀ 0.50
Large0.81.276.0%
Very Large1.22.087.1%

Distribution Overlap Visualization

Group 1 (M=50.0) Group 2 (M=55.0)
Planning notes, formulas, and examples

About the Cohen's d Calculator

Cohen's d is the most widely used measure of effect size for comparing two group means. It expresses the difference as a number of standard deviations, making it possible to compare effects across different studies, scales, and contexts. A d of 0.5 means the groups differ by half a standard deviation — regardless of whether you're measuring test scores, blood pressure, or reaction times.

This calculator computes four effect size variants: Cohen's d (pooled SD), Hedges' g (bias-corrected for small samples), Glass's Δ (uses control group SD only), and a control-SD variant. It also converts to point-biserial r and eta-squared, computes the Common Language Effect Size (CLES — the probability that a random person from Group 2 outscores a random person from Group 1), distribution overlap, and U₃ statistics.

Effect sizes are essential for meta-analyses, power analyses, and for determining whether a statistically significant result is practically meaningful. A p-value tells you whether an effect exists; Cohen's d tells you how big it is.

When This Page Helps

Statistical significance (p-values) doesn't tell you the size of an effect — it only tells you whether it's distinguishable from zero. Cohen's d fills this gap by quantifying how large the difference is in standardized units. This is critical for power analyses (planning study sample sizes), meta-analyses (combining results across studies), and practical decision-making.

It gives everything you need: the d value with confidence interval, alternative variants for different assumptions, conversions to other effect size metrics, and intuitive interpretations like CLES and overlap. It's essential for researchers, students, and anyone conducting or reading quantitative research.

How to Use the Inputs

  1. Enter the mean and standard deviation for each group (Group 1 = control, Group 2 = treatment).
  2. Enter the sample sizes for each group.
  3. Select the effect size variant: pooled d (default), Hedges' g, Glass's Δ, or control SD.
  4. Use presets for quick demonstrations of small, medium, and large effects.
  5. Review the effect size, confidence interval, CLES, and overlap statistics.
  6. Check the benchmark table to interpret the magnitude of your effect.
  7. See the conversion table for equivalent r and η² values.
Formula used
Cohen's d = (M₂ − M₁) / S_pooled S_pooled = √[((n₁−1)S₁² + (n₂−1)S₂²) / (n₁ + n₂ − 2)] Hedges' g = d × (1 − 3/(4(n₁+n₂−2) − 1)) Glass's Δ = (M₂ − M₁) / S₁ CLES = Φ(d / √2) Overlap = 2Φ(−|d|/2) r = d / √(d² + 4)

Example Calculation

Result: d = 0.50 (Medium effect)

Groups differ by 5 points with pooled SD = 10, giving d = 5/10 = 0.50. This is a medium effect by Cohen's benchmarks, with CLES = 63.8% (a randomly chosen Group 2 member outscores a Group 1 member 64% of the time).

Tips & Best Practices

  • Always report an effect size alongside a p-value — significance without magnitude is incomplete.
  • Use Hedges' g for meta-analyses and when combining small studies.
  • CLES is the most intuitive way to communicate effect sizes to non-technical audiences.
  • Cohen's benchmarks (0.2, 0.5, 0.8) are guidelines, not rules — context matters enormously.
  • For paired designs, compute d differently: d = mean_difference / SD_difference.
  • Check the confidence interval — if it spans zero, the effect may not be reliable despite a large d.

History and Context

Jacob Cohen introduced his d statistic in 1962 and elaborated it in his landmark 1988 book "Statistical Power Analysis for the Behavioral Sciences." He proposed the small/medium/large benchmarks as rough guides when prior research was unavailable, emphasizing that discipline-specific norms should take precedence.

The distinction between statistical significance and practical significance became a major theme in the "replication crisis" of the 2010s. Many statistically significant findings had negligible effect sizes, calling into question their practical importance. Today, major journals require effect size reporting alongside p-values.

Variants and When to Use Them

**Cohen's d** uses the pooled standard deviation and assumes equal variances. It's the default for independent-samples t-tests. **Hedges' g** applies a small-sample correction factor of approximately 1 − 3/(4df − 1), which matters for n₁ + n₂ < 50. **Glass's Δ** uses only the control group's SD, appropriate when the treatment changes variability (e.g., an educational intervention that affects not just the mean but the spread of scores).

Effect Sizes in Meta-Analysis

Meta-analysis combines effect sizes from multiple studies using weighted averaging. Hedges' g is preferred because its correction removes the small-sample bias that would otherwise inflate the combined estimate. The confidence interval for d is also crucial — studies with wider CIs receive less weight in random-effects meta-analyses. This calculator's CI output is directly usable for meta-analytic input.

Sources & Methodology

Last updated:

Frequently Asked Questions

  • Use Hedges' g when either sample size is small (n < 20). Cohen's d has a slight upward bias for small samples, and the Hedges correction factor adjusts for this. For larger samples, d and g are virtually identical.