What is the difference between Wilcoxon rank-sum and Mann-Whitney U?

They're the same test with different formulations. Wilcoxon uses rank sums directly; Mann-Whitney uses U statistics. They always give the same p-value. The names are used interchangeably.

When should I use this instead of a t-test?

When your data is ordinal, heavily skewed, contains outliers, or when sample sizes are too small to verify normality (n < 15-20 per group). If data is reasonably normal, the t-test is slightly more powerful.

What does the U statistic mean?

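U counts, over all n₁×n₂ pairs of observations, how many times a value from one group exceeds a value from the other (a tied pair counts as one half). U ranges from 0 to n₁×n₂. Values near either extreme suggest the groups barely overlap; values near n₁×n₂/2 suggest extensive overlap. Because the reported U is usually the smaller of U₁ and U₂, a small reported U indicates strong separation.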
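The counting definition can be sketched directly in Python. This is a minimal illustration, not the calculator's own code: the function name is hypothetical, counting ties as one half is the usual convention (assumed here), and the data are the twenty values listed in the Rank Details table on this page.

```python
def u_by_counting(x, y):
    """Count pairs (xi, yj) where xi exceeds yj; a tied pair counts as 0.5."""
    u = 0.0
    for xi in x:
        for yj in y:
            if xi > yj:
                u += 1.0
            elif xi == yj:
                u += 0.5
    return u

# Values read from the Rank Details table on this page.
group1 = [5, 6, 6, 7, 7, 7, 8, 8, 8, 9]
group2 = [3, 3, 4, 4, 5, 5, 5, 6, 6, 7]

u1 = u_by_counting(group1, group2)
u2 = u_by_counting(group2, group1)
print(u1, u2)  # 90.0 10.0, and u1 + u2 equals n1 * n2 = 100
```

With 100 possible pairs, U₁ = 90 sits close to the upper extreme, which is why the two groups are reported as clearly separated.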

Can I use this for paired data?

No, this test is for independent groups. For paired data, use the Wilcoxon signed-rank test instead, which is the non-parametric alternative to the paired t-test.

How are ties handled?

When two or more observations have the same value, they receive the average of the ranks they would have occupied. A tie correction factor can be applied to the variance formula for more accurate z-approximations.

What is the Hodges-Lehmann estimator?

It's the median of all n₁×n₂ pairwise differences between the two groups. It provides a robust, distribution-free estimate of the location shift between groups and is the point estimate usually reported alongside the non-parametric confidence interval.

Wilcoxon Rank-Sum Test Calculator

Perform the Wilcoxon rank-sum (Mann-Whitney U) test for non-parametric comparison of two groups. Get U statistic, z-score, p-value, effect size, and rank table.

U Statistic: U₁ = 90, U₂ = 10 (min = 10)
Z Approximation: 3.0237 (normal approximation: (U₁ − μ_U) / σ_U)
p-Value: 0.0025 (significant at α = 0.05)
Decision: Reject H₀ (distributions differ significantly)
Effect Size (r): 0.6761 (large)
Hodges-Lehmann: 2.0000 (median pairwise difference; robust location shift estimate)

Group Summary

Group	n	Median	Rank Sum	Mean Rank
Group 1	10	7.00	145.0	14.50
Group 2	10	5.00	65.0	6.50

Rank Details

Value	Group	Rank
3	Group 2	1.5
3	Group 2	1.5
4	Group 2	3.5
4	Group 2	3.5
5	Group 1	6.5
5	Group 2	6.5
5	Group 2	6.5
5	Group 2	6.5
6	Group 1	10.5
6	Group 1	10.5
6	Group 2	10.5
6	Group 2	10.5
7	Group 1	14.5
7	Group 1	14.5
7	Group 1	14.5
7	Group 2	14.5
8	Group 1	18
8	Group 1	18
8	Group 1	18
9	Group 1	20

Visual: Rank Distribution

Mean rank by group: Group 1 = 14.5, Group 2 = 6.5 (expected: 10.5).

Planning notes, formulas, and examples

About the Wilcoxon Rank-Sum Test Calculator

The Wilcoxon rank-sum test (also called the Mann-Whitney U test) is the non-parametric alternative to the independent two-sample t-test. It compares two groups without assuming normality, instead working with the ranks of the combined data to test whether one group tends to produce larger values than the other.

This calculator takes raw data from two groups, computes the Mann-Whitney U statistic, performs a z-approximation for the p-value, and provides effect sizes and the Hodges-Lehmann median difference estimator. A complete rank table shows every observation's assigned rank.

The Wilcoxon rank-sum test is ideal when data is ordinal (Likert scales, rankings), heavily skewed, contains outliers, or comes from small samples where normality cannot be verified. It's widely used in biomedical research, psychology, ecology, and quality control.

When This Page Helps

The t-test can be sensitive to skew and outliers, while the Wilcoxon rank-sum test works on ranks and is better suited to ordinal data or distributions that are difficult to treat as normal. This calculator automates the ranking, tie handling, and p-value step so the result is easier to inspect.

How to Use the Inputs

Enter comma-separated data values for Group 1 and Group 2.
Or click a preset to load example data.
Select the tail direction: two-tailed, right-tailed, or left-tailed.
Set your significance level alpha.
Review the U statistic, z-approximation, and p-value.
Check the effect size (r) and Hodges-Lehmann median difference.
Examine the rank table to see how observations were ranked.
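How the tail selection changes the p-value can be sketched as follows. This is an illustration under stated assumptions: the function name and tail labels are hypothetical, z is taken to be computed from U₁ (so a positive z means Group 1 tends to be larger), and the normal approximation is used throughout.

```python
import math

def p_value_from_z(z, tail="two-tailed"):
    """Convert a z-score into a p-value for the chosen tail (normal approximation)."""
    upper = 0.5 * math.erfc(z / math.sqrt(2.0))  # P(Z >= z) for a standard normal
    if tail == "right-tailed":   # H1: Group 1 tends to be larger
        return upper
    if tail == "left-tailed":    # H1: Group 1 tends to be smaller
        return 1.0 - upper
    if tail == "two-tailed":     # H1: the groups differ in either direction
        return min(1.0, 2.0 * min(upper, 1.0 - upper))
    raise ValueError(f"unknown tail: {tail!r}")

z = 3.0237  # z-score shown in the results on this page
p_two = p_value_from_z(z, "two-tailed")
p_right = p_value_from_z(z, "right-tailed")
p_left = p_value_from_z(z, "left-tailed")
print(round(p_two, 4))  # 0.0025
```

The two-tailed value is twice the smaller one-tailed value, so choosing a one-tailed test after seeing the data halves the p-value without any new evidence; the direction should be fixed in advance.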

Formula used

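Mann-Whitney U Statistic:
  U₁ = R₁ − n₁(n₁+1)/2
  U₂ = R₂ − n₂(n₂+1)/2
  U = min(U₁, U₂)
  (R₁, R₂ = rank sums of Group 1 and Group 2 in the combined sample; n₁, n₂ = group sizes)

Z Approximation (for large samples):
  z = (U₁ − μ_U) / σ_U
  μ_U = n₁n₂/2
  σ_U = √(n₁n₂(N+1)/12), with N = n₁ + n₂ (no tie correction)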

Effect Size:
  r = |z| / √N

Hodges-Lehmann Estimator:
  Median of all n₁×n₂ pairwise differences
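The formulas above can be chained together in a short Python sketch. It is not the calculator's source: the function and key names are hypothetical, the ranking helper is a simple quadratic-time version chosen for brevity, and the variance deliberately omits the tie correction, as in the formula listed here.

```python
import math

def midranks(values):
    """Rank every value; tied values share the average of the ranks they span."""
    return [
        sum(w < v for w in values) + (sum(w == v for w in values) + 1) / 2.0
        for v in values
    ]

def rank_sum_test(x, y):
    n1, n2 = len(x), len(y)
    n = n1 + n2
    ranks = midranks(list(x) + list(y))
    r1, r2 = sum(ranks[:n1]), sum(ranks[n1:])        # rank sums R1, R2
    u1 = r1 - n1 * (n1 + 1) / 2.0
    u2 = r2 - n2 * (n2 + 1) / 2.0
    mu_u = n1 * n2 / 2.0
    sigma_u = math.sqrt(n1 * n2 * (n + 1) / 12.0)    # no tie correction
    z = (u1 - mu_u) / sigma_u
    p = math.erfc(abs(z) / math.sqrt(2.0))           # two-tailed normal p-value
    r = abs(z) / math.sqrt(n)                        # effect size
    return {"R1": r1, "R2": r2, "U1": u1, "U2": u2,
            "U": min(u1, u2), "z": z, "p": p, "r": r}

# Values read from the Rank Details table on this page.
group1 = [5, 6, 6, 7, 7, 7, 8, 8, 8, 9]
group2 = [3, 3, 4, 4, 5, 5, 5, 6, 6, 7]
result = rank_sum_test(group1, group2)
print({k: round(v, 4) for k, v in result.items()})
```

For these data the sketch reproduces the rank sums (145 and 65), U₁ = 90, U₂ = 10, z ≈ 3.0237, p ≈ 0.0025, and r ≈ 0.6761 shown in the results on this page.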

Example Calculation

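Result: U = 10, z = 3.02, p = 0.0025

For the data shown in the rank table above (n₁ = n₂ = 10), Group 1 (median = 7) has significantly higher ranks than Group 2 (median = 5). The rank sums R₁ = 145 and R₂ = 65 give U₁ = 90 and U₂ = 10, so U = 10. With μ_U = 50 and σ_U ≈ 13.23, z = (90 − 50) / 13.23 ≈ 3.02, which gives p = 0.0025 (two-tailed), a statistically significant difference at α = 0.05. The effect size r = 3.02 / √20 ≈ 0.68 suggests a large effect. The Hodges-Lehmann estimate of the location shift is 2.0.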

Tips & Best Practices

Ties are handled by assigning average ranks. Many ties reduce the test's discriminating power.
The z-approximation is reliable for sample sizes n₁, n₂ ≥ 10. For very small samples, use exact critical values.
Unlike the t-test, the Wilcoxon test is not testing means — it tests whether one distribution is stochastically greater than the other.
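The Hodges-Lehmann estimator provides a robust estimate of the location shift between groups. It is the median of the pairwise differences, which need not equal the difference between the two group medians.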
Effect size r follows Cohen's benchmarks: 0.1 = small, 0.3 = medium, 0.5 = large.
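This test is sometimes incorrectly called a "test of medians". It actually compares the full distributions; it reduces to a comparison of medians only under the additional assumption that the two distributions have the same shape and differ only by a shift.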
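For very small samples, an exact p-value can be obtained by enumerating every way of splitting the combined ranks into two groups, instead of relying on the z-approximation. The sketch below is one way to do that, not necessarily what this calculator does; the function names and the seven data values are made up for illustration.

```python
from itertools import combinations

def midranks(values):
    """Rank every value; tied values share the average of the ranks they span."""
    return [
        sum(w < v for w in values) + (sum(w == v for w in values) + 1) / 2.0
        for v in values
    ]

def exact_two_tailed_p(x, y):
    """Exact two-tailed p-value: share of splits at least as extreme as observed."""
    combined = list(x) + list(y)
    n1, n = len(x), len(combined)
    ranks = midranks(combined)
    expected = n1 * (n + 1) / 2.0                  # mean rank sum under H0
    observed_dev = abs(sum(ranks[:n1]) - expected)
    extreme = total = 0
    for idx in combinations(range(n), n1):         # every possible Group 1
        total += 1
        if abs(sum(ranks[i] for i in idx) - expected) >= observed_dev - 1e-9:
            extreme += 1
    return extreme / total

# Hypothetical, completely separated samples (illustrative values only).
x = [7, 8, 9]
y = [3, 4, 5, 6]
p_exact = exact_two_tailed_p(x, y)
print(p_exact)  # 2 of the 35 possible splits are this extreme
```

Even with perfect separation, n₁ = 3 and n₂ = 4 cannot reach p < 0.05 two-tailed, because 2/35 ≈ 0.057 is the smallest attainable value. Enumeration grows quickly with sample size, so it is practical only for small groups.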

Rank-Based Testing Philosophy

Rank-based tests replace raw observations with their ranks in the combined sample, making the analysis robust to outliers and to distributional assumptions. If the largest observation changes from 100 to 10,000, its rank does not change at all, so the test statistic is unaffected; in general, an extreme value can never contribute more than the highest or lowest rank. This robustness comes at a small cost in power: when the data truly are normal, the asymptotic relative efficiency of the Wilcoxon test compared with the t-test is 3/π ≈ 95.5%.
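The robustness claim is easy to check numerically. In this sketch (hypothetical function name; data taken from the Rank Details table on this page) the largest Group 1 value is replaced by 10,000: the group mean changes enormously, while U does not move.

```python
import math
from statistics import mean

def u_statistic(x, y):
    """Mann-Whitney U for x versus y, counting each tied pair as 0.5."""
    return sum((xi > yj) + 0.5 * (xi == yj) for xi in x for yj in y)

group1 = [5, 6, 6, 7, 7, 7, 8, 8, 8, 9]
group2 = [3, 3, 4, 4, 5, 5, 5, 6, 6, 7]
group1_outlier = group1[:-1] + [10000]   # replace the largest value, 9

u_before = u_statistic(group1, group2)
u_after = u_statistic(group1_outlier, group2)
mean_before = mean(group1)
mean_after = mean(group1_outlier)
print(u_before, u_after)        # 90.0 90.0
print(mean_before, mean_after)  # the mean jumps from 7.1 to over 1000

# Asymptotic relative efficiency versus the t-test under normality.
are = 3 / math.pi
print(round(are, 3))  # 0.955
```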

Handling Ties in Rank Data

Tied observations receive the average rank. For example, if observations at positions 3, 4, and 5 all share the same value, each receives rank 4. When there are many ties, a correction factor adjusts the variance of the U statistic. With discrete data (like Likert scales), ties are common and the correction becomes important.
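The effect of the tie correction can be sketched as below. The correction used is the standard one, σ_U² = (n₁n₂/12)·[(N+1) − Σ(t³ − t)/(N(N−1))], where t is the size of each group of tied values. Whether this calculator applies it is an assumption left open here: the z of 3.0237 shown in the results on this page matches the uncorrected variance. The function name is hypothetical.

```python
import math
from collections import Counter

def sigma_u(x, y, tie_correction=True):
    """Standard deviation of U under H0, optionally corrected for ties."""
    n1, n2 = len(x), len(y)
    n = n1 + n2
    tie_term = 0.0
    if tie_correction:
        sizes = Counter(list(x) + list(y)).values()   # size t of each tie group
        tie_term = sum(t**3 - t for t in sizes) / (n * (n - 1))
    return math.sqrt(n1 * n2 / 12.0 * ((n + 1) - tie_term))

# Values read from the Rank Details table on this page.
group1 = [5, 6, 6, 7, 7, 7, 8, 8, 8, 9]
group2 = [3, 3, 4, 4, 5, 5, 5, 6, 6, 7]
u1, mu_u = 90.0, 50.0

sigma_plain = sigma_u(group1, group2, tie_correction=False)
sigma_tied = sigma_u(group1, group2, tie_correction=True)
z_plain = (u1 - mu_u) / sigma_plain
z_tied = (u1 - mu_u) / sigma_tied
print(round(sigma_plain, 3), round(sigma_tied, 3))
print(round(z_plain, 3), round(z_tied, 3))
```

Ties always shrink the variance, so the corrected z is larger in absolute value. With heavily tied data such as this, the difference is visible in the second decimal place.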

Interpreting the Hodges-Lehmann Estimator

The Hodges-Lehmann estimator is the median of all pairwise differences d = x₁ᵢ − x₂ⱼ. It estimates the shift in location between the two distributions. Unlike the mean difference, it's resistant to outliers. A confidence interval for this estimator can be constructed using the distribution of U, providing a non-parametric analog to the confidence interval from a t-test.
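A minimal sketch of the point estimate, using only the Python standard library. The function name is hypothetical and the data come from the Rank Details table on this page; the confidence interval is not covered here.

```python
from statistics import median

def hodges_lehmann(x, y):
    """Median of all len(x) * len(y) pairwise differences xi - yj."""
    return median(xi - yj for xi in x for yj in y)

group1 = [5, 6, 6, 7, 7, 7, 8, 8, 8, 9]
group2 = [3, 3, 4, 4, 5, 5, 5, 6, 6, 7]

hl = hodges_lehmann(group1, group2)
print(hl)  # 2.0, matching the Hodges-Lehmann value in the results above

# Replacing the largest Group 1 value with an extreme outlier leaves it unchanged.
hl_outlier = hodges_lehmann(group1[:-1] + [10000], group2)
print(hl_outlier)  # 2.0
```

Only 10 of the 100 pairwise differences involve the altered value, and they were already above the median, so the estimate stays at 2.0 while the difference in means would be dominated by the outlier.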

Sources & Methodology

Last updated: March 8, 2026

