Youden's Index Calculator

Calculate Youden's J statistic from sensitivity and specificity or a 2×2 table. Includes ROC space visualization, PPV/NPV, DOR, LR+/LR−, MCC, and a 13-metric performance dashboard.

Youden\'s Index (J)
0.8450
Quality: Excellent | J = Sens + Spec − 1
Sensitivity
85.00%
TP rate: correctly identified positive cases
Specificity
99.50%
TN rate: correctly identified negative cases
Balanced Accuracy
92.25%
(Sens + Spec) / 2 = (J + 1) / 2
PPV / Precision
89.95%
Prob. disease given positive test (prevalence-dependent)
NPV
99.21%
Prob. no disease given negative test
Likelihood Ratio +
170.00
Odds of positive in diseased vs healthy
Likelihood Ratio \u2212
0.1508
Odds of negative in diseased vs healthy

ROC Space Position

FPR (1 \u2212 Specificity) \u2192
Sensitivity
\u25CF Test \u25CF Perfect (0,1) | J distance

2\u00D72 Contingency Table

Disease +Disease \u2212Total
Test +425 (TP)47 (FP)472
Test \u221275 (FN)9,453 (TN)9,528
Total5009,50010,000

Comprehensive Performance Metrics

MetricValueInterpretation
Youden's Index (J)0.8450Excellent — 0 = useless, 1 = perfect
Sensitivity85.00%Catches 85.0% of true positives
Specificity99.50%Correctly excludes 99.5% of negatives
False Positive Rate0.50%0.5% of healthy misclassified
False Negative Rate15.00%15.0% of diseased missed
PPV (Precision)89.95%At 5.0% prevalence
NPV99.21%At 5.0% prevalence
Accuracy98.77%Overall correct classification rate
Balanced Accuracy92.25%Class-balanced accuracy (= (J+1)/2)
DOR1,127.7Diagnostic Odds Ratio — higher is better
F1 Score0.8740Harmonic mean of precision & sensitivity
MCC0.8685Matthews correlation: −1 to +1
NNS23.5Number needed to screen per TP

Youden\'s Index Scale

00.20.40.60.81.0
UninformativePoorFairGoodExcellent
Planning notes, formulas, and examples

About the Youden's Index Calculator

Youden's Index (J) is the single most informative summary statistic for a diagnostic (or classification) test. Defined as Sensitivity + Specificity − 1, it ranges from 0 (useless) to 1 (perfect), combining both error types into one number. A test with J = 0.84 means you're capturing 84 percentage points more correct classifications than random chance.

This calculator computes J from either raw sensitivity/specificity percentages or a 2×2 contingency table (TP, FP, FN, TN). Beyond J itself, the dashboard reports 13 performance metrics: PPV, NPV, accuracy, balanced accuracy, diagnostic odds ratio, likelihood ratios, F1 score, Matthews correlation coefficient, and the number needed to screen.

The ROC space visualization plots the test's operating point and shows J as the vertical distance from the chance line — the same quantity maximized when finding the optimal ROC cutoff. The quality gauge maps J to interpretive bands (Uninformative through Excellent) for quick assessment.

When This Page Helps

Youden's Index distills a diagnostic test to its essence: how much better is it than guessing? This calculator goes further, computing 13 metrics alongside a visual ROC space plot, so you can evaluate a test from every angle — discrimination, prediction, odds ratios, and classification quality.

The preset library includes real-world medical tests, making it easy to benchmark your test. The contingency table mode accepts raw counts when you have experimental data rather than published rates, and the J quality gauge gives immediate visual feedback.

How to Use the Inputs

  1. Choose input mode: sensitivity/specificity percentages or a 2×2 contingency table.
  2. Enter sensitivity, specificity, and prevalence (or TP/FP/FN/TN counts).
  3. Use medical test presets for common scenarios.
  4. Read Youden's Index and the quality rating from the primary output.
  5. Examine the ROC space plot to see the test's position relative to chance and perfection.
  6. Review the 13-metric performance table for comprehensive evaluation.
  7. Adjust prevalence to see how it changes PPV/NPV (J is prevalence-independent).
Formula used
J = Sensitivity + Specificity − 1 = TPR − FPR. Equivalently, J = (TP × TN − FP × FN) / ((TP + FN)(FP + TN)). Ranges from −1 to +1; meaningful tests have J > 0.

Example Calculation

Result: J = 0.8450, Quality: Excellent

J = 0.85 + 0.995 − 1 = 0.845. The test captures 84.5 percentage points more correct classifications than random assignment. At 5% prevalence, PPV = 89.5% and NPV = 99.9%. LR+ = 170, indicating strong positive discrimination.

Tips & Best Practices

  • J = 0 does not mean 50% accuracy — it means the test is no better than chance for the given prevalence.
  • Maximize J to find the optimal ROC cutoff, but consider cost asymmetries (missed cancer vs. false alarm).
  • Compare PPV at your population's prevalence, not just J, when deploying a screening program.
  • DOR > 100 indicates strong discrimination; DOR < 10 is often insufficient for clinical use.
  • Report both J and likelihood ratios in publications — J summarizes overall performance, LRs quantify diagnostic shifts.
  • Use the contingency table mode when you have raw experimental counts — it avoids rounding errors from pre-calculated rates.

Youden's Index and ROC Analysis

When plotting an ROC curve from continuous test results, each possible cutoff gives a different (FPR, Sensitivity) pair. Youden's Index identifies the optimal cutoff — the point on the curve farthest from the chance diagonal. This maximum-J cutoff maximizes the sum of sensitivity and specificity simultaneously, providing a principled and widely cited selection criterion.

Beyond J: When One Number Isn't Enough

J assumes equal weight for sensitivity and specificity, which isn't always appropriate. Screening for a lethal cancer demands high sensitivity (catching all cases) even at the cost of specificity. In contrast, confirmatory tests must have high specificity. The full metric dashboard in this calculator — DOR, LR+, LR−, PPV, NPV — helps evaluate the test for your specific clinical scenario.

Youden's Index in Machine Learning

In binary classification, J appears as "informedness" or "bookmaker informedness." It's equivalent to balanced accuracy × 2 − 1 and closely related to Matthews Correlation Coefficient and Cohen's Kappa. When class imbalance makes accuracy misleading, J and its relatives provide a more honest assessment of classifier performance.

Sources & Methodology

Last updated:

Frequently Asked Questions

  • It summarizes a diagnostic test's discriminatory ability in a single number. It's most commonly used to find the optimal cutoff on an ROC curve — the point where J is maximized. It also enables quick comparison of different tests: higher J means better overall discrimination.