Correlation Calculator
Calculate Pearson and Spearman correlation coefficients, R², covariance, significance testing, z-scores, and ranks from X/Y data. Full statistical analysis.
Calculate R², adjusted R², SS decomposition (SST/SSR/SSE), F-statistic, standard error, and residual analysis. Full ANOVA decomposition with interpretation guide.
| X | Y | Predicted | Residual | Residual % | |Residual| Bar |
|---|---|---|---|---|---|
| 1.00 | 2.1000 | 2.0667 | 0.0333 | 1.59% | |
| 2.00 | 4.2000 | 4.0548 | 0.1452 | 3.46% | |
| 3.00 | 5.8000 | 6.0429 | -0.2429 | -4.19% | |
| 4.00 | 8.1000 | 8.0310 | 0.0690 | 0.85% | |
| 5.00 | 9.9000 | 10.0190 | -0.1190 | -1.20% | |
| 6.00 | 12.2000 | 12.0071 | 0.1929 | 1.58% | |
| 7.00 | 13.8000 | 13.9952 | -0.1952 | -1.41% | |
| 8.00 | 16.1000 | 15.9833 | 0.1167 | 0.72% |
| R² | Meaning | Typical Use | Your Data |
|---|---|---|---|
| 1.00 | Perfect fit | Exact mathematical relationships | |
| 0.90–0.99 | Excellent | Physics, chemistry, engineering | |
| 0.70–0.89 | Good | Biology, medicine, economics | |
| 0.50–0.69 | Moderate | Social sciences, psychology | |
| 0.30–0.49 | Weak | Behavioral science, surveys | |
| 0.00–0.29 | Poor | Model inadequate or no relationship |
R², the coefficient of determination, summarizes how much of the outcome's variation is explained by a regression model. It is one of the most commonly reported fit statistics, but it is easiest to interpret when it is tied back to the underlying sum-of-squares breakdown.
This calculator shows that decomposition directly: total variation, explained variation, and residual variation. It also reports adjusted R², the F-statistic, and standard error so you can place the headline R² in a fuller regression context.
That makes the page useful both for quick model-fit checks and for understanding what an R² value actually means beyond a single decimal number.
R² is often quoted without context, which makes it easy to overstate what a model has accomplished. Showing the explained and unexplained pieces side by side makes the statistic more concrete and keeps the discussion anchored in what the model still misses.
R² = 1 − SS_Res/SS_Tot = SS_Reg/SS_Tot. Adjusted R² = 1 − (1−R²)(n−1)/(n−p−1). F = (SS_Reg/p)/(SS_Res/(n−p−1)). SE = √(SS_Res/(n−p−1)).Result: R² = 0.9988, Adjusted R² = 0.9986, SS_Tot = 165.90, SS_Reg = 165.70, SS_Res = 0.20
99.88% of Y's variation is explained by the linear model. Only 0.12% (SS_Res = 0.20) is unexplained, indicating excellent fit.
All regression analysis rests on one identity: SS_Total = SS_Regression + SS_Residual. SS_Total measures total variation in Y around its mean. SS_Regression measures how much of that variation the model captures. SS_Residual is what remains unexplained.
R² is simply the fraction SS_Reg/SS_Tot. When R² = 0.85, the model captures 85% of variation; the remaining 15% is due to factors not in the model or random noise.
ANOVA (Analysis of Variance) formally tests whether SS_Regression is "large enough" relative to SS_Residual. The F-statistic = (SS_Reg/p) / (SS_Res/(n-p-1)), where p is the number of predictors. Under the null hypothesis (model explains nothing), F follows an F-distribution. Large F → small p-value → model is significant.
1. "Higher R² = better model" — false. Overfitting gives high R² but poor predictions. 2. "R² shows causation" — false. It measures association only. 3. "R² < 0.5 means the model is useless" — false in social sciences where R² = 0.10 can represent an important finding. 4. "R² is always between 0 and 1" — false for adjusted R² and when using non-OLS methods.
Last updated:
R² always increases when you add more predictors, even useless ones. Adjusted R² penalizes for each additional predictor, decreasing if the predictor doesn't improve the model enough. Use adjusted R² when comparing models with different numbers of predictors.
Standard R² ranges 0 to 1. However, adjusted R² can be negative if the model fits worse than simply using the mean as a prediction. If adjusted R² is negative, your model is adding noise, not signal.
For simple linear regression (one predictor), R² = r² exactly. For multiple regression, R² is the square of the multiple correlation coefficient R, which differs from individual Pearson correlations.
F tests whether the model explains significantly more variance than expected by chance. Large F means the model is statistically significant. It's the ratio of per-predictor explained variance to per-observation unexplained variance.
It depends entirely on context. In physics: poor. In economics: decent. In social psychology: excellent. Always compare R² to typical values in your specific field, not to an abstract standard.
No — overfitting. Adding enough predictors always increases R² toward 1.0, but the model becomes worthless for prediction. Use adjusted R², AIC, BIC, or cross-validation to balance fit with simplicity.
Calculate Pearson and Spearman correlation coefficients, R², covariance, significance testing, z-scores, and ranks from X/Y data. Full statistical analysis.
Fit Y = aX³ + bX² + cX + d with R², inflection point, critical points, end behavior analysis, and residual table. Comparison to linear fit.
Predict future values with exponential growth/decay. Doubling time, half-life, Rule of 70, milestone table, growth trajectory, discrete & continuous compounding.