Explained sum of squares

In statistics, the explained sum of squares (ESS), alternatively known as the model sum of squares or sum of squares due to regression (SSR, not to be confused with the residual sum of squares (RSS) or sum of squares of errors), is a quantity used in describing how well a model, often a regression model, represents the data being modelled. In particular, the explained sum of squares measures how much variation there is in the modelled values; this is compared with the total sum of squares (TSS), which measures how much variation there is in the observed data, and with the residual sum of squares, which measures the variation in the errors between the observed data and the modelled values.

Definition

The explained sum of squares (ESS) is the sum of the squares of the deviations of the predicted values from the mean value of a response variable, in a standard regression model, for example $y_i = a + b_1 x_{1i} + b_2 x_{2i} + \cdots + \varepsilon_i$, where $y_i$ is the i-th observation of the response variable, $x_{ji}$ is the i-th observation of the j-th explanatory variable, $a$ and $b_j$ are coefficients, $i$ indexes the observations from 1 to $n$, and $\varepsilon_i$ is the i-th value of the error term. In general, the greater the ESS, the better the estimated model performs.

If $\hat{a}$ and $\hat{b}_j$ are the estimated coefficients, then

$$\hat{y}_i = \hat{a} + \hat{b}_1 x_{1i} + \hat{b}_2 x_{2i} + \cdots$$

is the i-th predicted value of the response variable. The ESS is then

$$\text{ESS} = \sum_{i=1}^n \left(\hat{y}_i - \bar{y}\right)^2,$$

where $\hat{y}_i$ is the value estimated by the regression line and $\bar{y}$ is the mean of the observed values.[1]

In some cases (see below): total sum of squares (TSS) = explained sum of squares (ESS) + residual sum of squares (RSS).
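
As a concrete illustration (an addition, not part of the original article), the following Python sketch fits a simple linear regression to hypothetical data and computes ESS, RSS and TSS directly from the definitions above; the data values and variable names are made up for the example.

    # Illustrative sketch: ESS, RSS and TSS for a simple OLS regression.
    import numpy as np

    x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])   # hypothetical explanatory data
    y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])   # hypothetical response data

    b_hat, a_hat = np.polyfit(x, y, 1)        # np.polyfit returns [slope, intercept]
    y_hat = a_hat + b_hat * x                 # predicted values
    y_bar = y.mean()                          # sample mean of the response

    ess = np.sum((y_hat - y_bar) ** 2)        # explained sum of squares
    rss = np.sum((y - y_hat) ** 2)            # residual sum of squares
    tss = np.sum((y - y_bar) ** 2)            # total sum of squares

    print(ess, rss, tss)                      # ESS + RSS equals TSS for this fit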

Partitioning in simple linear regression

The following equality, stating that the total sum of squares (TSS) equals the residual sum of squares (SSE, the sum of squared errors of prediction) plus the explained sum of squares (SSR, the sum of squares due to regression), is generally true in simple linear regression:

$$\sum_{i=1}^n \left(y_i - \bar{y}\right)^2 = \sum_{i=1}^n \left(y_i - \hat{y}_i\right)^2 + \sum_{i=1}^n \left(\hat{y}_i - \bar{y}\right)^2.$$

Simple derivation

Start from the identity

$$y_i - \bar{y} = \left(y_i - \hat{y}_i\right) + \left(\hat{y}_i - \bar{y}\right).$$

Square both sides and sum over all $i$:

$$\sum_{i=1}^n \left(y_i - \bar{y}\right)^2 = \sum_{i=1}^n \left(y_i - \hat{y}_i\right)^2 + \sum_{i=1}^n \left(\hat{y}_i - \bar{y}\right)^2 + 2\sum_{i=1}^n \left(\hat{y}_i - \bar{y}\right)\left(y_i - \hat{y}_i\right).$$

Here is how the last term above is zero from simple linear regression:[2]

$$\hat{y}_i = \hat{a} + \hat{b} x_i, \qquad \bar{y} = \hat{a} + \hat{b}\bar{x}, \qquad \hat{b} = \frac{\sum_{i=1}^n \left(x_i - \bar{x}\right)\left(y_i - \bar{y}\right)}{\sum_{i=1}^n \left(x_i - \bar{x}\right)^2},$$

so that $\hat{y}_i - \bar{y} = \hat{b}\left(x_i - \bar{x}\right)$ and $y_i - \hat{y}_i = \left(y_i - \bar{y}\right) - \hat{b}\left(x_i - \bar{x}\right)$.

So,

$$\sum_{i=1}^n \left(\hat{y}_i - \bar{y}\right)\left(y_i - \hat{y}_i\right) = \hat{b}\left(\sum_{i=1}^n \left(x_i - \bar{x}\right)\left(y_i - \bar{y}\right) - \hat{b}\sum_{i=1}^n \left(x_i - \bar{x}\right)^2\right) = 0,$$

by the formula for $\hat{b}$ above.

Therefore,

$$\text{TSS} = \text{ESS} + \text{RSS}.$$
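
The vanishing cross term can also be checked numerically. The following Python sketch (an illustrative addition, with hypothetical data) evaluates the cross term for an OLS fit and confirms that it is zero up to floating-point error.

    # Illustrative check that the cross term in the derivation above vanishes
    # for an OLS fit with an intercept.
    import numpy as np

    x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])   # hypothetical data
    y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

    b_hat, a_hat = np.polyfit(x, y, 1)
    y_hat = a_hat + b_hat * x
    y_bar = y.mean()

    cross_term = np.sum((y_hat - y_bar) * (y - y_hat))
    print(cross_term)                         # ~0, so TSS = ESS + RSS holds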

Partitioning in the general ordinary least squares model

The general regression model with n observations and k explanators, the first of which is a constant unit vector whose coefficient is the regression intercept, is

$$y = X\beta + e,$$

where $y$ is an $n \times 1$ vector of dependent variable observations, each column of the $n \times k$ matrix $X$ is a vector of observations on one of the k explanators, $\beta$ is a $k \times 1$ vector of true coefficients, and $e$ is an $n \times 1$ vector of the true underlying errors. The ordinary least squares estimator for $\beta$ is

$$\hat{\beta} = \left(X^\mathsf{T} X\right)^{-1} X^\mathsf{T} y.$$

The residual vector is $\hat{e} = y - X\hat{\beta} = y - X\left(X^\mathsf{T}X\right)^{-1}X^\mathsf{T}y$, so the residual sum of squares $\hat{e}^\mathsf{T}\hat{e}$ is, after simplification,

$$\text{RSS} = y^\mathsf{T}y - y^\mathsf{T}X\left(X^\mathsf{T}X\right)^{-1}X^\mathsf{T}y.$$

Denote by $\bar{y}$ the constant vector all of whose elements are the sample mean of the dependent variable values in the vector $y$. Then the total sum of squares is

$$\text{TSS} = \left(y - \bar{y}\right)^\mathsf{T}\left(y - \bar{y}\right) = y^\mathsf{T}y - 2y^\mathsf{T}\bar{y} + \bar{y}^\mathsf{T}\bar{y}.$$

The explained sum of squares, defined as the sum of squared deviations of the predicted values from the observed mean of $y$, is

$$\text{ESS} = \left(\hat{y} - \bar{y}\right)^\mathsf{T}\left(\hat{y} - \bar{y}\right) = \hat{y}^\mathsf{T}\hat{y} - 2\hat{y}^\mathsf{T}\bar{y} + \bar{y}^\mathsf{T}\bar{y}.$$

Using $\hat{y} = X\hat{\beta}$ in this, and simplifying to obtain $\hat{y}^\mathsf{T}\hat{y} = y^\mathsf{T}X\left(X^\mathsf{T}X\right)^{-1}X^\mathsf{T}y$, gives the result that TSS = ESS + RSS if and only if $y^\mathsf{T}\bar{y} = \hat{y}^\mathsf{T}\bar{y}$. The left side of this is the sample mean times the sum of the elements of $y$, and the right side is the sample mean times the sum of the elements of $\hat{y}$, so the condition is that the sum of the elements of $y$ equals the sum of the elements of $\hat{y}$, or equivalently that the sum of the prediction errors (residuals) $y_i - \hat{y}_i$ is zero. This can be seen to be true by noting the well-known OLS property that the $k \times 1$ vector $X^\mathsf{T}\hat{e} = X^\mathsf{T}\left(y - X\left(X^\mathsf{T}X\right)^{-1}X^\mathsf{T}y\right) = 0$: since the first column of $X$ is a vector of ones, the first element of this vector, $\sum_{i=1}^n \left(y_i - \hat{y}_i\right)$, is the sum of the residuals and is equal to zero. This proves that the condition holds for the result that TSS = ESS + RSS.
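
A short numerical sketch may help here (an illustrative addition, not from the article): it builds a design matrix whose first column is a vector of ones, computes the OLS fit, and checks that TSS = ESS + RSS holds. The random data, coefficient values, and variable names are assumptions made only for the example.

    # Illustrative sketch: the matrix-form partition TSS = ESS + RSS, which
    # relies on the first column of X being a constant (intercept) column.
    import numpy as np

    rng = np.random.default_rng(0)
    n, k = 50, 3
    X = np.column_stack([np.ones(n), rng.normal(size=(n, k - 1))])  # intercept first
    beta = np.array([1.0, 2.0, -0.5])                               # hypothetical coefficients
    y = X @ beta + rng.normal(scale=0.3, size=n)

    beta_hat = np.linalg.lstsq(X, y, rcond=None)[0]   # OLS estimator (X'X)^{-1} X'y
    y_hat = X @ beta_hat
    y_bar = y.mean()

    tss = np.sum((y - y_bar) ** 2)
    ess = np.sum((y_hat - y_bar) ** 2)
    rss = np.sum((y - y_hat) ** 2)

    print(np.isclose(tss, ess + rss))   # True: residuals sum to zero thanks to the intercept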

In linear algebra terms, we have $\text{RSS} = \left\|y - \hat{y}\right\|^2$, $\text{TSS} = \left\|y - \bar{y}\right\|^2$, and $\text{ESS} = \left\|\hat{y} - \bar{y}\right\|^2$, with $\bar{y}$ again the constant vector of sample means. The proof can be simplified by noting that $\hat{y}^\mathsf{T}\hat{y} = \hat{y}^\mathsf{T}y$. The proof is as follows:

$$\hat{y}^\mathsf{T}\hat{y} = y^\mathsf{T}X\left(X^\mathsf{T}X\right)^{-1}X^\mathsf{T}X\left(X^\mathsf{T}X\right)^{-1}X^\mathsf{T}y = y^\mathsf{T}X\left(X^\mathsf{T}X\right)^{-1}X^\mathsf{T}y = \hat{y}^\mathsf{T}y.$$

Thus,

$$\text{TSS} = \left\|y - \bar{y}\right\|^2 = \left\|\left(y - \hat{y}\right) + \left(\hat{y} - \bar{y}\right)\right\|^2 = \left\|y - \hat{y}\right\|^2 + \left\|\hat{y} - \bar{y}\right\|^2 + 2\left(y - \hat{y}\right)^\mathsf{T}\left(\hat{y} - \bar{y}\right),$$

which again gives the result that TSS = ESS + RSS, since

$$\left(y - \hat{y}\right)^\mathsf{T}\left(\hat{y} - \bar{y}\right) = \left(y^\mathsf{T}\hat{y} - \hat{y}^\mathsf{T}\hat{y}\right) - \left(y - \hat{y}\right)^\mathsf{T}\bar{y} = 0;$$

the first parenthesis vanishes by the identity above, and the second because the residuals sum to zero when the model contains an intercept.
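
For completeness, the two identities used in this proof can also be verified numerically. The sketch below (an illustrative addition, with hypothetical data) checks that $\hat{y}^\mathsf{T}\hat{y} = \hat{y}^\mathsf{T}y$ and that the residual vector is orthogonal to the columns of $X$.

    # Illustrative check of the identities used in the proof above.
    import numpy as np

    rng = np.random.default_rng(1)
    X = np.column_stack([np.ones(30), rng.normal(size=(30, 2))])   # intercept column first
    y = X @ np.array([0.5, 1.0, -2.0]) + rng.normal(scale=0.2, size=30)

    y_hat = X @ np.linalg.lstsq(X, y, rcond=None)[0]

    print(np.isclose(y_hat @ y_hat, y_hat @ y))      # y_hat'y_hat equals y_hat'y
    print(np.allclose(X.T @ (y - y_hat), 0.0))       # residuals orthogonal to columns of X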

Notes

  1. ^ "Sum of Squares - Definition, Formulas, Regression Analysis". Corporate Finance Institute. Retrieved 2020-06-11.
  2. ^ Mendenhall, William (2009). Introduction to Probability and Statistics (13th ed.). Belmont, CA: Brooks/Cole. p. 507. ISBN 9780495389538.

References

  • S. E. Maxwell and H. D. Delaney (1990), "Designing experiments and analyzing data: A model comparison perspective". Wadsworth. pp. 289–290.
  • G. A. Milliken and D. E. Johnson (1984), "Analysis of messy data", Vol. I: Designed experiments. Van Nostrand Reinhold. pp. 146–151.
  • B. G. Tabachnick and L. S. Fidell (2007), "Experimental design using ANOVA". Duxbury. p. 220.
  • B. G. Tabachnick and L. S. Fidell (2007), "Using multivariate statistics", 5th ed. Pearson Education. pp. 217–218.