13.5 R-squared, and Adjusted R-squared

  • R2 is the Coeficient of Determination. It represents the proportion of variation in y (about its mean) explained by the multiple linear regression model with predictors, x1,x2,

R2=SSRSSTO=1SSESSTO - R2 always increases (or stays the same) as more predictors are added to a multiple linear regression model, even if the predictors added are unrelated to the response variable. Thus, by itself, R2 cannot be used to help us identify which predictors should be included in a model and which should be excluded.

  • Adjusted-R2=1(n1n(k+1))(1R2)

  • Adjusted-R2, does not necessarily increase as more predictors are added, and can be used to help us identify which predictors should be included in a model and which should be excluded. Simply stated, when comparing two models used to predict the same response variable, we generally prefer the model with the higher value of Adjusted-R2.