Thursday, March 18, 2010

Regression - How To Quickly Read the Output of Excel’s Regression

Regression Analysis

Done in Excel

How To Read the Output




There is a lot more to the Excel Regression output than just the regression equation. If you know how to quickly read the output of a Regression done in, you’ll know right away the most important points of a regression: if the overall regression was a good, whether this output could have occurred by chance, whether or not all of the independent input variables were good predictors, and whether residuals show a pattern (which means there’s a problem).



Excel Regression Output With Color-Coding Added

regression, multiple regression, regression model, regression excel, regression analysis, multiple regression, regression coefficient, statistical analysis in excel
Click On Image To See Enlarged View


This video will illustrate exactly how to quickly and easily understand the output of Regression performed in Excel:


Step-By-Step Video About How To Quickly Read and Understand the Output of Excel Regression

(Is Your Sound Turned On?)


The 4 Most Important Parts of Regression Output
1) Overall Regression Equation’s Accuracy
(R Square and Adjusted R Square)

2) Probability That This Output Was Not By Chance
(ANOVA – Significance of F)

3) Individual Regression Coefficient and Y-Intercept Accuracy


4) Visual Analysis of Residuals


Some parts of the Excel Regression output are much more important than others. The goal here is for you to be able to glance at the Excel Regression output and immediately understand it, so we will focus our attention only on the four most important parts of the Excel regression output.

1) Overall Regression’s Accuracy

 

regression, multiple regression, regression model, regression excel, regression analysis, multiple regression, regression coefficient, statistical analysis in excel
Click On Image To See Enlarged View


R Square– This is the most important number of the output. R Square tells how well the regression line approximates the real data. This number tells you how much of the output variable’s variance is explained by the input variables’ variance. Ideally we would like to see this at least 0.6 (60%) or 0.7 (70%).


Adjusted R Square – This is quoted most often when explaining the accuracy of the regression equation. Adjusted R Square is more conservative the R Square because it is always less than R Square. Another reason that Adjusted R Square is quoted more often is that when new input variables are added to the Regression analysis, Adjusted R Square increases only when the new input variable makes the Regression equation more accurate (improves the Regression equations’s ability to predict the output). R Square always goes up when a new variable is added, whether or not the new input variable improves the Regression equation’s accuracy.



2) Probability That This Output Was Not By Chance

regression, multiple regression, regression model, regression excel, regression analysis, multiple regression, regression coefficient, statistical analysis in excel
Click On Image To See Enlarged View


Significance of F
– This indicates the probability that the Regression output could have been obtained by chance. A small Significance of F confirms the validity of the Regression output. For example, if Significance of F = 0.030, there is only a 3% chance that the Regression output was merely a chance occurrence.


3) Individual Regression Coefficient Accuracy

regression, multiple regression, regression model, regression excel, regression analysis, multiple regression, regression coefficient, statistical analysis in excel
Click On Image To See Enlarged View

P-value of each coefficient and the Y-intercept – The P-Values of each of these provide the likelihood that they are real results and did not occur by chance. The lower the P-Value, the higher the likelihood that that coefficient or Y-Intercept is valid. For example, a P-Value of 0.016 for a regression coefficient indicates that there is only a 1.6% chance that the result occurred only as a result of chance.



4) Visual Analysis of Residuals

Charting the Residuals
regression, multiple regression, regression model, regression excel, regression analysis, multiple regression, regression coefficient, statistical analysis in excel
Click On Image To See Enlarged View


The Residual Chart
regression, multiple regression, regression model, regression excel, regression analysis, multiple regression, regression coefficient, statistical analysis in excel
Click On Image To See Enlarged View


The residuals are the difference between the Regression’s predicted value and the actual value of the output variable. You can quickly plot the Residuals on a scatterplot chart. Look for patterns in the scatterplot. The more random (without patterns) and centered around zero the residuals appear to be, the more likely it is that the Regression equation is valid.


There are many other pieces of information in the Excel regression output but the above four items will give a quick read on the validity of your Regression.

regression, multiple regression, regression model, regression excel, regression analysis, multiple regression, regression coefficient, statistical analysis in excel



If anyone has any comments or observations related to this article, feel free to submit them because your input and opinions are highly valued.



If You Like This, Then Share It...
Dig this regression, multiple regression, regression model, regression excel, regression analysis, multiple regression, regression coefficient, statistical analysis in excel regression, multiple regression, regression model, regression excel, regression analysis, multiple regression, regression coefficient, statistical analysis in excel Technorati regression, multiple regression, regression model, regression excel, regression analysis, multiple regression, regression coefficient, statistical analysis in excel regression, multiple regression, regression model, regression excel, regression analysis, multiple regression, regression coefficient, statistical analysis in excel regression, multiple regression, regression model, regression excel, regression analysis, multiple regression, regression coefficient, statistical analysis in excel

10 comments:

  1. Thanks for posting this great Excel how to guide! I am sure many users will find this handy!

    You should share your knowledge with the active Excel community over on Facebook:
    http://www.facebook.com/MicrosoftExcel

    Have you tried Office 2010? If so, how are you liking it?

    Cheers,
    Bryn
    MSFT Office Outreach
    ReplyDelete
  2. Thank you so much for this!!! Very helpful

    Mallory Wood
    ReplyDelete
  3. Cheers! [A deadline has been made]
    ReplyDelete
  4. Fantastic post. Am seriously considering buying the entire package.

    regards
    jan/
    ReplyDelete
  5. Really good work...helped a lot....
    Thank you ....
    ReplyDelete
  6. Thank you so much. Very helpful.
    ReplyDelete
  7. uda man, test saver
    ReplyDelete
  8. nice and clear! other blogs are so deep into explaining the equations - this is one of the only ones I've seen that actually helps interpret the output in plain language! thanks!
    ReplyDelete
  9. very simply and useful.. thanks
    ReplyDelete