This is one of the following seven articles on Logistic Regression in Excel

Hosmer-Lemeshow Test in Excel – Logistic Regression Goodness-of-Fit Test

Another goodness-of-fit test commonly applied to logistic regression results is the Hosmer-Lemeshow test. This is a Chi-Square Goodness-Of-Fit test that quantifies how closely the predicted results match the actual observations. The test can be summarized as follows:

The observations are sorted by predicted probability and split into ten groups, called deciles. The number of expected (predicted) positives and negatives in each decile is compared with the observed number of positives and negatives in that decile. The comparison of expected and observed numbers produces a test statistic called the Chi-Square Statistic. A p Value is then derived from this statistic and used to judge whether or not the model is a good fit.

A large p Value indicates that the difference between the observed and expected values is not statistically significant, so the test provides no evidence of a poor fit and the model is considered valid. If the p Value is smaller than the specified level of significance (usually set at 0.05), the difference between the observed and expected values is statistically significant and the model is therefore considered not valid.

Hosmer-Lemeshow Test in Excel

The data should be sorted by predicted probability and divided into 10 equally-sized groups called deciles or bins. Produce the following four counts for each bin:

- Positive values observed in that bin

- Positive values expected in that bin

- Negative values observed in that bin

- Negative values expected in that bin
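The four counts above can be sketched in Python as a cross-check on the worksheet. This is a minimal sketch, not the article's Excel layout: the function name `hosmer_lemeshow_bins` and the simulated data are hypothetical, and it assumes the expected positives in a bin are the sum of the predicted probabilities in that bin, with expected negatives as the remainder of the bin size.

```python
import random


def hosmer_lemeshow_bins(probs, outcomes, n_bins=10):
    """Return one (obs_pos, exp_pos, obs_neg, exp_neg) tuple per bin.

    probs    -- predicted probabilities of a positive outcome
    outcomes -- observed outcomes coded as 1 (positive) or 0 (negative)
    """
    # Sort observations by predicted probability, lowest first.
    pairs = sorted(zip(probs, outcomes))
    n = len(pairs)
    rows = []
    for b in range(n_bins):
        # Integer arithmetic keeps bin sizes within one observation of each other.
        start = b * n // n_bins
        stop = (b + 1) * n // n_bins
        chunk = pairs[start:stop]
        size = len(chunk)
        obs_pos = sum(y for _, y in chunk)   # observed positives
        exp_pos = sum(p for p, _ in chunk)   # expected positives (assumption above)
        rows.append((obs_pos, exp_pos, size - obs_pos, size - exp_pos))
    return rows


# Hypothetical data: 200 predicted probabilities and simulated 0/1 outcomes.
rng = random.Random(42)
probs = [(i + 0.5) / 200 for i in range(200)]
outcomes = [1 if rng.random() < p else 0 for p in probs]

rows = hosmer_lemeshow_bins(probs, outcomes)
for number, (obs_pos, exp_pos, obs_neg, exp_neg) in enumerate(rows, start=1):
    print(number, obs_pos, round(exp_pos, 2), obs_neg, round(exp_neg, 2))
```

Each printed row corresponds to one bin, with the positive observed and expected counts first and the negative observed and expected counts after them.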

Arrange all of that data as is done in the following diagram. Place the positive observed and expected values together on one side. Place the negative observed and expected values together on the other side. This is shown as follows:

[Image: observed and expected positive and negative counts for each of the 10 bins]

A Chi-Square Goodness-Of-Fit test requires that the average number of values in each “Expected” bin is at least 5 and that every “Expected” bin has a value of at least 1.
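The two conditions on the “Expected” bins are easy to check before running the test. The helper below is a small sketch with a hypothetical name, `expected_counts_ok`, and it simply encodes the two rules as stated in the paragraph above.

```python
def expected_counts_ok(expected):
    """Return True when the expected counts meet both stated conditions."""
    average_ok = sum(expected) / len(expected) >= 5   # average of at least 5
    minimum_ok = min(expected) >= 1                   # every bin at least 1
    return average_ok and minimum_ok


# Hypothetical expected counts used only to exercise the helper.
print(expected_counts_ok([8, 7, 5.5, 1.2]))
print(expected_counts_ok([10, 10, 10, 0.5]))
```

The list passed in should contain every “Expected” bin, both the positive and the negative ones.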

This test becomes unreliable when the total number of observations is small. Test creators David Hosmer and Stanley Lemeshow recommend at least 200 observations.

This test is performed almost exactly like any other Chi-Square Goodness-Of-Fit test except the degrees of freedom equals the number of bins – 2. In this case, that would be as follows:

df = Number of bins – 2 = 10 – 2 = 8

Calculate the following for each positive observed/expected group and for each negative observed/expected group:

(Number observed – number expected)² / (Number of expected)

Calculate the test statistic called the Chi-Square Statistic, Χ².

Χ² = ∑ (Number observed – number expected)² / (Number of expected)

Χ² = 6.08418
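The summation can also be sketched outside Excel. The counts below are hypothetical (the article's own counts appear only in its diagram), so this sketch is not expected to reproduce 6.08418; it only shows the per-group terms being added across the positive and negative groups. The function name `chi_square_statistic` is likewise an illustration.

```python
def chi_square_statistic(observed, expected):
    """Sum (observed - expected)^2 / expected over paired counts."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Hypothetical counts for 10 bins of 20 observations each.
bin_size = 20
obs_pos = [1, 2, 4, 6, 9, 11, 14, 16, 18, 19]
exp_pos = [1.4, 2.9, 4.6, 6.8, 9.1, 10.9, 13.2, 15.4, 17.1, 18.6]
obs_neg = [bin_size - o for o in obs_pos]
exp_neg = [bin_size - e for e in exp_pos]

# The test statistic adds the terms from the positive and the negative groups.
hl_statistic = (chi_square_statistic(obs_pos, exp_pos)
                + chi_square_statistic(obs_neg, exp_neg))
print(hl_statistic)
```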

This test statistic, Χ², is distributed approximately according to the Chi-Square distribution with (Number of bins) – 2 degrees of freedom, provided that the average number of values in each “Expected” bin is at least 5 and that every “Expected” bin has a value of at least 1.

A p Value can be derived from the Chi-Square Statistic as follows:

p Value = CHISQ.DIST.RT(Χ²,df) = CHISQ.DIST.RT(6.08418,8) = 0.63780
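As a cross-check on the worksheet result, the right-tail area can be computed without Excel. The sketch below uses the closed-form expression for the Chi-Square right tail that holds only when the degrees of freedom are even, which covers the df = 10 – 2 = 8 derived earlier. The function name `chi_square_right_tail` is hypothetical and is not an Excel or library function.

```python
import math


def chi_square_right_tail(x, df):
    """Right-tail area of the Chi-Square distribution for EVEN df only.

    For df = 2k the area equals exp(-x/2) * sum_{i=0}^{k-1} (x/2)^i / i!.
    """
    if df <= 0 or df % 2 != 0:
        raise ValueError("this closed form requires a positive, even df")
    if x < 0:
        raise ValueError("the statistic cannot be negative")
    half = x / 2.0
    term = 1.0    # the i = 0 term of the series
    total = 1.0
    for i in range(1, df // 2):
        term *= half / i    # builds (x/2)^i / i! incrementally
        total += term
    return math.exp(-half) * total


# Statistic from the article, with df = number of bins - 2 = 8.
p_value = chi_square_right_tail(6.08418, 8)
print(round(p_value, 5))
```

The printed value can be compared directly with the CHISQ.DIST.RT result in the worksheet. For odd degrees of freedom this shortcut does not apply and a statistics library would be needed.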

This p Value states that, if the model's predictions were correct, random variation alone would produce a Chi-Square Statistic at least as large as 6.08418 about 63.78 percent of the time. The difference between the observed and expected values is therefore not significant. The model is considered to be a good model because the predicted values appear to be a good fit to the observed values. The Null Hypothesis stating that there is no difference between the Expected and Observed values cannot be rejected.

A small p Value would indicate that the model is not a good fit.

The p Value indicates the percentage of area under the Chi-Square distribution curve that is to the right of the Chi-Square Statistic of 6.08418. This is illustrated in the following diagram.

[Image: Chi-Square distribution curve showing the area to the right of the Chi-Square Statistic of 6.08418]

The formula CHISQ.DIST.RT(Χ²,df) is available in Excel 2010 and later. In earlier versions of Excel, use the following formula instead: CHIDIST(Χ²,df)
