Monday, June 2, 2014

Chi-Square Goodness-of-Fit Test Overview

This is one of the following three articles on Chi-Square Goodness-Of-Fit Tests in Excel

Overview of the Chi-Square Goodness-of-Fit Test

Chi-Square Goodness- of-Fit Test With Pre-Determined Bins Sizes in 7 Steps in Excel 2010 and Excel 2013

Chi-Square Goodness-Of-Fit-Normality Test in 9 Steps in Excel 2010 and Excel 2013

Chi-Square Goodness-of-

Fit Test Overview

Chi-Square Goodness-Of-Fit (GOF) tests are hypothesis tests that determine how closely a sample of data fits a hypothesized distribution. The actual data observations are divided up into groups called bins. The same number of data points is divided up into identical bins in the groupings that would be expected if these data points exactly matched the hypothesized distribution.

The counts of actual data observations in each bin are compared with the expected number of data points that would be in identical bins if the data exactly matched the hypothesized distribution.

Test Statistic Χ²

A Test Statistic called the Chi-Square Statistic, Χ², is calculated based upon the comparison of the counts of actual data points in each bin and the counts of expected data points in each of the bins. The formula for the Chi-Square Statistic is as follows:

(Click On Image To See a Larger Version)

n = the total number of bins that containing expected groupings of data points

Actual_i = the number of actual observed data points that fall into the ith bin

Expected_i = the number of expected data points in the ith bin if the data exactly matched hypothesized distribution.

Required Assumptions

The distribution of the Chi-Square Statistic, Χ², can be approximated by the Chi-Square distribution if the following 3 conditions are met:

1) n ≥ 5

2) The minimum expected number of data points in any of the bins is at least 1

3) The average number of expected data points in a bin is at least 5

Null Hypothesis

The Null Hypothesis of this hypothesis test states that Χ² = 0. This would mean that actual and expected counts of data points in each bin are the same. This Null Hypothesis is rejected if either of the following two equivalent conditions exist:

1) The Chi-Square Statistic is larger than the Critical Chi-Square Value

2) The p Value is smaller than the specified alpha.

Basic Excel Formulas

The formulas for the Critical Chi-Square Value and p Value in Excel are the following:

Critical Chi-Square Value = CHISQ.INV.RT(α, df)

p Value = CHISQ.DIST.RT(Chi-Square Statistic, df)

df = degrees of freedom and is calculated using one of two different formulas depending on which of the two types of GOF tests is being performed.

The Two Types of GOF Tests

1) Bin Sizes Are Pre-Determined

An example would be to test whether the weekly sales count is uniformly distributed throughout the seven days of the week. The actual sales count for each day would be compared with expected bins each containing one seventh of the total weekly sales count. The sales count for each day would be expected equal one-seventh of the week’s total sales count if sales were uniformly distributed throughout the seven week days. This type of a GOF test often starts with the actual observed data already allocated to bins. This is the case here in that actual sales are grouped at the start into bins each holding the sales of a separate day. This example will be performed shortly within this section.

df = n - 1

n = number of bins of expected data

This type of Chi-Square GOF Test will be performed in Excel in the next blog article.

2) Bin Sizes Arbitrarily Set To Match a Distribution

An example would be to perform a Chi-Square Goodness-of-Fit Test for normality on a large single group of data values. This type of a GOF test starts with the actual observed data in a single group and therefore not yet allocated to bins. The expected bins are created by establishing arbitrary CDF endpoints of each bin. The upper and lower CDF endpoints of each expected bin determine the total number of data points that should be placed in each of these expected bins. The actual data values will be grouped in bins whose endpoints match those of the expected bins. Standardizing the actual observed data points is a way of simplifying their bin allocation. The Chi-Square GOF Test for Normality will be performed in this section using this method.

df = n – 1 – m

n = number of bins of expected data

m = number of parameters needed to fully describe the distribution, e.g. m = 2 for the normal distribution, which is fully described by two parameters; the mean and standard deviation.

The Chi-Square Goodness-of-Fit Test For Normality will be performed in detail in Excel in a later article in this blog.

Excel Master Series Blog Directory

Click Here To See a List Of All

Statistical Topics And Articles In

This Blog

You Will Become an Excel Statistical Master!

4 comments:

Fustat to ThanjavurFebruary 16, 2016 at 1:32 PM
\\m = number of parameters needed to fully describe the distribution, e.g. m = 2 for the normal distribution, which is fully described by two parameters; the mean and standard deviation\\ In that case, Poisson distribution has an m-value of 1, weibull 3-parameter distribution has a m-value of 3. So what is the m-value of Exponential and other family of distributions ?
ReplyDelete
Replies
Rishi Raj GautamJanuary 8, 2018 at 6:31 AM
http://blog.excelmasterseries.com/2015/03/blog-directory-of-statistics-topics-and.html
ReplyDelete
Replies
miracle box crack latest versionApril 5, 2021 at 11:01 PM
outstanding blog post
ReplyDelete
Replies
Christopher HarrisFebruary 18, 2025 at 7:36 AM
Writing essays can be challenging, but PaperTyper https://papertyper.ai/ makes the process so much easier. As a student, I appreciate how this AI-powered tool helps generate ideas, refine structure, and check grammar effortlessly. The platform’s user-friendly interface and free features make it a great choice for students looking to improve their writing.
ReplyDelete
Replies

Add comment

Become an Excel Statistical Master

Excel Master Series - MBA-level statistics - Over 1,100+ Pages of Easy-To-Follow Instructions in Excel

It's a Full
Easy-To-Follow
MBA Course in Business Statistics

ALL IN EXCEL

&

MUCH Clearer

Than Your Text

Book

Download the
1,100+ Page Excel Statistical Master now

More Easy-To-

Follow eManuals

That You Will

Master Quickly

Step-By-Step Optimization With Excel Solver

What's In It?

For anyone who wants to be operating at a high level with the Excel Solver quickly, this is the book for you. Step-By-Step Optimization With Excel Solver is a 200+ page .pdf e-manual of simple yet thorough explanations on how to use the Excel Solver to solve today’s most widely known optimization problems. Loaded with screen shots that are coupled with easy-to-follow instructions, this book will simplify many difficult optimization problems and make you a master of the Excel Solver almost immediately.

Here are just some of the Solver optimization problems that are solved completely with simple-to-understand instructions and screen shots in this e-manual:

• The famous “Traveling Salesman” problem using Solver’s Alldifferent constraint and the Solver’s Evolutionary method to find the shortest path to reach all customers. This also provides an advanced use of the Excel INDEX function.

• The well-known “Knapsack Problem” which shows how optimize the use of limited space while satisfying numerous other criteria.

• How to perform nonlinear regression and curve-fitting on the Solver using the Solver’s GRG Nonlinear solving method.

• How to solve the “Cutting Stock Problem” faced by many manufacturing companies who are trying to determine the optimal way to cut sheets of material to minimize waste while satisfying customer orders.

• Portfolio optimization to maximize return or minimize risk.

• Venture capital investment selection using the Solver’s Binary constraint to maximize Net Present Value of selected cash flows at year 0. Clever use of the If-Then-Else statements makes this a simple problem.

• How use Solver to minimize the total cost of purchasing and shipping goods from multiple suppliers to multiple locations.

• How to optimize the selection of different production machine to minimize cost while fulfilling an order.

• How to optimally allocate a marketing budget to generate the greatest reach and frequency or number of inbound leads at the lowest cost.

Step-By-Step Optimization With Excel Solver has complete instructions and numerous tips on every aspect of operating the Excel Solver. You’ll fully understand the reports and know exactly how to tweek all of the Solver’s settings for total custom use. This e-manual also provides lots of inside advice and guidance on setting up the model in Excel so that it will be as simple and intuitive as possible to work with.

All of the optimization problems in this book are solved step-by-step using a 6-step process that works every time. In addition to detailed screen shots and easy-to-follow explanations on how to solve every optimization problem in the book, a link is provided to download an Excel workbook that has all problems completed exactly as they are in this e-manual.

Step-By-Step Optimization With Excel Solver is exactly the e-manual you need if you want to be optimizing at an advanced level with the Excel Solver quickly.

*******************

Become an Excel Statistical Master

It's a Full
Easy-To-Follow
MBA Course in Business Statistics

ALL IN EXCEL

&

MUCH Clearer

Than Your Text

Book

Download the
1,100+ Page Excel Statistical Master now

More Easy-To-

Follow eManuals

That You Will

Master Quickly

*******************

Become an Excel Statistical Master

It's a Full
Easy-To-Follow
MBA Course in Business Statistics

ALL IN EXCEL

&

MUCH Clearer

Than Your Text

Book

Download the
1,100+ Page Excel Statistical Master now

More Easy-To-

Follow eManuals

That You Will

Master Quickly

*******************

Become an Excel Statistical Master

It's a Full
Easy-To-Follow
MBA Course in Business Statistics

ALL IN EXCEL

&

MUCH Clearer

Than Your Text

Book

Download the
1,100+ Page Excel Statistical Master now

Immediate, Absolute, No-Questions-Asked, Money-Back Guarantee If Not TOTALLY, 100% Satisfied. In Other Words, If Any Excel Master Series eManual That You've Purchased Here Does Not Provide Instructions That Are CRYSTAL CLEAR and EASY TO UNDERSTAND, You Get All Of Your Money Back Immediately and Keep the eManual. Guaranteed!

Meet The Author