Wednesday, July 28, 2010

Every Hypothesis Test

Done in Excel in 4 Steps

As an Internet marketing manager, I use hypothesis testing all the time. The hypothesis test in Excel has quite a few great marketing uses, which I will explain in detail in future articles on this blog. For one very useful application, check out this blog's article on how to construct a split-tester in Excel that is better than the Google Website Optimizer. The basic test behind this split-tester (and the Google Website Optimizer) is a hypothesis test.

Hypothesis Test Determines if Something Changed

In a nutshell, a hypothesis test is used to determine whether something really has changed. For example, maybe you changed your Internet marketing program slightly and you want to determine with 95% certainty whether the sales results you've noticed were caused by your changes or are just the result of random chance. The hypothesis test is the perfect tool to quickly answer that question. I will go so far as to say that the hypothesis test is my favorite Internet marketing statistical tool.

Hypothesis Test - Solved With 4-Step Framework

Right now I would like to present a 4-step framework that can be used to solve ALL hypothesis tests. I have not seen this framework presented anywhere else, but it definitely works for every type of hypothesis test.

Hypothesis Test Must 1st Be Classified

Before you can begin the 4-step procedure, you must classify the hypothesis test you are about to perform. Every hypothesis test is classified in 4 separate ways, and each classification slightly changes how the 4-step method is applied. You therefore must determine the type of hypothesis test up front so you will know exactly how to apply the 4-step method. The 4 classifications are as follows:

Problem Classification:

For each of the four classifications below, select the option that fits your hypothesis problem:

1) Mean Testing vs. Proportion Testing

• Proportion test samples have only two possible outcomes.
• Mean test samples have multiple possible outcomes.

2) One-Tailed vs. Two-Tailed Testing

    • Two-tailed tests determine whether two means are merely different.
    • One-tailed tests determine whether one mean is different in one
         direction.

3) One-Sample vs. Two Sample Testing

    • One sample is taken if original or "Before" comparison data is
           available.
    • Two samples are taken if no comparison data is available.

4) Unpaired Data Testing vs. Paired Data Testing

    • Paired data testing can be performed if "Before" and "After" data
         are collected from the same objects. Mean testing can be
         performed on paired data - Proportion testing cannot.
    • Unpaired data testing is performed on data collected in groups.

Here below is a more detailed explanation of the above classifications:

1) Mean testing vs. Proportion testing -

This is the most important distinction that must be made. Mean testing and proportion testing are both solved using the same 4-step method but use different formulas.

Mean testing – Hypothesis tests of mean use samples that can take a range of values. For example, you are testing to determine if sales have gone up over the course of a month. The sampled daily sales can have a wide range of values.

Proportion testing – Hypothesis tests of proportion use samples that can have only 2 values. For example, you are testing to determine whether new keywords in a Google AdWords ad group have increased conversion rate. You are sampling whether or not a click converted. Your sample has only 2 possible values. The click either converted or it didn’t.
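To make the "different formulas" point concrete, here is a minimal Python sketch of the standard one-sample z statistics for a mean and for a proportion. The function names and all of the numbers are made up for illustration and do not come from this article. The mean version assumes the population standard deviation is known, and the proportion version assumes a sample large enough for the normal approximation.

```python
from math import sqrt


def z_for_mean(sample_mean, claimed_mean, population_sd, n):
    """z statistic for a one-sample test of mean (population SD assumed known)."""
    standard_error = population_sd / sqrt(n)
    return (sample_mean - claimed_mean) / standard_error


def z_for_proportion(successes, n, claimed_proportion):
    """z statistic for a one-sample test of proportion (large-sample approximation)."""
    sample_proportion = successes / n
    standard_error = sqrt(claimed_proportion * (1 - claimed_proportion) / n)
    return (sample_proportion - claimed_proportion) / standard_error


# Hypothetical mean test: daily sales over 64 days, claimed mean 1000, SD 200.
z_mean = z_for_mean(sample_mean=1050.0, claimed_mean=1000.0,
                    population_sd=200.0, n=64)

# Hypothetical proportion test: 60 conversions out of 1000 clicks,
# compared against a claimed 5% conversion rate.
z_prop = z_for_proportion(successes=60, n=1000, claimed_proportion=0.05)

print(f"z for mean test:       {z_mean:.3f}")
print(f"z for proportion test: {z_prop:.3f}")
```

Both functions have the same shape (observed value minus claimed value, divided by a standard error). Only the standard error is computed differently.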

2) One-tailed vs. Two-tailed testing

– This depends upon whether you are using the hypothesis test to determine whether the mean or proportion of one sampled group is merely different from the mean or proportion of another sampled group, or whether it is specifically different in one direction – whether it is larger or smaller.

One-tailed test – You are testing to determine if the mean or proportion of one sampled group is different in one specific direction than the mean or proportion of the other sampled group.

Two-tailed test – You only want to know if the mean or proportion of one group is different than that of the other group, but aren’t testing for direction.
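The practical effect of this choice shows up in the cutoff on the normal curve. The sketch below uses Python's standard-library NormalDist and an illustrative 95% certainty level to compare the two cutoffs. The helper name critical_z is hypothetical and is not something defined in this article.

```python
from statistics import NormalDist


def critical_z(certainty, tails):
    """Return the positive z cutoff for the given certainty level.

    tails=1 puts the whole significance level in one tail;
    tails=2 splits it evenly between both tails.
    """
    alpha = 1 - certainty
    if tails == 1:
        return NormalDist().inv_cdf(1 - alpha)
    if tails == 2:
        return NormalDist().inv_cdf(1 - alpha / 2)
    raise ValueError("tails must be 1 or 2")


one_tailed = critical_z(0.95, tails=1)
two_tailed = critical_z(0.95, tails=2)

print(f"one-tailed cutoff at 95%: {one_tailed:.3f}")
print(f"two-tailed cutoff at 95%: {two_tailed:.3f}")
```

At the same certainty level, the two-tailed cutoff sits farther from the center because the significance level is split between the two tails.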

3) One-sample vs. two-sample testing

– Whether you need to take one sample or two samples depends on whether original or “before” data is already available. One-sample testing is performed if it is. Two-sample testing is performed if no “before” data is available, so data must be collected for both sides of the comparison.

4) Unpaired data testing vs. paired data testing

– Most hypothesis tests use unpaired data. Whether data is paired or unpaired depends on whether both samples were collected from the same objects or not.

Paired data testing – An example of this would be “before” and “after” testing of the same objects. For example, you measure the same items before and after a change to determine whether sales really went up. Paired data testing can only be performed for a hypothesis test of mean, not proportion.

Unpaired data testing – Groups of unpaired data are treated independently of each other.
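As a rough illustration of what "collected from the same objects" means in practice, the sketch below reduces paired "before" and "after" measurements to one difference per object and then tests those differences as a single sample. This is the standard textbook treatment of paired data, not necessarily the exact procedure used later in this blog. The data values are entirely made up, and the statistic shown is a t statistic because the sample is small and its standard deviation is estimated from the data.

```python
from math import sqrt
from statistics import mean, stdev

# Hypothetical paired data: the same five objects measured twice.
before = [10, 12, 9, 11, 13]
after = [12, 14, 10, 12, 15]

# Pairing means each "after" value is matched to its own "before" value.
differences = [a - b for a, b in zip(after, before)]

mean_difference = mean(differences)
sd_difference = stdev(differences)  # sample standard deviation
standard_error = sd_difference / sqrt(len(differences))

# Test statistic for the null hypothesis "mean difference is zero".
t_statistic = mean_difference / standard_error

print(f"differences:     {differences}")
print(f"mean difference: {mean_difference:.2f}")
print(f"t statistic:     {t_statistic:.3f}")
```

With unpaired data there is no such matching, so the two groups would each be summarized separately instead of being collapsed into differences.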

The 4-Step Method To Solve ALL Hypothesis Tests

After having classified the hypothesis test according to the 4 categories, you are now ready to perform the 4-step method. In summary, the steps are as follows:

1) Create the Null and Alternate Hypotheses

2) Map the Normal Curve

- Showing the Distribution of the Variable Used by the Null Hypothesis.

3) Map the Region of Certainty

– The Area Under the Normal Curve That Corresponds With the Degree of Certainty You Require For Your Hypothesis Test.

4) Perform Either the Critical Value Test or the P Value Test

– to Determine Whether To Reject or Fail To Reject the Null Hypothesis

Without going into too much detail, we will take a brief look at solving a hypothesis test using the 4-step method.

Problem - One-Tailed, One-Sample, Unpaired Hypothesis Test of Mean

Testing whether a delivery time has gotten worse

Problem: A furniture company states that its average delivery time is 15 days with a (population) standard deviation of 4 days. A random sample of 50 deliveries showed an average delivery time of 17 days. Determine within 98% certainty (0.02 significance level) whether delivery time has increased.
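As a preview of where the 4 steps lead for this problem, here is a minimal Python sketch that plugs the numbers above into a one-tailed, one-sample z test. It uses Python's standard library rather than Excel, the variable names are my own, and it assumes the sample mean is normally distributed around the stated 15-day average.

```python
from math import sqrt
from statistics import NormalDist

# Values taken from the problem statement.
claimed_mean = 15.0    # stated average delivery time, days
population_sd = 4.0    # stated population standard deviation, days
n = 50                 # number of sampled deliveries
sample_mean = 17.0     # observed average delivery time, days
alpha = 0.02           # significance level for 98% certainty

# Step 1: null hypothesis is mean = 15; alternate hypothesis is mean > 15.
# Step 2: map the normal curve of the sample mean under the null hypothesis.
standard_error = population_sd / sqrt(n)
null_curve = NormalDist(mu=claimed_mean, sigma=standard_error)

# Step 3: the region of certainty ends at the 98th percentile (one tail).
critical_value = null_curve.inv_cdf(1 - alpha)

# Step 4: critical value test and p value test.
p_value = 1 - null_curve.cdf(sample_mean)
reject_null = sample_mean > critical_value

print(f"standard error: {standard_error:.4f} days")
print(f"critical value: {critical_value:.2f} days")
print(f"p value:        {p_value:.6f}")
print(f"reject null:    {reject_null}")
```

The observed 17 days falls beyond the critical value of roughly 16.16 days, and the p value is far below 0.02, so both tests agree that the null hypothesis is rejected.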

SOLUTION:

We know that this is a test of mean and not proportion because each individual sample taken can have a wide range of values: any delivery time from 12 to 18 days is probably reasonable.

We know that this is a one-tailed test because we are trying to determine if the "After Data" mean delivery time is larger than the "Before Data" mean delivery time, not whether the mean delivery times are merely different.

We know that only one sample needs to be taken because the "Before" population data (the stated average and standard deviation) is already available.

This is unpaired data because groups are sampled independently. Below is the Before and After sample data: