Monday, June 2, 2014

Overview of the t-Distribution

This is one of the following three articles about the t distribution in Excel

t Distribution’s PDF (Probability Density Function) in Excel 2010 and Excel 2013

t Distribution’s CDF (Cumulative Distribution Function) in Excel 2010 and Excel 2013

Overview of the t-

Distribution

The t-Distribution is used much more often than the normal distribution to perform several basic parametric statistical tests such as hypothesis tests of a population mean and confidence intervals of a population mean. Requirements for statistical tests are generally less rigorous when a statistical test can be based upon the t-Distribution instead of the normal distribution.

The t-Distribution (also called the Student’s t-Distribution) describes the distribution of a sample taken from a normally-distributed population when the population standard deviation is unknown. The t-Distribution closely resembles the standard normal distribution (the normal distribution when the mean equals zero and the standard deviation equals one) except that the t-Distribution’s outer tails have more weight (are thicker) and its mean has a lower peak than the standard normal distribution.

As sample size increases, the t-Distribution converges to (more closely resembles) the standard normal distribution. When the sample size becomes large (n > 30), the t-Distribution almost exactly resembles the standard normal distribution.

Following is an Excel-generated image of the PDF (Probability Density Function) of the t-Distribution with a very low degrees of freedom. Sample size, n, equals 3 and degrees of freedom, df, equals n – 1 = 2. The PDF of the t-Distribution has a lower peak and thicker tails when its degrees of freedom is small.

(Click On Image To See a Larger Version)

As the degrees of freedom increases, the PDF of the t-Distribution converges toward (resembles more and more) the standard normal distribution. The standard normal distribution is a normal distribution curve with its mean, μ, equal to zero and its standard deviation, σ, equal to one. the standard normal curve is a special case of the t-Distribution with a sample size, n, equal to infinity. Note how the shape of the PDF curve of the t-Distribution changes as the sample size increases from n = 3 to n = ∞. The height of the peak over the mean has risen significantly and the tails are quite a bit thinner. This is shown in the following Excel-generated image:

(Click On Image To See a Larger Version)

These differences are also reflected the CDF (Cumulative Distribution Function) graphs of the t-Distribution with the same degrees of freedom. The t-Distribution’s CDF approaches its asymptotic values of 0 and 1 much further from t =0 at smaller degrees of freedom than for larger degrees of freedom.

The t-Distribution CDF graph with only 2 degrees of freedom is still a significant distance from the asymptotic values of 0 and 1 at 3 standard errors above and below t = 0. This is shown in the following Excel-generated graph:

(Click On Image To See a Larger Version)

The t-Distribution CDF graph with the degrees of freedom equaling its highest possible value of ∞ has nearly reached the asymptotic values of 0 and 1 quite a bit closer than 3 standard errors above and below t = 0. This is shown in the following Excel-generated graph:

(Click On Image To See a Larger Version)

The t-Distribution’s CDF is simply a graph of the accumulation of its PDF as the t Value goes from -∞ to +∞. The t-Distribution’s PDF is bell-shaped and symmetrical about a t Value of 0. The t-Distribution’s CDF will therefore show that 50 percent of the area under the PDF curve, F(t,v) = 0.50, occurs at the t Value of 0.

History of the t-Distribution

One of the most colorful and well-known stories in the annals of statistics is the origin of the name of the Student’s t-Distribution, which the t-Distribution is often called. This distribution was first presented in the English language by William Sealy Gosset under the pseudonym “Student” in his article “The probable error of a mean” in the scientific journal Biometrika in March 1908. At the time Gosset was employed at the Guinness Brewery in Dublin, Ireland and was studying the nature of small samples of brewery ingredients such as barley. Gosset published his article under the pen name “Student” because his employer either did not allow staff to publish scientific papers or did not want competitors to know that the Guinness Brewery was using this test on small samples of raw materials.

The name “Student’s distribution” was conferred on the distribution by Ronald Fisher in his 1925 article “Applications of Student’s distribution.” This article also assigned the label “t” to value of the Test Statistic for this distribution.

Prior to Gosset’s English-language introduction, the t-Distribution was first described by German mathematicians Friedrich Helmert and Jacob Lüroth in 1876.

Properties of the t-Distribution

- The mean of the t-Distribution equals 0. The t-Distribution forms a bell-shaped curve about a mean of 0. This differs from the normal distribution because the normal distribution can be symmetrical about a mean of any real number.

- The variance of the t-Distribution is equal to v / (v – 2) when v (Greek letter “nu”) exceeds 2. v equals the degrees of freedom, which equals the sample size minus 1. The variance is always greater than 1 but converges to 1 as the sample size gets larger. The t-Distribution converges to the standard normal distribution (which has a variance equaling 1) as sample size approaches infinity.

- The standard error of t-Distribution is equal to the sample standard deviation divided by the square root of the sample size.

- The t-Distribution has only one parameter which is the degrees of freedom. Degrees of freedom is usually designated as df or ν (Greek letter “nu”). This differs from the normal distribution because the normal distribution is described by the following two parameters: mean (μ - Greek letter “mu”) and standard deviation (σ - Greek letter “sigma”).

- The graph of the t-Distribution is symmetrical about a mean of 0 and the units on its horizontal axis describing the distance from the mean of 0 are units of standard errors. This differs from the normal distribution because the normal distribution can be symmetrical about a mean of any real number and the units of its horizontal axis describing the distance from its mean use the same real number scale in which the mean was measured. The t-Distribution’s PDF or CDF at any real number X requires that the X value be converted to the number of standard errors that the X value is from the sample mean. The standard error is equal to the sample standard deviation divided by the square root of the sample size.

The t-Distribution can be used to describe any statistic that has a bell-shaped distribution, i.e., unimodal, symmetrical, and without significant outliers. The t-Distribution is used to analyze samples taken from a normally-distributed population when either of the following is true:

1) Small size is small (n < 30).

2) The population standard deviation is not known, which is often the case.

The t-Distribution is used much more often than the normal distribution when performing hypothesis tests or creating confidence intervals based upon samples taken from a normally-distributed population. If a sample is found to be normally distributed, its population is assumed to be normally distributed.

The t-Distribution more closely describes the distribution of a small sample (n < 30) taken from a normally-distributed population than the normal distribution does. Small samples taken from a normally-distributed population have a slightly higher probability that sample values will occupy the outer tails than do larger samples. The t-Distribution has slightly thicker tails and a lower peak than does the normal distribution. The t-Distribution is therefore used to describe the distribution of small samples taken from a normally-distributed population.

The extra weight in the outer tails of the t-Distribution accounts for the additional uncertainty of having to use the sample standard deviation to estimate the population standard deviation. This estimate becomes more uncertain as sample size decreases. the t-Distribution’s shape reflects that as its outer tails become thicker as sample size decreases.

The t-Distribution can be used to perform hypothesis tests or create confidence intervals of a normally-distributed population when the population standard deviation is not known. The normal distribution should not be used for these types of analysis when the population standard deviation is not known. In the real world it is much more common scenario that the standard deviation of the population from which the sample was drawn is not known.

Excel Master Series Blog Directory

Click Here To See a List Of All

Statistical Topics And Articles In

This Blog

You Will Become an Excel Statistical Master!

5 comments:

salmaMarch 12, 2020 at 1:24 PM

شركة شحن عفش من السعودية الى الاردن

ارخص نقل عفش بمكة ارخص نقل عفش بمكة
نقل عفش من جدة الى الدمام نقل عفش من جدة الى الدمام
نقل عفش من الرياض الى المدينة المنورة نقل عفش من الرياض الى المدينة المنورة

ReplyDelete
Replies
LizaNovember 22, 2021 at 2:24 AM
Looking for an assignment writing related solution? Failed and tired of wasting time is searching for budget-friendly and genuine assignment writing service? Well if you reached this page then we recommend you to forget all your worries. As we Treat Assignment Help is there to help to drag you out from nerve-wracking assignment worries. We got budget-friendly and UK expert-written assignment services just for you.
ReplyDelete
Replies
Nick HunterFebruary 9, 2022 at 2:24 AM
I am a statistics student, and I have to complete my assignment on T- distribution. Suddenly, I came across your post, and your provided information helped me out. I completed my assignment. I am going to share your post link with my class-fellows. I know that this information is helpful for all statistics students. On the other hand, some students take PhD dissertation help while writing their final year dissertations.
ReplyDelete
Replies
British Dissertation HelpMay 26, 2022 at 5:20 AM
Medical dissertation or Nursing dissertation is considered as the most difficult task. If you are a medical student and searching for the best Nursing dissertation solution, then you must opt for the nursing dissertation help services provided by the British dissertation help team. The effectiveness of our dissertation expert team will also support you to maintain all the techniques to write a proper nursing dissertation. Our experts utilize different analytical tools like SPSS, Microsoft project, Project Libre, NVivo to complete a research paper. We also provide the free editing and proofreading services to the students, so it helps to make more improvements in the quality of writing.
ReplyDelete
Replies
Angel17June 17, 2023 at 11:45 PM
I find this post so useful. I learned a lot of insights. https://goo.gl/maps/q5EYTnKjwEviHJuYA
ReplyDelete
Replies

Add comment

Become an Excel Statistical Master

Excel Master Series - MBA-level statistics - Over 1,100+ Pages of Easy-To-Follow Instructions in Excel

It's a Full
Easy-To-Follow
MBA Course in Business Statistics

ALL IN EXCEL

&

MUCH Clearer

Than Your Text

Book

Download the
1,100+ Page Excel Statistical Master now

More Easy-To-

Follow eManuals

That You Will

Master Quickly

Step-By-Step Optimization With Excel Solver

What's In It?

For anyone who wants to be operating at a high level with the Excel Solver quickly, this is the book for you. Step-By-Step Optimization With Excel Solver is a 200+ page .pdf e-manual of simple yet thorough explanations on how to use the Excel Solver to solve today’s most widely known optimization problems. Loaded with screen shots that are coupled with easy-to-follow instructions, this book will simplify many difficult optimization problems and make you a master of the Excel Solver almost immediately.

Here are just some of the Solver optimization problems that are solved completely with simple-to-understand instructions and screen shots in this e-manual:

• The famous “Traveling Salesman” problem using Solver’s Alldifferent constraint and the Solver’s Evolutionary method to find the shortest path to reach all customers. This also provides an advanced use of the Excel INDEX function.

• The well-known “Knapsack Problem” which shows how optimize the use of limited space while satisfying numerous other criteria.

• How to perform nonlinear regression and curve-fitting on the Solver using the Solver’s GRG Nonlinear solving method.

• How to solve the “Cutting Stock Problem” faced by many manufacturing companies who are trying to determine the optimal way to cut sheets of material to minimize waste while satisfying customer orders.

• Portfolio optimization to maximize return or minimize risk.

• Venture capital investment selection using the Solver’s Binary constraint to maximize Net Present Value of selected cash flows at year 0. Clever use of the If-Then-Else statements makes this a simple problem.

• How use Solver to minimize the total cost of purchasing and shipping goods from multiple suppliers to multiple locations.

• How to optimize the selection of different production machine to minimize cost while fulfilling an order.

• How to optimally allocate a marketing budget to generate the greatest reach and frequency or number of inbound leads at the lowest cost.

Step-By-Step Optimization With Excel Solver has complete instructions and numerous tips on every aspect of operating the Excel Solver. You’ll fully understand the reports and know exactly how to tweek all of the Solver’s settings for total custom use. This e-manual also provides lots of inside advice and guidance on setting up the model in Excel so that it will be as simple and intuitive as possible to work with.

All of the optimization problems in this book are solved step-by-step using a 6-step process that works every time. In addition to detailed screen shots and easy-to-follow explanations on how to solve every optimization problem in the book, a link is provided to download an Excel workbook that has all problems completed exactly as they are in this e-manual.

Step-By-Step Optimization With Excel Solver is exactly the e-manual you need if you want to be optimizing at an advanced level with the Excel Solver quickly.

*******************

Become an Excel Statistical Master

It's a Full
Easy-To-Follow
MBA Course in Business Statistics

ALL IN EXCEL

&

MUCH Clearer

Than Your Text

Book

Download the
1,100+ Page Excel Statistical Master now

More Easy-To-

Follow eManuals

That You Will

Master Quickly

*******************

Become an Excel Statistical Master

It's a Full
Easy-To-Follow
MBA Course in Business Statistics

ALL IN EXCEL

&

MUCH Clearer

Than Your Text

Book

Download the
1,100+ Page Excel Statistical Master now

More Easy-To-

Follow eManuals

That You Will

Master Quickly

*******************

Become an Excel Statistical Master

It's a Full
Easy-To-Follow
MBA Course in Business Statistics

ALL IN EXCEL

&

MUCH Clearer

Than Your Text

Book

Download the
1,100+ Page Excel Statistical Master now

Immediate, Absolute, No-Questions-Asked, Money-Back Guarantee If Not TOTALLY, 100% Satisfied. In Other Words, If Any Excel Master Series eManual That You've Purchased Here Does Not Provide Instructions That Are CRYSTAL CLEAR and EASY TO UNDERSTAND, You Get All Of Your Money Back Immediately and Keep the eManual. Guaranteed!

Meet The Author