Tuesday, June 3, 2014

Standard Normal Distribution in Excel 2010 and Excel 2013

This is one of the following eight articles on the normal distribution in Excel

Overview of the Normal Distribution

Normal Distribution’s PDF (Probability Density Function) in Excel 2010 and Excel 2013

Normal Distribution’s CDF (Cumulative Distribution Function) in Excel 2010 and Excel 2013

Solving Normal Distribution Problems in Excel 2010 and Excel 2013

Overview of the Standard Normal Distribution in Excel 2010 and Excel 2013

An Important Difference Between the t and Normal Distribution Graphs

The Empirical Rule and Chebyshev’s Theorem in Excel – Calculating How Much Data Is a Certain Distance From the Mean

Demonstrating the Central Limit Theorem In Excel 2010 and Excel 2013 In An Easy-To-Understand Way

Overview of the Standard

Normal Distribution

The normal distribution is actually a family of distributions. Each unique normal distribution can be fully described by its two parameters, which are the following:

1) Its population mean, µ, which is a location parameter.

2) Its population standard deviation, σ, which is a scale parameter.

The most basic normal distribution is called the Standard Normal Distribution. Its population mean, µ, equals 0 and its population standard deviation, σ, equals 1.

The PDF curve of the Standard Normal Distribution appears as follows in this Excel-generated graph:

(Click On Image To See a Larger Version)

The CDF curve of the Standard Normal Distribution appears as follows in this Excel-generated graph:

(Click On Image To See a Larger Version)

When the population mean, µ, equals 0 and the population standard deviation, σ, equals 1, the x values equal the number of standard deviations that each x is from the mean. In this situation, each x value equals its z score. A data point’s z score equals the number of population standard deviations that the point is from the population mean. The formula to calculate an x value’s z score is the following:

(Click On Image To See a Larger Version)

Every unique normal distribution becomes to the Standard Normal Distribution when the x values are converted to z scores. The Standard Normal Distribution (with zero population mean and unit population standard deviation) is sometimes referred to as the standard Gaussian distribution or the unit normal distribution is denoted by the Greek letter “phi” as follows:

φ(x) (small “phi”) denotes the PDF of the standard normal distribution at x, which also equals z score(x) because σ = 1.

Φ(x) (capital “phi”) denotes the CDF of the standard normal distribution at x, which also equals z score(x) because σ = 1.

When µ = 0 and σ = 1, an X value is equal to its z score. If µ = 0 and σ = 1, then the Excel formulas NORM.DIST(X,µ,σ, TRUE or FALSE) can be replaced by the simpler Excel formula NORM.S.DIST(z, TRUE or FALSE)

Calculating the PDF of a

Standardized Normal Distribution in

Excel

X = 2

µ = 0

σ = 1

f(X = 2, µ = 0, σ = 1) = NORM.DIST(2,0,1,FALSE) = 0.05399

f(X = 2, µ = 0, σ = 1) = NORM.S.DIST(z score(x),FALSE)

(Click On Image To See a Larger Version)

z score(x) = (2 – 0)/1 = 2

φ(2) = f(X = 2, µ = 0, σ = 1)

f(X = 2, µ = 0, σ = 1) = NORM.S.DIST(2,FALSE) = 0.05399

φ(2) = 0.05399

This is shown in the following Excel-generated graph:

(Click On Image To See a Larger Version)

Calculating the CDF of a

Standardized Normal Distribution in

Excel

X = 2

µ = 0

σ = 1

F(X = 2, µ = 0, σ = 1) = NORM.DIST(2,0,1,TRUE) = 0.9773

F(X = 2, µ = 0, σ = 1) = NORM.S.DIST(z score(x),TRUE)

(Click To See a Larger Version)

z score(x) = (2 – 0)/1 = 2

Φ(2) = F(X = 2, µ = 0, σ = 1)

F(X = 2, µ = 0, σ = 1) = NORM.S.DIST(2,TRUE) = 0.9773

Φ(2) = 0.9773

This is shown in the following Excel-generated graph:

(Click On Image To See a Larger Version)

Here are some properties of the CDF of the standard normal distribution

Φ(-∞) = 0 = 0%

Φ(0) = 0.5 = 50%

Φ(∞) = 1 = 100%

Φ(X) = 1 - Φ(-X) and therefore Φ(X) + Φ(-X) = 100%

The t-Distribution’s Convergence to

the Standard Normal Distribution

The t-Distribution has the following two important similarities to the standard normal distribution:

1) Both the t-Distribution and the standard normal distribution are centered about means of 0. One difference between the t-Distribution and the normal distribution is that the normal distribution can assume any value as its mean. The t-Distribution is always symmetrical about a mean of 0, as is the standard normal distribution.

2) The horizontal axis of the t-Distribution is measured in units of t value. The t-value of a point is the number of sample standard errors that the point is from the mean. The horizontal axis of the standard normal distribution is measured in units of z Value, i.e., the number of population standard deviations that the point is from the mean. This is the result of the standard normal distribution’s population standard deviation being set to the unit value of 1. The formulas for z score(x) and t value(x) are shown as follows:

The z score of a randomly-sampled point, x, from a normally-distributed population is calculated as follows:

(Click On Image To See a Larger Version)

The z score of point x taken from a large sample (n > 30) from a population of unknown distribution is calculated as follows:

(Click On Image To See a Larger Version)

The t value of a point x taken from a small sample (n < 30) of a normally-distributed population or a large sample from a population of unknown distribution is calculated as follows:

(Click On Image To See a Larger Version)

When sample size is large, the means of large, similar-sized random samples are normal-distributed regardless of the distribution of the underlying population as per the Central Limit Theorem.

The Standard Error (SE) that is calculated for the t value using the sample standard deviation, s, is an estimate of actual SE that would be calculated with the population standard deviation, σ.

The z Value (z score) is the unit of measure of the horizontal axis of the standard normal distribution and the t value is the unit of measure of the horizontal axis of the t-Distribution.

z score(x) is the number of population standard deviations that a point x is from the population mean. t value(x) is the number of standard errors that point x is from the sample mean.

It is important to note that z scores are created using population parameters µ (population mean) and σ (population standard deviation). t values are created using the sample statistics x_bar (sample mean), s (sample standard deviation), and n (sample size).

The underlying reason for this is that the normal distribution is used to analyze normally-distributed data only when population parameters µ and σ are known. The t-Distribution is used to analyze normally-distributed data when only sample statistics x_bar, s, and n are known. It is much more common in the real world that only sample statistics are known so the t-Distribution is often the tool of choice for analyzing normally-distributed data.

The t-Distribution has only a single parameter: v, the degrees of freedom, which is equal to v = n – 1. The t-Distribution’s shape changes as sample size, n, changes. Very low values of n (very small sample sizes) produce a t-Distribution PDF graph with wider tails and a lower peak. The follow Excel-generated graph shows a t-Distribution’s PDF curve with a sample size of n = 3:

(Click On Image To See a Larger Version)

As sample increases, the t-Distribution’s shape changes as its peak rises and less weight remains in the outer tails. The t-Distribution converges to exactly resemble the standard normal distribution when the sample size is large enough. The follow Excel-generated graph shows a t-Distribution’s PDF curve with a sample size n approaches infinity:

(Click On Image To See a Larger Version)

The PDF curve of the standard normal distribution shows an exact match:

(Click On Image To See a Larger Version)

It should be noted that a number of texts incorrectly state the t-Distribution converges to the normal distribution as sample size increases. The normal distribution represents a family of distribution curves each having a unique combination of µ and σ. The specific normal curve needs to be specified. The correct statement would be that the t-Distribution converges to the standard normal distribution as sample size increases.

Excel Master Series Blog Directory

Click Here To See a List Of All

Statistical Topics And Articles In

This Blog

You Will Become an Excel Statistical Master!

3 comments:

salmaMarch 12, 2020 at 12:02 PM

تنظيف منازل بالدمام تنظيف منازل بالدمام
تنظيف منازل بالاحساء تنظيف منازل بالاحساء
تنظيف منازل بمكة تنظيف منازل بمكة
تنظيف منازل بجدة تنظيف منازل بجدة
تنظيف منازل بالمدينة المنورة تنظيف منازل بالمدينة المنورة

تنظيف بمكة شركة تنظيف بمكة بالبخار
تنظيف بالمدينة المنورة تنظيف بالمدينة المنورة
تنظيف بالجبيل افضل شركة تنظيف بالجبيل
تنظيف بالدمام مؤسسة تنظيف بالدمام
تنظيف بالخبر ارخص شركة تنظيف بالخبر

ReplyDelete
Replies
please help with my homeworkNovember 8, 2021 at 11:50 PM
very nice post, Youve done a brilliant job making sure that people understand where youre coming from. And let me tell you, I get it. huge stuff and I cant wait to check out more of your websites
ReplyDelete
Replies
Angel17June 17, 2023 at 8:54 AM
This post is so useful. You can help a lot of people on this one. fence installation Amarillo, TX
ReplyDelete
Replies

Add comment

Become an Excel Statistical Master

Excel Master Series - MBA-level statistics - Over 1,100+ Pages of Easy-To-Follow Instructions in Excel

It's a Full
Easy-To-Follow
MBA Course in Business Statistics

ALL IN EXCEL

&

MUCH Clearer

Than Your Text

Book

Download the
1,100+ Page Excel Statistical Master now

More Easy-To-

Follow eManuals

That You Will

Master Quickly

Step-By-Step Optimization With Excel Solver

What's In It?

For anyone who wants to be operating at a high level with the Excel Solver quickly, this is the book for you. Step-By-Step Optimization With Excel Solver is a 200+ page .pdf e-manual of simple yet thorough explanations on how to use the Excel Solver to solve today’s most widely known optimization problems. Loaded with screen shots that are coupled with easy-to-follow instructions, this book will simplify many difficult optimization problems and make you a master of the Excel Solver almost immediately.

Here are just some of the Solver optimization problems that are solved completely with simple-to-understand instructions and screen shots in this e-manual:

• The famous “Traveling Salesman” problem using Solver’s Alldifferent constraint and the Solver’s Evolutionary method to find the shortest path to reach all customers. This also provides an advanced use of the Excel INDEX function.

• The well-known “Knapsack Problem” which shows how optimize the use of limited space while satisfying numerous other criteria.

• How to perform nonlinear regression and curve-fitting on the Solver using the Solver’s GRG Nonlinear solving method.

• How to solve the “Cutting Stock Problem” faced by many manufacturing companies who are trying to determine the optimal way to cut sheets of material to minimize waste while satisfying customer orders.

• Portfolio optimization to maximize return or minimize risk.

• Venture capital investment selection using the Solver’s Binary constraint to maximize Net Present Value of selected cash flows at year 0. Clever use of the If-Then-Else statements makes this a simple problem.

• How use Solver to minimize the total cost of purchasing and shipping goods from multiple suppliers to multiple locations.

• How to optimize the selection of different production machine to minimize cost while fulfilling an order.

• How to optimally allocate a marketing budget to generate the greatest reach and frequency or number of inbound leads at the lowest cost.

Step-By-Step Optimization With Excel Solver has complete instructions and numerous tips on every aspect of operating the Excel Solver. You’ll fully understand the reports and know exactly how to tweek all of the Solver’s settings for total custom use. This e-manual also provides lots of inside advice and guidance on setting up the model in Excel so that it will be as simple and intuitive as possible to work with.

All of the optimization problems in this book are solved step-by-step using a 6-step process that works every time. In addition to detailed screen shots and easy-to-follow explanations on how to solve every optimization problem in the book, a link is provided to download an Excel workbook that has all problems completed exactly as they are in this e-manual.

Step-By-Step Optimization With Excel Solver is exactly the e-manual you need if you want to be optimizing at an advanced level with the Excel Solver quickly.

*******************

Become an Excel Statistical Master

It's a Full
Easy-To-Follow
MBA Course in Business Statistics

ALL IN EXCEL

&

MUCH Clearer

Than Your Text

Book

Download the
1,100+ Page Excel Statistical Master now

More Easy-To-

Follow eManuals

That You Will

Master Quickly

*******************

Become an Excel Statistical Master

It's a Full
Easy-To-Follow
MBA Course in Business Statistics

ALL IN EXCEL

&

MUCH Clearer

Than Your Text

Book

Download the
1,100+ Page Excel Statistical Master now

More Easy-To-

Follow eManuals

That You Will

Master Quickly

*******************

Become an Excel Statistical Master

It's a Full
Easy-To-Follow
MBA Course in Business Statistics

ALL IN EXCEL

&

MUCH Clearer

Than Your Text

Book

Download the
1,100+ Page Excel Statistical Master now

Immediate, Absolute, No-Questions-Asked, Money-Back Guarantee If Not TOTALLY, 100% Satisfied. In Other Words, If Any Excel Master Series eManual That You've Purchased Here Does Not Provide Instructions That Are CRYSTAL CLEAR and EASY TO UNDERSTAND, You Get All Of Your Money Back Immediately and Keep the eManual. Guaranteed!

Meet The Author