
Overview of Prediction Interval of Multiple Regression In Excel

A prediction interval is a confidence interval about a Y value that is estimated from a regression equation. A regression prediction interval is a value range above and below the Y estimate calculated by the regression equation that would contain the actual value of a sample with, for example, 95 percent certainty.

The Prediction Error for a point estimate of Y is always slightly larger than the Standard Error of the Regression Equation shown in the Excel regression output directly under Adjusted R Square.

The Standard Error of the Regression Equation is used to calculate a confidence interval about the mean Y value. The Prediction Error is used to create a confidence interval about a predicted Y value. There will always be slightly more uncertainty in predicting an individual Y value than in estimating the mean Y value.

For that reason, a Prediction Interval will always be wider than a Confidence Interval for any type of regression analysis.

Calculating an exact prediction interval for any regression with more than one independent variable (multiple regression) involves some pretty heavy-duty matrix algebra. Fortunately there is an easy short-cut that can be applied to multiple regression that will give a fairly accurate estimate of the prediction interval.

Prediction Interval Formula

A prediction interval about an estimated Y value (a Y value calculated from the regression equation) is given by the following formula:

Prediction Interval = Y_est ± t-Value_α/2 * Prediction Error

Prediction Error = Standard Error of the Regression * SQRT(1 + distance value)

Distance value, sometimes called leverage value, is a measure of how far the combination of values x₁, x₂,…, x_k lies from the center of the observed data. Calculation of Distance value for any type of multiple regression requires some heavy-duty matrix algebra. This is given in Bowerman and O’Connell (1990).
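To make the matrix algebra concrete, here is a minimal Python sketch of the standard leverage formula x₀ᵀ(XᵀX)⁻¹x₀, where X is the table of observed inputs with a leading column of 1s for the intercept. The function names and the six data rows are hypothetical and chosen only for illustration; they are not taken from the article's data set.

```python
import math

def solve_linear_system(a, b):
    """Solve a*z = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [list(row) + [rhs] for row, rhs in zip(a, b)]  # augmented copy
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    z = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        tail = sum(m[r][c] * z[c] for c in range(r + 1, n))
        z[r] = (m[r][n] - tail) / m[r][r]
    return z

def distance_value(rows, new_point):
    """Leverage x0' (X'X)^-1 x0, with an intercept column of 1s added."""
    design = [[1.0] + list(r) for r in rows]
    x0 = [1.0] + list(new_point)
    p = len(x0)
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(p)]
           for i in range(p)]
    z = solve_linear_system(xtx, x0)  # z = (X'X)^-1 x0
    return sum(u * v for u, v in zip(x0, z))

def prediction_error(std_error_regression, dist):
    """Standard Error of the Regression * SQRT(1 + distance value)."""
    return std_error_regression * math.sqrt(1.0 + dist)

# Hypothetical observations of (x1, x2); both columns average 3.5.
observations = [(1, 2), (2, 1), (3, 4), (4, 3), (5, 6), (6, 5)]
h_center = distance_value(observations, (3.5, 3.5))  # at the data center
h_far = distance_value(observations, (6.0, 1.0))     # far from the center
print(h_center, h_far, prediction_error(10.0, h_center))
```

At the center of the data the distance value equals 1/n, its smallest possible value, and it grows as the new point moves away from the center. That is why the Prediction Error is always larger than the Standard Error of the Regression.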

Some software packages such as Minitab perform the internal calculations to produce an exact Prediction Error for a given Alpha. Excel does not. Fortunately there is an easy substitution that provides a fairly accurate estimate of Prediction Interval. The following fact enables this:

The Prediction Error for a point estimate of Y is always slightly larger than the Standard Error of the Regression Equation shown in the Excel regression output directly under Adjusted R Square.

The Standard Error (highlighted in yellow in the Excel regression output) is used to calculate a confidence interval about the mean Y value. The Prediction Error is used to create a confidence interval about a predicted Y value. There will always be slightly more uncertainty in predicting an individual Y value than in estimating the mean Y value.

Prediction Interval Estimate Formula

The Prediction Error is always slightly bigger than the Standard Error of a Regression. The Prediction Error can be estimated with reasonable accuracy by the following formula:

Prediction Error_est = P.E._est

P.E._est = (Standard Error of the Regression)* 1.1

Prediction Interval_est = Y_est ± t-Value_α/2 * P.E._est

Prediction Interval_est = Y_est ± t-Value_α/2 * (Standard Error of the Regression)* 1.1

Prediction Interval_est = Y_est ± TINV(α, df_Residual) * (Standard Error of the Regression)* 1.1

The t-value must be calculated using the degrees of freedom, df, of the Residual, which is read from the Excel regression output (highlighted in yellow there). This example takes it as n – 2.

df_Residual = n – 2 = 20 – 2 = 18

t-Value_α/2,df=n-2 = TINV(0.05,20-2)

t-Value_α/2,df=n-2 = TINV(0.05,18) = 2.1009

In Excel 2010 and later, TINV(α, df) can be replaced by T.INV(1-α/2, df) or by T.INV.2T(α, df).
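For readers who want to check the t-value outside Excel, the following Python sketch reproduces the two-tailed critical value by integrating the Student t density numerically and searching for the point where the cumulative probability reaches 1 - α/2. It is an illustration only: the function names are hypothetical, and the Simpson's-rule integration and bisection search are one simple way to do this, not what Excel does internally.

```python
import math

def t_pdf(x, df):
    """Density of the Student t distribution with df degrees of freedom."""
    log_coeff = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
                 - 0.5 * math.log(df * math.pi))
    return math.exp(log_coeff) * (1 + x * x / df) ** (-(df + 1) / 2)

def t_cdf(x, df, steps=2000):
    """P(T <= x) for x >= 0, using Simpson's rule on [0, x]."""
    if x == 0:
        return 0.5
    h = x / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(x, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 + total * h / 3

def two_tailed_t(alpha, df):
    """Critical value t with P(|T| > t) = alpha, like TINV(alpha, df)."""
    target = 1 - alpha / 2
    lo, hi = 0.0, 100.0
    for _ in range(60):  # bisection on the monotone CDF
        mid = (lo + hi) / 2
        if t_cdf(mid, df) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

t_value = two_tailed_t(0.05, 18)
print(round(t_value, 4))
```

The result agrees with the 2.1009 shown above for α = 0.05 and 18 degrees of freedom.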

Example in Excel of Estimating the Prediction Interval

Create a 95 percent prediction interval about the estimated value of Y if a company had 10,000 production machines and added 500 new employees in the last 5 years.

In this case the company’s annual power consumption would be predicted as follows:

Y_est = Annual Power Consumption (kW) = 37,123,164 + 10.234 (Number of Production Machines X 1,000) + 3.573 (New Employees Added in Last 5 Years X 1,000)

Y_est = Annual Power Consumption (kW) = 37,123,164 + 10.234 (10,000 X 1,000) + 3.573 (500 X 1,000)

Y_est = Estimated Annual Power Consumption = 49,143,690 kW

Y_est = 49,143,690

Prediction Interval_est = Y_est ± TINV(α, df_Residual) * (Standard Error of the Regression)* 1.1

In Excel 2010 and later TINV(α, df) can be replaced by T.INV(1-α/2,df)

The Standard Error of the Regression is found to be 21,502,161 in the Excel regression output as follows:


Prediction Interval_est = 49,143,690 ± TINV(0.05, 18) * (21,502,161)* 1.1

Prediction Interval_est = [49,143,690 ± 49,691,800 ]

Prediction Interval_est = [ -548,110, 98,835,490 ]

This is a relatively wide Prediction Interval that results from a large Standard Error of the Regression (21,502,161).
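The arithmetic of this example can be rechecked with a short Python sketch. It takes Y_est and the Standard Error of the Regression exactly as stated in this example, uses the t-value TINV(0.05, 18) carried to six decimal places (2.100922), and applies the 1.1 multiplier from the estimate formula. The function name is hypothetical.

```python
def estimated_prediction_interval(y_est, std_error, t_value, inflation=1.1):
    """Y_est +/- t * (Standard Error of the Regression) * 1.1."""
    margin = t_value * std_error * inflation
    return y_est - margin, y_est + margin

y_est = 49_143_690       # estimated annual power consumption (kW)
std_error = 21_502_161   # Standard Error of the Regression from Excel
t_value = 2.100922       # TINV(0.05, 18), more digits than 2.1009

lower, upper = estimated_prediction_interval(y_est, std_error, t_value)
margin = (upper - lower) / 2
print(f"margin = {margin:,.0f}")
print(f"interval = [{lower:,.0f}, {upper:,.0f}]")
```

Because the margin is roughly as large as Y_est itself, the lower bound falls just below zero, which is another sign of how imprecise a prediction from this regression is.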

It is very important to note that a regression equation should never be extrapolated outside the range of the original data set used to create the regression equation. The inputs for a regression prediction should not be outside of the following ranges of the original data set:

Number of machines: 442 to 28,345

New employees added in last 5 years: -1,460 to 7,030
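A simple guard against extrapolation is to test each input against these ranges before using the regression equation. The Python sketch below does that; the dictionary keys and the function name are hypothetical. Note that this checks each variable separately, so it is a necessary check rather than a guarantee that the combination of inputs lies close to the observed data.

```python
# Observed ranges of the original data set, as listed above.
OBSERVED_RANGES = {
    "machines": (442, 28_345),
    "new_employees": (-1_460, 7_030),
}

def out_of_range_inputs(inputs, ranges=OBSERVED_RANGES):
    """Return the names of inputs that fall outside the observed ranges."""
    return [name for name, value in inputs.items()
            if not ranges[name][0] <= value <= ranges[name][1]]

# The example above: 10,000 machines and 500 new employees.
example = {"machines": 10_000, "new_employees": 500}
print(out_of_range_inputs(example))

# A hypothetical input that would require extrapolation.
too_many = {"machines": 30_000, "new_employees": 500}
print(out_of_range_inputs(too_many))
```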


