Professional Documents
Culture Documents
Correlation
Chapter 13
2
R
Regression
i A Analysis
l i - Introduction
I t d ti
3
R
Regression
i A Analysis
l i - Uses
U
Some examples.
z Is there a relationship
p between the amount Healthtex
spends per month on advertising and its sales in the
month?
z Can we base an estimate of the cost to heat a home
in January on the number of square feet in the
home?
z Is there a relationship between the miles per gallon
achieved by large pickup trucks and the size of the
engine?
z Is there a relationship between the number of hours
that students studied for an exam and the score
earned?
4
Correlation Analysis
5
R
Regression
i E Example
l
6
S tt Di
Scatter Diagram
7
Correlation, r
The Coefficient of Correlation
8
P f t Correlation
Perfect C l ti
9
Mi it b Scatter
Minitab S tt Pl Plots
t
10
Correlation Coefficient - Interpretation
11
C
Correlation
l ti Coefficient
C ffi i t - Formula
F l
12
Coefficient of Determination
14
C
Correlation
l ti Coefficient
C ffi i t - Example
E l
15
Correlation Coefficient – Excel Example
16
C
Correlation
l ti Coefficient
C ffi i t - Example
E l
However, does this mean that more sales calls cause more sales?
No, we have not demonstrated cause and effect here, only that the
two variables—sales calls and copiers sold—are related.
17
Coefficient of Determination (r2) - Example
•This
This is a proportion or a percent; we can say that
57.6 percent of the variation in the number of
copiers sold is explained, or accounted for, by the
variation in the number of sales calls.
18
Testing
g the Significance
g of
the Correlation Coefficient
19
Testing the Significance of
the Correlation Coefficient - Example
20
Testing the Significance of
the Correlation Coefficient - Example
The computed t (3.297) is within the rejection region, therefore, we will reject H0. This means
tthe
e correlation
co e at o in tthe
e popu
population
at o is
s not
ot zero.
e o From
o ap practical
act ca sta
standpoint,
dpo t, itt indicates
d cates to tthe
e
sales manager that there is correlation with respect to the number of sales calls made
and the number of copiers sold in the population of salespeople.
21
Mi it b
Minitab
22
Li
Linear Regression
R i Model
M d l
23
C
Computing
ti ththe Slope
Sl off the
th Line
Li
24
C
Computing
ti ththe Y-Intercept
YI t t
25
Regression Analysis
26
Regression Analysis – Least Squares
Principle
27
Illustration of the Least Squares
Regression Principle
28
R
Regression
i E Equation
ti - Example
E l
29
Finding the Regression Equation - Example
32
The Standard Error of Estimate
ΣY 2 − aΣY − bΣXY
^
Σ (Y − Y ) 2
s y.x = s y. x =
n−2 n−2
33
Standard Error of the Estimate - Example
Y = 18.9476 + 1.1842 X Σ (Y − Y ) 2
s y.x =
n−2
784 .211
= = 9.901
10 − 2
34
Graphical Illustration of the Differences between Actual ^
Y – Estimated Y (Y − Y )
35
Standard Error of the Estimate - Excel
36
Assumptions
p Underlying
y g Linear
Regression
For each value of X, there is a group of Y values, and these
z Y values are normally distributed. The means of these normal
distributions of Y values all lie on the straight line of regression.
z The standard deviations of these normal distributions are equal.
z The Y values are statistically independent
independent. This means that in
the selection of a sample, the Y values chosen for a particular X
value do not depend on the Y values for any other X values.
37
Confidence Interval and Prediction
Interval Estimates of Y
38
Confidence Interval Estimate - Example
39
Confidence Interval Estimate - Example
40
Y = 48.5526
Confidence Interval Estimate - Example
42
Confidence Interval Estimate - Example
Thus, the 95 percent confidence interval for the average sales of all
sales representatives who make 25 calls is from 40.9170 up to
56.1882 copiers.
43
Prediction Interval Estimate - Example
44
Prediction Interval Estimate - Example
45
Y = 48.5526
Prediction Interval Estimate - Example
If Sh
Sheila
il BBaker
k makes
k 25
2 sales
l calls,
ll the
h number
b off copiers
i she
h
will sell will be between about 24 and 73 copiers.
46
Confidence and Prediction Intervals –
Minitab Illustration
47
T
Transforming
f i Data
D t
48
T
Transforming
f i Data
D t - Example
E l
49
S tt
Scatterplot
l t off Golf
G lf Data
D t
50
What can we do to explore other (nonlinear)
relationships?
One possibility is to transform one of the
example, instead of using Y as
variables For example
variables.
the dependent variable, we might use its log,
reciprocal,
p , square,
q , or square
q root. Another
possibility is to transform the independent
variable in the same way. There are other
transformations, but these are the most
common.
51
T
Transforming
f i Data
D t - Example
E l
53
Linear Regression Using the
Transformed Y
54
Using the Transformed Equation for
Estimation
56