STATISTICAL DATA
ANALYSIS IN EXCEL
Lecture 4
Analysis of Variance (ANOVA)
dr. Petr Nazarov
petr.nazarov@crp-sante.lu
31-10-2011
Statistical data analysis in Excel.
4. ANOVA
INTRODUCTION TO ANOVA
Why ANOVA?
Number of pairwise comparisons for 5 groups:
$\frac{5!}{2!\,3!} = 10$
Probability of an error:
$1 - (0.95)^{10} \approx 0.4$
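The arithmetic on this slide can be checked with a short Python sketch. It assumes 5 groups compared pairwise, each test run independently at a 5% significance level; the variable names are illustrative only.

```python
from math import comb

n_groups = 5   # assumed number of groups being compared
alpha = 0.05   # assumed significance level of each single test

# Number of pairwise tests: 5! / (2! 3!)
n_pairs = comb(n_groups, 2)

# Chance of at least one false positive across all tests,
# assuming the tests are independent
p_any_error = 1 - (1 - alpha) ** n_pairs

print(n_pairs, round(p_any_error, 2))
```

With ten separate tests the chance of at least one false alarm is roughly 40%, which is the motivation for a single ANOVA test instead.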
ANOVA
example from Partek
http://easylink.playstream.com/affymetrix/ambsymposium/partek_08.wvx
INTRODUCTION TO ANOVA
Example from Case Problem 3
As part of a long-term study of individuals 65 years of age or older, sociologists and
physicians at the Wentworth Medical Center in upstate New York investigated the relationship
between geographic location and depression. A sample of 60 individuals, all in reasonably
good health, was selected; 20 individuals were residents of Florida, 20 were residents of New
York, and 20 were residents of North Carolina. Each of the individuals sampled was given a
standardized test to measure depression. The data collected follow; higher test scores
indicate higher levels of depression.
Q: Is the depression level the same in all 3 locations?
depression.xls
1. Good health respondents
Florida   New York   N. Carolina
3         8          10
7         11         7
7         9          3
3         7          5
8         8          11
8         7          8
$H_0: \mu_1 = \mu_2 = \mu_3$
$H_a$: not all 3 means are equal
INTRODUCTION TO ANOVA
Meaning
$H_0: \mu_1 = \mu_2 = \mu_3$
$H_a$: not all 3 means are equal
[Figure: depression level (y-axis, 0 to 14) for each measure (x-axis), grouped by location (FL, NY, NC), with the group means $\mu_1$, $\mu_2$, $\mu_3$ marked.]
SINGLE-FACTOR ANOVA
Example
[Figure: depression level (y-axis, 0 to 14) for each measure (x-axis), grouped by location (FL, NY, NC), with the group means $\mu_1$, $\mu_2$, $\mu_3$ marked.]
SINGLE-FACTOR ANOVA
Example
ANOVA table
A table used to summarize the analysis of variance computations and results. It contains
columns showing the source of variation, the sum of squares, the degrees of freedom, the
mean square, and the F value(s).
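A minimal sketch of how the columns of an ANOVA table are computed, written in plain Python. The helper `one_way_anova` is hypothetical, and the input is only the six scores per location listed on the Case Problem 3 slide (assuming the values read row by row across the three columns), so the numbers will not match the Excel output for the full sample of 20 per location. The P-value and F crit columns are left out because they require the F distribution.

```python
# Excerpt of the data: six scores per location, as listed on the slide
groups = {
    "Florida": [3, 7, 7, 3, 8, 8],
    "New York": [8, 11, 9, 7, 8, 7],
    "N. Carolina": [10, 7, 3, 5, 11, 8],
}


def one_way_anova(samples):
    """Hypothetical helper: SS, df, MS and F for a list of groups."""
    values = [x for group in samples for x in group]
    grand_mean = sum(values) / len(values)
    means = [sum(group) / len(group) for group in samples]

    # Between groups: spread of the group means around the grand mean
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(samples, means))
    # Within groups: spread of each score around its own group mean
    ss_within = sum((x - m) ** 2 for g, m in zip(samples, means) for x in g)

    df_between = len(samples) - 1
    df_within = len(values) - len(samples)
    ms_between = ss_between / df_between
    ms_within = ss_within / df_within
    return {
        "SS": (ss_between, ss_within, ss_between + ss_within),
        "df": (df_between, df_within, df_between + df_within),
        "MS": (ms_between, ms_within),
        "F": ms_between / ms_within,
    }


table = one_way_anova(list(groups.values()))
for column, entries in table.items():
    print(column, entries)
```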
In Excel use:
Tools → Data Analysis → Anova: Single Factor
depression.xls
ANOVA
Source of Variation   SS         df   MS         F          P-value    F crit
Between Groups        78.53333   2    39.26667   6.773188   0.002296   3.158843
Within Groups         330.45     57   5.797368
Total                 408.9833   59
SSE: the Within Groups sum of squares (330.45)
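The Excel output can be read as a chain of simple divisions followed by a decision. The sketch below recomputes MS and F from the SS and df values reported by Excel and then applies the usual rule (reject H0 when F exceeds F crit, equivalently when the P-value is below the significance level). The level of 0.05 is an assumption here, taken as Excel's default.

```python
# Values copied from the Excel single-factor ANOVA output
ss_between, df_between = 78.53333, 2
ss_within, df_within = 330.45, 57
f_crit, p_value = 3.158843, 0.002296
alpha = 0.05  # assumed significance level (Excel's default)

ms_between = ss_between / df_between  # mean square between groups
ms_within = ss_within / df_within     # mean square within groups
f_stat = ms_between / ms_within       # F = MS between / MS within

# Reject H0 (all means equal) when F is beyond the critical value
reject_h0 = f_stat > f_crit and p_value < alpha

print(ms_between, ms_within, f_stat, reject_h0)
```

Under these assumptions H0 is rejected, so the mean depression level is not the same in all three locations.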
MULTI-FACTOR ANOVA
Factors and Treatments
Factor
Another word for the independent variable of interest.
Factorial experiment
An experimental design that allows statistical conclusions about two or more factors.
Treatments
Different levels of a factor.
depression.xls
Factor 1: Health (good health, bad health)
Factor 2: Location (Florida, New York, North Carolina)
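As a small illustration, the cells of this factorial experiment are all combinations of the levels of the two factors. The sketch below only enumerates them; the list names are illustrative.

```python
from itertools import product

health = ["good health", "bad health"]                # levels of Factor 1
location = ["Florida", "New York", "North Carolina"]  # levels of Factor 2

# Every combination of one health level with one location
cells = list(product(health, location))

for h, loc in cells:
    print(f"{h} / {loc}")
print(len(cells), "cells")
```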
MULTI-FACTOR ANOVA
2-factor ANOVA with r Replicates: Example
depression.xls
1. Reorder the data into a format that Excel can read
Good health
bad health
Florida
3
7
7
3
7
3
13
12
17
17
7
8
14
9
15
12
8
11
10
12
15
18
11
17
13
11
13
11
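A sketch of the reordering step, assuming the layout used by Excel's two-factor tool with replication: one column per level of the column factor (Location) and a block of r consecutive rows per level of the row factor (Health). The helper `to_excel_layout` is hypothetical, and the scores are placeholders for illustration only, not the study data.

```python
def to_excel_layout(scores, row_levels, col_levels):
    """Hypothetical helper: arrange {(row, col): [replicates]} as a grid.

    The first grid row holds the column labels. Each row level fills
    r consecutive rows, with its label in the first column of the block.
    """
    grid = [[""] + list(col_levels)]
    for row in row_levels:
        r = len(scores[(row, col_levels[0])])  # replicates per cell
        for i in range(r):
            label = row if i == 0 else ""
            grid.append([label] + [scores[(row, col)][i] for col in col_levels])
    return grid


# Placeholder values, NOT the study data (r = 2 replicates per cell)
scores = {
    ("Good health", "Florida"): [1, 2],
    ("Good health", "New York"): [3, 4],
    ("Good health", "North Carolina"): [5, 6],
    ("Bad health", "Florida"): [7, 8],
    ("Bad health", "New York"): [9, 10],
    ("Bad health", "North Carolina"): [11, 12],
}

grid = to_excel_layout(
    scores,
    ["Good health", "Bad health"],
    ["Florida", "New York", "North Carolina"],
)
for line in grid:
    print(line)
```

The number of rows per block is the value that Excel asks for as "Rows per sample".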
Factor 1: Health
Factor 2: Location
MULTI-FACTOR ANOVA
2-factor ANOVA with r Replicates: Example
ANOVA
Source of Variation   SS         df    MS         F          P-value    F crit
Sample (Health)       1748.033   1     1748.033   203.094    4.4E-27    3.92433
Columns (Location)    73.85      2     36.925     4.290104   0.015981   3.075853
Interaction           26.11667   2     13.05833   1.517173   0.223726   3.075853
Within (Error)        981.2      114   8.607018
Total                 2829.2     119
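The degrees of freedom and F values in this output can be reproduced from the design. The sketch below assumes a = 2 health levels, b = 3 locations and r = 20 replicates per cell (inferred from 20 respondents per location in each health group), and uses the SS and F crit values reported by Excel.

```python
a, b, r = 2, 3, 20  # health levels, locations, replicates per cell (assumed)

df = {
    "Sample": a - 1,                   # Health
    "Columns": b - 1,                  # Location
    "Interaction": (a - 1) * (b - 1),
    "Within": a * b * (r - 1),         # Error
}
df_total = a * b * r - 1

# SS and F crit copied from the Excel two-factor output
ss = {"Sample": 1748.033, "Columns": 73.85, "Interaction": 26.11667, "Within": 981.2}
f_crit = {"Sample": 3.92433, "Columns": 3.075853, "Interaction": 3.075853}

ms = {source: ss[source] / df[source] for source in ss}
# Each effect is tested against the within-cell (error) mean square
f_stat = {source: ms[source] / ms["Within"] for source in f_crit}
significant = {source: f_stat[source] > f_crit[source] for source in f_crit}

for source in f_crit:
    print(source, df[source], f_stat[source], significant[source])
```

Under these assumptions both Health and Location have a significant effect, while the Interaction does not exceed its critical value.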
[Figure: two charts comparing the Health, Location, Interaction, and Error components of the ANOVA table.]
QUESTIONS?