Professional Documents
Culture Documents
ANALYSIS OF VARIANCE
ANOVA
Exemplu
Determinam cum recolta este influentata de tipul
de ingrasamint folosit. Un fermier foloseste 3 tipuri
de ingrasamint notate A,B and C
Variabila raspuns - productia
factorul - tipul de ingrasamint
tratamentul - ingrasamintul A, B and C
TERMINOLOGIE
Exemplu 2
Analizam cum pretul actiunilor este
determinat de rata dobinzii pe care o ofera.
Studiem obligatiuni care platesc rate de
6%, 8%,10%
Variabila raspuns - pretul actiunii
factorul - rata dobinzii
tratamentul - 6%,8% sau10%
MODELE ANOVA
n1
n2
n2
. . . . . . . .
. . . . . . . .
. . . . . . . .
nk
Xk nk y k 1 yk 2
y knk Yk N ( k , k2 ) y ki
yk i 1
nk
Volumul
eantionului
n n1 n2 ... nk
Ipoteze in ANOVA
Media total a populaiei va fi estimat prin media total a
eantionului: k n n
y
k
yn
k
ij i i
i 1 j 1
y i
i 1
n n
Setul de ipoteze H 0 : 1 2 ... k
H A : non H 0 (cel puin dou medii snt neegale)
Dac ipoteza nul este acceptat, atunci putem concluziona
c factorul de grupare nu are o influen semnficativ
asupra variabilei de interes.
Ideea de baz n testarea ipotezelor ANOVA este regula de
adunare a dispersiilor, descompunerea dispersiei totale n
dispersia dintre grupe(factorul sistematic) i dispersia din
interiorul grupelor(factorul aleator).
Tabelul de analiz a varianei
ANOVA Table
Source of SS df MS F
Variance
Between Groups
MST
k SST
(Factorul
sistematic) SST ni ( yi y )2 k-1 MST
k 1 MSE
i 1
Within Groups
k ni SSE
(Factorul
SSE ( yij yi ) 2 n-k MSE
aleator)
i 1 j 1
nk
k ni
Total SStotal ( yij y )2 n-1
i 1 j 1
Testul F(Fischer)
Decizia se ia pe baza testului F: se compar valoarea
statisticii F calculat n tabelul ANOVA cu valoarea critic,
corespunztoare cuantilei repartiiei F cu (k-1,n-k) grade de
libertate.
Dac F F ; k 1; n k atunci respingem ipoteza nul,
deci putem afirma, cu probabilitatea 1 , c factorul de
grupare are o influen semnificativ asupra variabilei de
interes.
Valoarea critic n EXCEL: F ;k 1;n k FINV ( , k 1, n k )
Comparaii multiple Procedura Tukey-Kramer
MSE 1 1
DC QU
2 ni n j
unde QU este cuantila superioar a distribuiei
studentizate a distanei (Studentized range
distribution) cu k grade de libertate la numrtor
i n-k grade de libertate la numitor.
Procedura Tukey-Kramer
Variaia total
SST=
Variaia dintre
grupuri
SSA
+ Variaia aleatoare
Variaia dintre
blocuri
SSBL + SSE
Sum of Squares for Blocking
SST = SSA + SSBL + SSE
r
SSBL c (Yi. Y) 2
i 1
Where:
c = number of groups
r = number of blocks
Yi. = mean of all values in block i
Y = grand mean (mean of all data values)
Partitioning the Variation
Total variation can now be split into three
parts:
SSBL
MSBL Mean square blocking
r 1
SSA
MSA Mean square among groups
c 1
SSE
MSE Mean square error
(r 1)(c 1)
Randomized Block ANOVA Table
Source of SS df MS F ratio
Variation
Among MSA
Treatments SSA c-1 MSA
MSE
Among SSBL r-1 MSBL MSBL
Blocks
MSE
Error SSE (r1)(c-1) MSE
Total SST rc - 1
c = number of populations rc = sum of the sample sizes from all populations
r = number of blocks df = degrees of freedom
Blocking Test
H0 : 1. 2. 3. ...
H1 : Not all block means are equal
MSBL
F=
MSE
Blocking test: df1 = r 1
df2 = (r 1)(c 1)
Reject H0 if F > FU
Main Factor Test
H0 : .1 .2 .3 ... .c
H1 : Not all population means are equal
MSA
F=
MSE Main Factor test: df1 = c 1
df2 = (r 1)(c 1)
Reject H0 if F > FU
The Tukey Procedure
1= 2 3 x
The Tukey Procedure
(continued)
MSE
Critical Range Qu
r
Compare:
Is x.j x.j' Critical Range ? x .1 x .2
If the absolute mean difference x .1 x .3
is greater than the critical range
then there is a significant x .2 x .3
difference between that pair of
means at the chosen level of etc...
significance.
Exemplu
6 experi n gastronomie trebuie s evalueze 4
restaurante n privina calitii serviciilor
Experii aloc fiecrui restaurant un punctaj de
la 1 la 100
Se poate afirma c exist o diferen
semnificativ ntre cele patru restaurante n
ceea ce privete punctajele acordate?
Exist vreo diferen n ceea ce privete
modalitatea de punctare a celor 6 experi?
Cum realizm ANOVA folosind EXCEL
SST SSB c1
Factor B Variation
Total Variation
SSAB
Variation due to interaction (r 1)(c 1)
between A and B
n-1
SSE rc(n 1)
Random variation (Error)
Two Factor ANOVA Equations
Total Variation: r c n
SST ( Xijk X) 2
i1 j1 k 1
Factor A Variation: r
SSA cn ( Xi.. X)
2
i1
Factor B Variation:
c
SSB rn ( X. j. X)
2
j1
Two Factor ANOVA Equations
(continued)
Interaction Variation:
r c
SSAB n ( Xij. Xi.. X.j. X)2
i1 j1
where: X
i1 j1 k 1
ijk
X Grand Mean
c n
rcn
X
j1 k 1
ijk
X ijk
X. j. i1 k 1
Mean of jth level of factor B (j 1, 2, ..., c)
rn
n
Xijk
Xij.
r = number of levels of factor A
Mean of cell ij
k 1 n
c = number of levels of factor B
n = number of replications in each cell
Mean Square Calculations
SSA
MSA Mean square factor A
r 1
SSB
MSB Mean square factor B
c 1
SSAB
MSAB Mean square interactio n
(r 1)(c 1)
SSE
MSE Mean square error
rc(n'1)
Two-Way ANOVA:
The F Test Statistic
F Test for Factor A Effect
H0: 1.. = 2.. = 3.. =
MSA Reject H0
H1: Not all i.. are equal F
MSE if F > FU
Factor B Level 1
Mean Response
Mean Response
Factor B Level 1
Factor B Level 3
Factor B Level 2
Factor B Level 2
Factor B Level 3