Anova 2

ANOVA
February 12, 2015
ANOVA models
Last time, we discussed the use of categorical variables in multivariate regression. Often, these are encoded
as indicator columns in the design matrix.
In [1]: %%R
url = http://stats191.stanford.edu/data/salary.table
salary.table = read.table(url, header=T)
salary.table$E = factor(salary.table$E)
salary.table$M = factor(salary.table$M)
salary.lm = lm(S ~ X + E + M, salary.table)
head(model.matrix(salary.lm))
1
2
3
4
5
6
(Intercept)
1
1
1
1
1
1
X E2 E3 M1
1 0 0 1
1 0 1 0
1 0 1 1
1 1 0 0
1 0 1 0
2 1 0 1
Often, especially in experimental settings, we record only categorical variables.

Such models are often referred to ANOVA (Analysis of Variance) models.
These are generalizations of our favorite example, the two sample t-test.
1.1
Example: recovery time
Suppose we want to understand the relationship between recovery time after surgery based on an
patients prior fitness.
We group patients into three fitness levels: below average, average, above average.
If you are in better shape before surgery, does it take less time to recover?
In [2]: %%R
url = http://stats191.stanford.edu/data/rehab.csv
rehab.table = read.table(url, header=T, sep=,)
rehab.table$Fitness <- factor(rehab.table$Fitness)
head(rehab.table)
1
2
3
4
5
6
Fitness Time
1
29
1
42
1
38
1
40
1
43
1
40
In [3]: %%R -h 800 -w 800

attach(rehab.table)
boxplot(Time ~ Fitness, col=c(red,green,blue))
1.2
One-way ANOVA
First generalization of two sample t-test: more than two groups.

Observations are broken up into r groups with ni , 1 i r observations per group.
Model:
Yij = + i + ij ,
ij N (0, 2 ).
Pr
Constraint:
i=1 i = 0. This constraint is needed for identifiability. This is equivalent to only
adding r 1 columns to the design matrix for this qualitative variable.
This is not the same parameterization we get when only adding r 1 0-1 columns, but it gives the
same model.
The estimates of can be obtained from the estimates of using Rs default parameters.
For a more detailed exploration into Rs creation of design matrices, try reading the following tutorial
on design matrices.
1.3
Remember, its still a model (i.e. a plane)
1.4
Fitting the model
Model is easy to fit:

ni
1 X
Yij = Y i .
Ybij =
ni j=1
If observation is in i-th group: predicted mean is just the sample mean of observations in i-th group.
Simplest question: is there any group (main) effect?
H0 : 1 = = r = 0?
Test is based on F -test with full model vs. reduced model. Reduced model just has an intercept.
Other questions: is the effect the same in groups 1 and 2?
H0 : 1 = 2 ?
In [4]: %%R
rehab.lm <- lm(Time ~ Fitness)
summary(rehab.lm)
Call:
lm(formula = Time ~ Fitness)
Residuals:
Min
1Q Median
-9.0
-3.0
-0.5
3Q
3.0
Max
8.0
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept)
38.000
1.574 24.149 < 2e-16 ***
Fitness2
-6.000
2.111 -2.842 0.00976 **
Fitness3
-14.000
2.404 -5.824 8.81e-06 ***
--3
Signif. codes:
0 *** 0.001 ** 0.01 * 0.05 . 0.1 1
Residual standard error: 4.451 on 21 degrees of freedom

Multiple R-squared: 0.6176,Adjusted R-squared: 0.5812
F-statistic: 16.96 on 2 and 21 DF, p-value: 4.129e-05
In [5]: %%R
print(predict(rehab.lm, list(Fitness=factor(c(1,2,3)))))
c(mean(Time[Fitness == 1]), mean(Time[Fitness == 2]), mean(Time[Fitness == 3]))
1 2 3
38 32 24
[1] 38 32 24
Recall that the rows of the Coefficients table above do not correspond to the parameter. For one
thing, we would see three s and their sum would have to be equal to 0.
Also, the design matrix is the indicator coding we saw last time.
In [6]: %%R
head(model.matrix(rehab.lm))
1
2
3
4
5
6
(Intercept) Fitness2 Fitness3

1
0
0
1
0
0
1
0
0
1
0
0
1
0
0
1
0
0
There are ways to get different design matrices by using the contrasts argument. This is a bit above
our pay grade at the moment.
Upon inspection of the design matrix above, we see that the (Intercept) coefficient corresponds to the
mean in Fitness==1, while Fitness==2 coefficient corresponds to the difference between the groups
Fitness==2 and Fitness==1.
1.5
ANOVA table
Much of the information in an ANOVA model is contained in the ANOVA table.

In [8]: make_table(anova_oneway)
apply_theme(basic)
Out[8]: <ipy table.IpyTable at 0x107d8c250>
In [9]: %%R
anova(rehab.lm)
Analysis of Variance Table

Response: Time
Df Sum Sq Mean Sq F value
Pr(>F)
Fitness
2
672 336.00 16.962 4.129e-05 ***
Residuals 21
416
19.81
--Signif. codes: 0 *** 0.001 ** 0.01 * 0.05 . 0.1 1
Note that M ST R measures variability of the cell means. If there is a group effect we expect this
to be large relative to M SE.
We see that under H0 : 1 = = r = 0, the expected value of M ST R and M SE is 2 . This tells
us how to test H0 using ratio of mean squares, i.e. an F test.
1.6
Testing for any main effect
Rows in the ANOVA table are, in general, independent.

Therefore, under H0
F =
M ST R
=
M SE
SST R
dfT R
SSE
dfE
FdfT R ,dfE
the degrees of freedom come from the df column in previous table.

Reject H0 at level if F > F1,dfT R ,dfE .
In [10]: %%R
F = 336.00 / 19.81
pval = 1 - pf(F, 2, 21)
print(data.frame(F,pval))
F
pval
1 16.96113 4.129945e-05
1.7
Inference for linear combinations
Suppose we want to infer something about

r
X
ai i
i=1
where i = + i is the mean in the i-th group. For example:

H0 : 1 2 = 0
(same as H0 : 1 2 = 0)?
For example:
Is there a difference between below average and average groups in terms of rehab time?
We need to know
Var
r
X
!
ai Y i
i=1
r
X
a2
i
i=1
ni
After this, the usual confidence intervals and t-tests apply.

In [11]: %%R
head(model.matrix(rehab.lm))
1
2
3
4
5
6
(Intercept) Fitness2 Fitness3

1
0
0
1
0
0
1
0
0
1
0
0
1
0
0
1
0
0
This means that the coefficient Fitness2 is the estimated difference between the two groups.
In [12]: %%R
detach(rehab.table)
1.8
Two-way ANOVA
Often, we will have more than one variable we are changing.

1.8.1
Example
After kidney failure, we suppose that the time of stay in hospital depends on weight gain between treatments
and duration of treatment.
We will model the log number of days as a function of the other two factors.
In [14]: make_table(desc)
apply_theme(basic)
Out[14]: <ipy table.IpyTable at 0x107d8cd90>
In [15]: %%R
url = http://statweb.stanford.edu/~jtaylo/stats191/data/kidney.table
kidney.table = read.table(url, header=T)
kidney.table$D = factor(kidney.table$Duration)
kidney.table$W = factor(kidney.table$Weight)
kidney.table$logDays = log(kidney.table$Days + 1)
attach(kidney.table)
head(kidney.table)
1
2
3
4
5
6
Days Duration Weight ID D W

logDays
0
1
1 1 1 1 0.0000000
2
1
1 2 1 1 1.0986123
1
1
1 3 1 1 0.6931472
3
1
1 4 1 1 1.3862944
0
1
1 5 1 1 0.0000000
2
1
1 6 1 1 1.0986123
1.8.2
Two-way ANOVA model
Second generalization of t-test: more than one grouping variable.

Two-way ANOVA model:
r groups in first factor
m groups in second factor
nij in each combination of factor variables.
Model:
Yijk = + i + j + ()ij + ijk ,
ijk N (0, 2 ).
In kidney example, r = 3 (weight gain), m = 2 (duration of treatment), nij = 10 for all (i, j).
1.8.3
Questions of interest
Two-way ANOVA: main questions of interest

Are there main effects for the grouping variables?
H0 : 1 = = r = 0,
H0 : 1 = = m = 0.
Are there interaction effects:

H0 : ()ij = 0, 1 i r, 1 j m.
1.8.4
Interactions between factors
Weve already seen these interactions in the IT salary example.

An additive model says that the effects of the two factors occur additively such a model has no
interactions.
An interaction is present whenever the additive model does not hold.
1.8.5
Interaction plot
In [16]: %%R -h 800 -w 800

interaction.plot(W, D, logDays, type=b, col=c(red,
blue), lwd=2, pch=c(23,24))
When these broken lines are not parallel, there is evidence of an interaction. The one thing missing from
this plot are errorbars. The above broken lines are clearly not parallel but there is measurement error. If
the error bars were large then we might consider there to be no interaction, otherwise we might.
1.8.6
Parameterization
Many constraints are needed, again for identifiability. Lets not worry too much about the details
Constraints:
Pr
i=1 i = 0
Pm
j=1 j = 0
Pm
j=1 ()ij = 0, 1 i r
Pr
i=1 ()ij = 0, 1 j m.
We should convince ourselves that we know have exactly r m free parameters.
8
1.8.7
Fitting the model
Easy to fit when nij = n (balanced)

n
1X
Yijk .
Ybijk = Y ij =
n
k=1
Inference for combinations
r X
r
m
m
X
2 X X 2
Var
aij Y ij =
a .
n i=1 j=1 ij
i=1 j=1
Usual t-tests, confidence intervals.
In [17]: %%R
kidney.lm = lm(logDays ~ D*W)
summary(kidney.lm)
Call:
lm(formula = logDays ~ D * W)
Residuals:
Min
1Q
-1.33772 -0.51121
Median
0.06302
3Q
0.62926
Max
1.17950
Coefficients:
Estimate Std. Error t value
(Intercept)
1.0212
0.2317
4.407
D2
-0.1042
0.3277 -0.318
W2
0.8439
0.3277
2.575
W3
1.5271
0.3277
4.661
D2:W2
-0.4231
0.4634 -0.913
D2:W3
-0.4491
0.4634 -0.969
--Signif. codes: 0 *** 0.001 ** 0.01
Pr(>|t|)
5.01e-05 ***
0.7517
0.0128 *
2.10e-05 ***
0.3653
0.3368
* 0.05 . 0.1 1

F-statistic: 7.431 on 5 and 54 DF, p-value: 2.301e-05
1.9
Example
Suppose we are interested in comparing the mean in (D = 1, W = 3) and (D = 2, W = 2) groups. The

difference is
E(Y13 Y22 )
By independence, its variance is
2 2
.
Var(Y13 ) + Var(Y22 ) =
n
In [18]: %%R
estimates = predict(kidney.lm, list(D=factor(c(1,2)), W=factor(c(3,2))))
print(estimates)
sigma.hat = 0.7327 # from table above
n = 10 # ten observations per group
fit = estimates[1] - estimates[2]
upper = fit + qt(0.975, 54) * sqrt(2 * sigma.hat^2 / n)
lower = fit - qt(0.975 ,54) * sqrt(2 * sigma.hat^2 / n)
data.frame(fit,lower,upper)
1
2
2.548271 1.337719
fit
lower
upper
1 1.210551 0.5536058 1.867497
In [19]: %%R
head(model.matrix(kidney.lm))
1
2
3
4
5
6
(Intercept) D2 W2 W3 D2:W2 D2:W3

1 0 0 0
0
0
1 0 0 0
0
0
1 0 0 0
0
0
1 0 0 0
0
0
1 0 0 0
0
0
1 0 0 0
0
0
1.9.1
Finding predicted values
The most direct way to compute predicted values is using the predict function
In [20]: %%R
predict(kidney.lm, list(D=factor(1),W=factor(1)), interval=confidence)
fit
lwr
upr
1 1.021156 0.5566306 1.485681
1.9.2
ANOVA table
In the balanced case, everything can again be summarized from the ANOVA table
In [22]:
make_table(anova_twoway)
apply_theme(basic)
1.9.3
Tests using the ANOVA table
Rows of the ANOVA table can be used to test various of the hypotheses we started out with.
For instance, we see that under H0 : ()ij = 0, i, j the expected value of SSAB and SSE is 2
use these for an F -test testing for an interaction.
10
Under H0
F =
M SAB
=
M SE
SSAB
(m1)(r1)
SSE
(n1)mr
F(m1)(r1),(n1)mr
In [23]: %%R
anova(kidney.lm)
Response: logDays
Pr(>F)
D
1 2.3397 2.3397 4.3583
0.04156 *
W
2 16.9713 8.4856 15.8067 3.945e-06 ***
D:W
2 0.6357 0.3178 0.5920
0.55675
Residuals 54 28.9892 0.5368
--Signif. codes: 0 *** 0.001 ** 0.01 * 0.05 . 0.1 1
We can also test for interactions using our usual approach

In [24]: %%R
anova(lm(logDays ~ D + W, kidney.table), kidney.lm)
Model 1:
Model 2:
Res.Df
1
56
2
54
1.9.4
logDays ~ D + W
logDays ~ D * W
RSS Df Sum of Sq
F Pr(>F)
29.625
28.989 2
0.63566 0.592 0.5567
Some caveats about R formulae
While we see that it is straightforward to form the interactions test using our usual anova function approach,
we generally cannot test for main effects by this approach.
In [25]: %%R
lm_no_main_Weight = lm(logDays ~ D + W:D)
anova(lm_no_main_Weight, kidney.lm)
Model 1:
Model 2:
Res.Df
1
54
2
54
logDays ~ D + W:D
logDays ~ D * W
RSS Df Sum of Sq F Pr(>F)
28.989
28.989 0 3.5527e-15
In fact, these models are identical in terms of their planes or their fitted values. What has happened is
that R has formed a different design matrix using its rules for formula objects.
11
In [26]: %%R
lm1 = lm(logDays ~ D + W:D)
lm2 = lm(logDays ~ D + W:D + W)
anova(lm1, lm2)
Model 1:
Model 2:
Res.Df
1
54
2
54
1.10
logDays ~ D + W:D
logDays ~ D + W:D + W
RSS Df Sum of Sq F Pr(>F)
28.989
28.989 0 3.5527e-15
ANOVA tables in general
So far, we have used anova to compare two models. In this section, we produced tables for just 1 model.
This also works for any regression model, though we have to be a little careful about interpretation.
Lets revisit the job aptitude test data from last section.
In [27]: %%R
url = http://stats191.stanford.edu/data/jobtest.table
jobtest.table <- read.table(url, header=T)
jobtest.table$ETHN <- factor(jobtest.table$ETHN)
jobtest.lm = lm(JPERF ~ TEST * ETHN, jobtest.table)
summary(jobtest.lm)
Call:
lm(formula = JPERF ~ TEST * ETHN, data = jobtest.table)
Residuals:
Min
1Q Median
-2.0734 -1.0594 -0.2548
3Q
1.2830
Max
2.1980
Coefficients:
(Intercept)
2.0103
1.0501
1.914
0.0736 .
TEST
1.3134
0.6704
1.959
0.0677 .
ETHN1
-1.9132
1.5403 -1.242
0.2321
TEST:ETHN1
1.9975
0.9544
2.093
0.0527 .
--Signif. codes: 0 *** 0.001 ** 0.01 * 0.05 . 0.1 1
F-statistic: 10.55 on 3 and 16 DF, p-value: 0.0004511
Now, lets look at the anova output. Well see the results dont match.
In [28]: %%R
anova(jobtest.lm)
12

Response: JPERF
Pr(>F)
TEST
1 48.723 48.723 24.6266 0.0001412 ***
ETHN
1 5.247
5.247 2.6519 0.1229524
TEST:ETHN 1 8.666
8.666 4.3802 0.0526501 .
Residuals 16 31.655
1.978
--Signif. codes: 0 *** 0.001 ** 0.01 * 0.05 . 0.1 1
The difference is how the Sum Sq columns is created. In the anova output, terms in the response are
added sequentially.
We can see this by comparing these two models directly. The F statistic doesnt agree because the MSE
above is computed in the fullest model, but the Sum of Sq is correct.
In [29]: %%R
anova(lm(JPERF ~ TEST, jobtest.table), lm(JPERF ~ TEST + ETHN, jobtest.table))
Model 1:
Model 2:
Res.Df
1
18
2
17
JPERF ~ TEST
JPERF ~ TEST + ETHN
RSS Df Sum of Sq
F Pr(>F)
45.568
40.322 1
5.2468 2.2121 0.1552
Similarly, the first Sum Sq in anova can be found by:

In [30]: %%R
anova(lm(JPERF ~ 1, jobtest.table), lm(JPERF ~ TEST, jobtest.table))
Model 1: JPERF ~ 1
Model 2: JPERF ~ TEST
Res.Df
RSS Df Sum of Sq
F
Pr(>F)
1
19 94.291
2
18 45.568 1
48.723 19.246 0.0003555 ***
--Signif. codes: 0 *** 0.001 ** 0.01 * 0.05 . 0.1 1
There are ways to produce an ANOVA table whose p-values agree with summary. This is done by an
ANOVA table that uses Type-III sum of squares.
In [31]: %%R
library(car)
Anova(jobtest.lm, type=3)
13
Anova Table (Type III tests)

Response: JPERF
Sum Sq Df F value
(Intercept) 7.251 1 3.6647
TEST
7.594 1 3.8385
ETHN
3.052 1 1.5427
TEST:ETHN
8.666 1 4.3802
Residuals
31.655 16
--Signif. codes: 0 *** 0.001
Pr(>F)
0.07363 .
0.06775 .
0.23211
0.05265 .
** 0.01 * 0.05 . 0.1 1
In [32]: %%R
summary(jobtest.lm)
Call:
lm(formula = JPERF ~ TEST * ETHN, data = jobtest.table)
Residuals:
Min
1Q Median
-2.0734 -1.0594 -0.2548
3Q
1.2830
Max
2.1980
Coefficients:
(Intercept)
2.0103
1.0501
1.914
0.0736 .
TEST
1.3134
0.6704
1.959
0.0677 .
ETHN1
-1.9132
1.5403 -1.242
0.2321
TEST:ETHN1
1.9975
0.9544
2.093
0.0527 .
--Signif. codes: 0 *** 0.001 ** 0.01 * 0.05 . 0.1 1
F-statistic: 10.55 on 3 and 16 DF, p-value: 0.0004511
Fixed and random effects

In kidney & rehab examples, the categorical variables are well-defined categories: below average fitness,
long duration, etc.
In some designs, the categorical variable is subject.
Simplest example: repeated measures, where more than one (identical) measurement is taken on the
same individual.
In this case, the group effect i is best thought of as random because we only sample a subset of the
entire population.
14
2.0.1
When to use random effects?
A group effect is random if we can think of the levels we observe in that group to be samples from
a larger population.
Example: if collecting data from different medical centers, center might be thought of as random.
Example: if surveying students on different campuses, campus may be a random effect.
2.0.2
Example: sodium content in beer
How much sodium is there in North American beer? How much does this vary by brand?
Observations: for 6 brands of beer, we recorded the sodium content of 8 12 ounce bottles.
Questions of interest: what is the grand mean sodium content? How much variability is there from
brand to brand?
Individuals in this case are brands, repeated measures are the 8 bottles.
In [33]: %%R
url = http://stats191.stanford.edu/data/sodium.table
sodium.table = read.table(url, header=T)
sodium.table$brand = factor(sodium.table$brand)
sodium.lm = lm(sodium ~ brand, sodium.table)
anova(sodium.lm)
Response: sodium
Pr(>F)
brand
5 854.53 170.906 238.71 < 2.2e-16 ***
Residuals 42 30.07
0.716
--Signif. codes: 0 *** 0.001 ** 0.01 * 0.05 . 0.1 1
2.0.3
One-way random effects model
Assuming that cell-sizes are the same, i.e. equal observations for each subject (brand of beer).
Observations
Yij + i + ij , 1 i r, 1 j n
ij N (0, 2 ), 1 i r, 1 j n
i N (0, 2 ), 1 i r.
Parameters:
is the population mean;
2 is the measurement variance (i.e. how variable are the readings from the machine that reads
the sodium content?);
2 is the population variance (i.e. how variable is the sodium content of beer across brands).
15
2.0.4
Modelling the variance
In random effects model, the observations are no longer independent (even if s are independent

Cov(Yij , Yi0 j 0 ) = 2 + 2 j,j 0 i,i0 .
In more complicated models, this makes maximum likelihood estimation more complicated: least
squares is no longer the best solution.
Its no longer a plane!
This model has a very simple model for the mean, it just has a slightly more complex model for the
variance.
Shortly well see other more complex models of the variance:
Weighted Least Squares
Correlated Errors
2.0.5
Fitting the model
The MLE (Maximum Likelihood Estimator) is found by minimizing

2 log `(, 2 , 2 |Y ) =
r
X
(Yi )T (2 Ini ni + 2 11T )1 (Yi )
i=1

+ log det(2 Ini ni + 2 11T ) .
THe function `(, 2 , 2 ) is called the likelihood function.
2.0.6
Fitting the model in balanced design
Only one parameter in the mean function . - When cell sizes are the same (balanced),
b = Y =
1 X
Yij .
nr i,j
Unbalanced models: use numerical optimizer.

This also changes estimates of 2 see ANOVA table. We might guess that df = nr 1 and
b2 =
1 X
(Yij Y )2 .
nr 1 i,j
This is not correct.

In [34]: %%R
library(nlme)
sodium.lme = lme(fixed=sodium~1,random=~1|brand, data=sodium.table)
summary(sodium.lme)
Linear mixed-effects model fit by REML
Data: sodium.table
AIC
BIC
logLik
154.923 160.4735 -74.46152
Random effects:
16
Formula: ~1 | brand
(Intercept) Residual
StdDev:
4.612346 0.8461397
Fixed effects: sodium ~ 1
Value Std.Error DF t-value p-value
(Intercept) 17.62917 1.886939 42 9.342733
0
Standardized Within-Group Residuals:
Min
Q1
Med
Q3
-1.90551291 -0.68337933 0.08232268 0.79246858
Max
1.64968961
Number of Observations: 48
Number of Groups: 6
For reasons Im not sure of, the degrees of freedom dont agree with our ANOVA, though we do find the
correct SE for our estimate of :
In [35]: %%R
MSTR = anova(sodium.lm)$Mean[1]
sqrt(MSTR/48)
[1] 1.886939
The intervals formed by lme use the 42 degrees of freedom, but are otherwise the same:
In [36]: %%R
intervals(sodium.lme)
Approximate 95% confidence intervals
Fixed effects:
lower
est.
upper
(Intercept) 13.82117 17.62917 21.43716
attr(,"label")
[1] "Fixed effects:"
Random Effects:
Level: brand
lower
est.
upper
sd((Intercept)) 2.475221 4.612346 8.594683
Within-group standard error:
lower
est.
upper
0.6832445 0.8461397 1.0478715
In [37]: %%R
center = mean(sodium.table$sodium)
lwr = center - sqrt(MSTR / 48) * qt(0.975,42)
upr = center + sqrt(MSTR / 48) * qt(0.975,42)
data.frame(lwr, center, upr)
17
lwr
center
upr
1 13.82117 17.62917 21.43716
Using our degrees of freedom as 7 yields slightly wider intervals

In [38]: %%R
center = mean(sodium.table$sodium)
lwr = center - sqrt(MSTR / 48) * qt(0.975,7)
upr = center + sqrt(MSTR / 48) * qt(0.975,7)
data.frame(lwr, center, upr)
lwr
center
upr
1 13.16727 17.62917 22.09107
2.0.7
ANOVA table
Again, the information needed can be summarized in an ANOVA table.

In [40]: make_table(anova_oneway)
apply_theme(basic)
ANOVA table is still useful to setup tests: the same F statistics for fixed or random will work here.
Test for random effect: H0 : 2 = 0 based on
F =
2.0.8
M ST R
Fr1,(n1)r
M SE
under H0 .
Inference for
Easy to check that

E(Y ) =
Var(Y ) =
2 + n2
.
rn
To come up with a t statistic that we can use for test, CIs, we need to find an estimate of Var(Y ).
ANOVA table says E(M ST R) = n2 + 2 which suggests
Y
q
tr1 .
M ST R
rn
2.0.9
Degrees of freedom
Why r 1 degrees of freedom?

Imagine we could record an infinite number of observations for each individu
al, so that Y i + i .
To learn anything about we still only have r observations (1 , . . . , r ).
Sampling more within an individual cannot narrow the CI for .
18
2.0.10
Estimating 2
We have seen estimates of and 2 . Only one parameter remains.

Based on the ANOVA table, we see that
2 =
1
(E(M ST R) E(M SE)).
n
This suggests the estimate

1
2 = (M ST R M SE).
n
However, this estimate can be negative!
Many such computational difficulties arise in random (and mixed) effects models.
2.1
Mixed effects model
The one-way random effects ANOVA is a special case of a so-called mixed effects model:
Yn1 = Xnp p1 + Znq q1
N (0, ).
Various models also consider restrictions on (e.g. diagonal, unrestricted, block diagonal, etc.)
Our multiple linear regression model is a (very simple) mixed-effects model with q = n,
Z = Inn
= 2 Inn .
19

Anova 2

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Anova 2

Uploaded by

Copyright:

Available Formats

ANOVA

February 12, 2015

Often, especially in experimental settings, we record only categorical variables.

Example: recovery time

In [3]: %%R -h 800 -w 800

First generalization of two sample t-test: more than two groups.

Remember, its still a model (i.e. a plane)

Fitting the model

Model is easy to fit:

0 *** 0.001 ** 0.01 * 0.05 . 0.1 1

Residual standard error: 4.451 on 21 degrees of freedom

(Intercept) Fitness2 Fitness3

Much of the information in an ANOVA model is contained in the ANOVA table.

Analysis of Variance Table

Testing for any main effect

Rows in the ANOVA table are, in general, independent.

the degrees of freedom come from the df column in previous table.

Inference for linear combinations

Suppose we want to infer something about

where i = + i is the mean in the i-th group. For example:

After this, the usual confidence intervals and t-tests apply.

(Intercept) Fitness2 Fitness3

Often, we will have more than one variable we are changing.

Days Duration Weight ID D W

Two-way ANOVA model

Second generalization of t-test: more than one grouping variable.

Two-way ANOVA: main questions of interest

Are there interaction effects:

Interactions between factors

Weve already seen these interactions in the IT salary example.

In [16]: %%R -h 800 -w 800

Fitting the model

Easy to fit when nij = n (balanced)

Inference for combinations

Residual standard error: 0.7327 on 54 degrees of freedom

Suppose we are interested in comparing the mean in (D = 1, W = 3) and (D = 2, W = 2) groups. The

(Intercept) D2 W2 W3 D2:W2 D2:W3

Finding predicted values

Tests using the ANOVA table

We can also test for interactions using our usual approach

Some caveats about R formulae

ANOVA tables in general

Analysis of Variance Table

Similarly, the first Sum Sq in anova can be found by:

Anova Table (Type III tests)

** 0.01 * 0.05 . 0.1 1

Fixed and random effects

When to use random effects?

Example: sodium content in beer

One-way random effects model

Modelling the variance

Fitting the model

The MLE (Maximum Likelihood Estimator) is found by minimizing

Fitting the model in balanced design

Unbalanced models: use numerical optimizer.

This is not correct.

Using our degrees of freedom as 7 yields slightly wider intervals

Again, the information needed can be summarized in an ANOVA table.

Easy to check that

Why r 1 degrees of freedom?

We have seen estimates of and 2 . Only one parameter remains.

This suggests the estimate

Mixed effects model

0 * 0.001 0.01 * 0.05 . 0.1 1

We have seen estimates of and 2 . Only one parameter remains.