Lecture 11

Introduction to sample size
and power calculations

How much chance do we have to
reject the null hypothesis when
the alternative is in fact true?
(whats the probability of
detecting a real effect?)
Can we quantify how much
power we have for given
sample sizes?
Null
Distribution:
difference=0.
Clinically relevant
alternative:
difference=10%.
Rejection region.
Any value >= 6.5
(0+3.3*1.96)
study 1: 263 cases, 1241 controls
For 5% significance level,
one-tail area=2.5%
(Z
o/2
= 1.96)
Power= chance of being in the
rejection region if the alternative
is true=area to the right of this
line (in yellow)
Rejection region.
Any value >= 6.5
(0+3.3*1.96)
Power= chance of being in the
rejection region if the alternative
is true=area to the right of this
line (in yellow)
Power here:

% 85 = ) 06 . 1 > Z ( P
= )
3 . 3
10 5 . 6
> Z ( P
Critical value=
0+10*1.96=20
Power closer to
15% now.
2.5% area
Z
o/2
=1.96
Critical value=
0+0.52*1.96 = 1
Power is nearly
100%!
Study 2: 18 treated, 72 controls, STD DEV = 2
Clinically relevant
alternative:
difference=4 points
Critical value=
0+2.58*1.96 = 5
Power is about
40%
Study 2: 18 treated, 72 controls, STD DEV=10
Critical value=
0+0.52*1.96 = 1
Power is about
50%
Study 2: 18 treated, 72 controls, effect size=1.0
Clinically relevant
alternative:
difference=1 point
Factors Affecting Power
1. Size of the effect
2. Standard deviation of the characteristic
3. Bigger sample size
4. Significance level desired

average weight from samples of 100
Null
Clinically
relevant
alternative
1. Bigger difference from the null mean
2. Bigger standard deviation
3. Bigger Sample Size
4. Higher significance level
Rejection region.
Sample size calculations
Based on these elements, you can write
a formal mathematical equation that
relates power, sample size, effect size,
standard deviation, and significance
level

**WE WILL DERIVE THESE FORMULAS
FORMALLY SHORTLY**
Simple formula for difference
in means
Sample size in each
group (assumes equal
sized groups)
Represents the
desired power
(typically .84 for
80% power).
Represents the
desired level of
statistical
significance
(typically 1.96).
Standard
deviation of the
outcome variable
Effect Size
(the difference
in means)
2
2
/2
2
difference
) Z ( 2
o |
o +
=
Z
n
Simple formula for difference
in proportions
2
2 1
2
/2
) (p
) Z )( 1 )( ( 2
p
Z p p
n
+
=
o |
Sample size in each
group (assumes equal
sized groups)
Represents the
desired power
(typically .84 for
80% power).
Represents the
desired level of
statistical
significance
(typically 1.96).
A measure of
variability (similar
to standard
deviation)
Effect Size
(the difference
in proportions)
Derivation of sample size
formula.

Critical value= 0+.52*1.96=1
Power close to 50%
Study 2: 18 treated, 72 controls, effect size=1.0
SAMPLE SIZE AND POWER FORMULAS
Critical value=
0+standard error (difference)*Z
o/2

Power= area to right of Z
|
=

(diff) error standard
1) (here difference e alternativ - value critical =
=
|
Z
% 50 power ;
0
: here . . =
=
|
Z g e
|
|
o |
o |
o
|
Z Z
Z Z
Z
Z
Z
power
power
of right the to area the of left the to area the
Z
) error(diff standard
difference
difference
Z
difference - (diff) error standard * Z
/2
/2
/2
=
=
=
=
=
Power= area to right of Z
|
=

difference e alternativ - value critical
=
|
Z
Power is the area to the right of Z
|
. OR
power is the area to the left of - Z
|
.
Since normal charts give us the area to
the left by convention, we need to use
- Z
|
to get the correct value. Most
textbooks just call this Z
|
; Ill use
the term Z
power
to avoid confusion.

2 /
erence) error(diff standard
difference
o
Z Z
power
=
All-purpose power formula
2
2
1
2
) .( .
n n
diff e s
o o
+ =
1
2
1
2
) .( . : 1 group to 2 group of r ratio if
rn n
diff e s
o o
+ =
Derivation of a sample size
formula
Sample size is embedded in the
standard error.
2
1
2
2
/2
/2
1
2
/2
1
2
1
2
)
) 1 (
difference
( ) Z (
Z
) 1 (
difference
Z
difference
rn
r
Z
rn
r
Z
rn n
Z
power
power
power
o
o
o o
o
o
o
+
= +
+
=
+
=
Algebra
2
2
/2
2
1
2
/2
2 2
1
2
1
2
/2
2
difference
) Z ( ) 1 (
) Z ( ) 1 ( difference
difference ) Z ( ) 1 (
r
Z r
n
Z r rn
rn Z r
power
power
power
o
o
o
o
o
o
+ +
=
+ + =
= + +
2
2
/2
2
1
difference
) Z ( 2
then groups), (equal 1 r If
o
o +
= =
power
Z
n
2
2
/2
2
1
difference
) Z (
) 1 (
o
o +
+
=
power
Z
r
r
n
Sample size formula for
difference in means
2
2
/2
2
1
difference
) Z (
) 1 (
o
o +
+
=
power
Z
r
r
n

.05) for (1.96 level ce significan tailed - two to s correspond Z
power) 80% (.84 power to s correspond Z
outcome the of means in difference meaningful clinically e diffferenc
stic characteri the of deviation standard
group smaller to group larger of ratio r
group smaller of size n
: where
2 /
1
= =
= =
=
=
=
=
o
o
o
power
Examples
Example 1: You want to calculate how much power
you will have to see a difference of 3.0 IQ points
between two groups: 30 male doctors and 30 female
doctors. If you expect the standard deviation to be
about 10 on an IQ test for both groups, then the
standard error for the difference will be about:
= 2.57
30
10
30
10
2 2
+
Power formula
2 / 2 /
2
2 /
2
*
2
*
*) (
*
o o o
o
o
o
Z
n d
Z
n
d
Z
d
d
Z
power
= = =

P(Z -.79) =.21; only 21% power to see a difference of 3 IQ points.
79 . 96 . 1
2
30
10
3
2
*
or 79 . 96 . 1
57 . 2
3
*) (
*
2 / 2 /
= = = = = =
o | o |
o o
Z
n d
Z Z
d
d
Z
power
Z power
Z
Example 2: How many people would
you need to sample in each group to
achieve power of 80% (corresponds to
Z
|
=.84)
174
) 3 (
) 96 . 1 84 )(. 2 ( 100
*) (
) ( 2
2
2
2
2
2 /
2
=
+
=
+
=
d
Z Z
n
o |
o
174/group; 348 altogether
Sample Size needed for
comparing two proportions:
Example: I am going to run a case-control study
to determine if pancreatic cancer is linked to
drinking coffee. If I want 80% power to detect
a 10% difference in the proportion of coffee
drinkers among cases vs. controls (if coffee
drinking and pancreatic cancer are linked, we
would expect that a higher proportion of cases
would be coffee drinkers than controls), how
many cases and controls should I sample?
About half the population drinks coffee.

formula:
The standard error of the difference of two proportions is:
2 1
) 1 ( ) 1 (
n
p p
n
p p
+
n
n n
/ 5 .
) 5 . 1 ( 5 . ) 5 . 1 ( 5 .
=

Here, if we assume equal sample size and
that, under the null hypothesis proportions of
coffee drinkers is .5 in both cases and
controls, then
s.e.(diff)=
formula:
2 /
) statisti s.e.(test
statisti test
o
Z
c
c
Z
power
=
96 . 1
n / 5 .
10 .
=
power
Z
For 80% power
392
10 .
) 96 . 1 84 (. 5 .
5 .
10 .
) 96 . 1 84 (.
/ 5 .
10 .
96 . 1 84 .
96 . 1
/ 5 .
10 .
84 .
2
2
2
2
=
+
=
= +
= +
=
n
n
n
n
There is 80% area to the
left of a Z-score of .84 on
a standard normal curve;
therefore, there is 80%
area to the right of -.84.
Would take 392 cases and 392 controls to have 80% power!
Total=784
Question 2:
How many total cases and controls would I have
to sample to get 80% power for the same
study, if I sample 2 controls for every case?

Ask yourself, what changes here?

n n n n n n n
p p
n
p p
2
75 .
2
75 .
2
5 .
2
25 . 25 .
2
25 . ) 1 (
2
) 1 (
= = + = + =
2 /
statisti test
o
Z
c
c
Z
power
=
Different size groups
294
10 ). 2 (
) 96 . 1 84 (. 75 .
75 .
2 ) 10 (.
) 96 . 1 84 (.
2 / 75 .
10 .
96 . 1 84 .
96 . 1
2 / 75 .
10 .
84 .
2
2
2
2
=
+
=
= +
= +
=
n
n
n
n
Need: 294 cases and 2x294=588 controls. 882 total.

Note: you get the best power for the lowest sample size if you keep both groups equal (882 > 784).
You would only want to make groups unequal if there was an obvious difference in the cost or ease of
collecting data on one group. E.g., cases of pancreatic cancer are rare and take time to find.
General sample size formula
rn
p p r
rn
p p r
rn
p p
n
p p
rn
p p
diff e s
) 1 ( ) 1 ( ) 1 ( ) 1 ( ) 1 ( ) 1 (
) .( .
+
=
=
2
2 1
2
2 /
) (
) )( 1 (
1
p p
Z Z p p
r
r
n
power
+
+
=
o

General sample size needs
when outcome is binary:
2
2
2 /
2
) (
) ( 2
diff
Z Z
n
power o
o +
=
outcome the of s proportion in difference meaningful clinically p
: where
2 /
2 1
= =
= =
=
=
=
o
o
|
p
2
2 1
2
2 /
) (
) )( 1 (
1
p p
Z Z p p
r
r
n
+
+
=
o |
Compare with when outcome
is continuous:
2
2
/2
2
1
difference
) Z (
) 1 (
o |
o +
+
=
Z
r
r
n

outcome the of means in difference meaningful clinically e diffferenc
stic characteri the of deviation standard
: where
2 /
1
= =
= =
=
=
=
=
o
o
o
|

Question
How many subjects would we need to sample
to have 80% power to detect an average
increase in MCAT biology score of 1 point, if
the average change without instruction (just
due to chance) is plus or minus 3 points
(=standard deviation of change)?

Standard error here=

n n
change
3
=
o
2 /
statisti test
o
Z
c
c
Z
power
=
2
2
2 /
2
2
2
2
2 /
2 /
) (
) (
D
Z Z
n
D n
Z Z
Z
n
D
Z
power D
D
power
D
power
o
o
o
o
o
o
+
=
= +
=
Therefore, need:
(9)(1.96+.84)
2
/1 =
70 people total
Where D=change
from test 1 to test
2. (difference)
Sample size for paired data:
2
2
/2
2
difference
) Z (
o |
o +
=
Z
n
d

difference meaningful clinically e diffferenc
difference pair - within the of deviation standard
size sample n
: where
2 /
= =
= =
=
=
=
o
o
o
|
Paired data difference in
proportion: sample size:
2
2
2 /
2
) (
) ( 2
diff
Z Z
n
power o
o +
=
s proportion dependent in difference meaningful clinically p
group 1 for size sample n
: where
2 /
2 1
= =
= =
=
=
o
o
|
p
2
2 1
2
2 /
) (
) )( 1 (
p p
Z Z p p
n
+
=
o |

Lecture 11

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Lecture 11

Uploaded by

Copyright:

Available Formats

Introduction to sample size

and power calculations

You might also like