+
=
o 
Sample size in each
group (assumes equal
sized groups)
Represents the
desired power
(typically .84 for
80% power).
Represents the
desired level of
statistical
significance
(typically 1.96).
A measure of
variability (similar
to standard
deviation)
Effect Size
(the difference
in proportions)
Derivation of sample size
formula.
Critical value= 0+.52*1.96=1
Power close to 50%
Study 2: 18 treated, 72 controls, effect size=1.0
SAMPLE SIZE AND POWER FORMULAS
Critical value=
0+standard error (difference)*Z
o/2
Power= area to right of Z

=
(diff) error standard
1) (here difference e alternativ  value critical =
=

Z
% 50 power ;
(diff) error standard
0
: here . . =
=

Z g e


o 
o 
o

Z Z
Z Z
Z
Z
Z
power
power
of right the to area the of left the to area the
Z
) error(diff standard
difference
) error(diff standard
difference
Z
) error(diff standard
difference  (diff) error standard * Z
/2
/2
/2
=
=
=
=
=
Power= area to right of Z

=
(diff) error standard
difference e alternativ  value critical
=

Z
Power is the area to the right of Z

. OR
power is the area to the left of  Z

.
Since normal charts give us the area to
the left by convention, we need to use
 Z

to get the correct value. Most
textbooks just call this Z

; Ill use
the term Z
power
to avoid confusion.
2 /
erence) error(diff standard
difference
o
Z Z
power
=
Allpurpose power formula
2
2
1
2
) .( .
n n
diff e s
o o
+ =
1
2
1
2
) .( . : 1 group to 2 group of r ratio if
rn n
diff e s
o o
+ =
Derivation of a sample size
formula
Sample size is embedded in the
standard error.
2
1
2
2
/2
/2
1
2
/2
1
2
1
2
)
) 1 (
difference
( ) Z (
Z
) 1 (
difference
Z
difference
rn
r
Z
rn
r
Z
rn n
Z
power
power
power
o
o
o o
o
o
o
+
= +
+
=
+
=
Algebra
2
2
/2
2
1
2
/2
2 2
1
2
1
2
/2
2
difference
) Z ( ) 1 (
) Z ( ) 1 ( difference
difference ) Z ( ) 1 (
r
Z r
n
Z r rn
rn Z r
power
power
power
o
o
o
o
o
o
+ +
=
+ + =
= + +
2
2
/2
2
1
difference
) Z ( 2
then groups), (equal 1 r If
o
o +
= =
power
Z
n
2
2
/2
2
1
difference
) Z (
) 1 (
o
o +
+
=
power
Z
r
r
n
Sample size formula for
difference in means
2
2
/2
2
1
difference
) Z (
) 1 (
o
o +
+
=
power
Z
r
r
n
.05) for (1.96 level ce significan tailed  two to s correspond Z
power) 80% (.84 power to s correspond Z
outcome the of means in difference meaningful clinically e diffferenc
stic characteri the of deviation standard
group smaller to group larger of ratio r
group smaller of size n
: where
2 /
1
= =
= =
=
=
=
=
o
o
o
power
Examples
Example 1: You want to calculate how much power
you will have to see a difference of 3.0 IQ points
between two groups: 30 male doctors and 30 female
doctors. If you expect the standard deviation to be
about 10 on an IQ test for both groups, then the
standard error for the difference will be about:
= 2.57
30
10
30
10
2 2
+
Power formula
2 / 2 /
2
2 /
2
*
2
*
*) (
*
o o o
o
o
o
Z
n d
Z
n
d
Z
d
d
Z
power
= = =
P(Z .79) =.21; only 21% power to see a difference of 3 IQ points.
79 . 96 . 1
2
30
10
3
2
*
or 79 . 96 . 1
57 . 2
3
*) (
*
2 / 2 /
= = = = = =
o  o 
o o
Z
n d
Z Z
d
d
Z
power
Z power
Z
Example 2: How many people would
you need to sample in each group to
achieve power of 80% (corresponds to
Z

=.84)
174
) 3 (
) 96 . 1 84 )(. 2 ( 100
*) (
) ( 2
2
2
2
2
2 /
2
=
+
=
+
=
d
Z Z
n
o 
o
174/group; 348 altogether
Sample Size needed for
comparing two proportions:
Example: I am going to run a casecontrol study
to determine if pancreatic cancer is linked to
drinking coffee. If I want 80% power to detect
a 10% difference in the proportion of coffee
drinkers among cases vs. controls (if coffee
drinking and pancreatic cancer are linked, we
would expect that a higher proportion of cases
would be coffee drinkers than controls), how
many cases and controls should I sample?
About half the population drinks coffee.
Derivation of a sample size
formula:
The standard error of the difference of two proportions is:
2 1
) 1 ( ) 1 (
n
p p
n
p p
+
n
n n
/ 5 .
) 5 . 1 ( 5 . ) 5 . 1 ( 5 .
=
Here, if we assume equal sample size and
that, under the null hypothesis proportions of
coffee drinkers is .5 in both cases and
controls, then
s.e.(diff)=
Derivation of a sample size
formula:
2 /
) statisti s.e.(test
statisti test
o
Z
c
c
Z
power
=
96 . 1
n / 5 .
10 .
=
power
Z
For 80% power
392
10 .
) 96 . 1 84 (. 5 .
5 .
10 .
) 96 . 1 84 (.
/ 5 .
10 .
96 . 1 84 .
96 . 1
/ 5 .
10 .
84 .
2
2
2
2
=
+
=
= +
= +
=
n
n
n
n
There is 80% area to the
left of a Zscore of .84 on
a standard normal curve;
therefore, there is 80%
area to the right of .84.
Would take 392 cases and 392 controls to have 80% power!
Total=784
Question 2:
How many total cases and controls would I have
to sample to get 80% power for the same
study, if I sample 2 controls for every case?
Ask yourself, what changes here?
n n n n n n n
p p
n
p p
2
75 .
2
75 .
2
5 .
2
25 . 25 .
2
25 . ) 1 (
2
) 1 (
= = + = + =
2 /
) statisti s.e.(test
statisti test
o
Z
c
c
Z
power
=
Different size groups
294
10 ). 2 (
) 96 . 1 84 (. 75 .
75 .
2 ) 10 (.
) 96 . 1 84 (.
2 / 75 .
10 .
96 . 1 84 .
96 . 1
2 / 75 .
10 .
84 .
2
2
2
2
=
+
=
= +
= +
=
n
n
n
n
Need: 294 cases and 2x294=588 controls. 882 total.
Note: you get the best power for the lowest sample size if you keep both groups equal (882 > 784).
You would only want to make groups unequal if there was an obvious difference in the cost or ease of
collecting data on one group. E.g., cases of pancreatic cancer are rare and take time to find.
General sample size formula
rn
p p r
rn
p p r
rn
p p
n
p p
rn
p p
diff e s
) 1 ( ) 1 ( ) 1 ( ) 1 ( ) 1 ( ) 1 (
) .( .
+
=
=
2
2 1
2
2 /
) (
) )( 1 (
1
p p
Z Z p p
r
r
n
power
+
+
=
o
General sample size needs
when outcome is binary:
2
2
2 /
2
) (
) ( 2
diff
Z Z
n
power o
o +
=
.05) for (1.96 level ce significan tailed  two to s correspond Z
power) 80% (.84 power to s correspond Z
outcome the of s proportion in difference meaningful clinically p
group smaller to group larger of ratio r
group smaller of size n
: where
2 /
2 1
= =
= =
=
=
=
o
o

p
2
2 1
2
2 /
) (
) )( 1 (
1
p p
Z Z p p
r
r
n
+
+
=
o 
Compare with when outcome
is continuous:
2
2
/2
2
1
difference
) Z (
) 1 (
o 
o +
+
=
Z
r
r
n
.05) for (1.96 level ce significan tailed  two to s correspond Z
power) 80% (.84 power to s correspond Z
outcome the of means in difference meaningful clinically e diffferenc
stic characteri the of deviation standard
group smaller to group larger of ratio r
group smaller of size n
: where
2 /
1
= =
= =
=
=
=
=
o
o
o

Question
How many subjects would we need to sample
to have 80% power to detect an average
increase in MCAT biology score of 1 point, if
the average change without instruction (just
due to chance) is plus or minus 3 points
(=standard deviation of change)?
Standard error here=
n n
change
3
=
o
2 /
) statisti s.e.(test
statisti test
o
Z
c
c
Z
power
=
2
2
2 /
2
2
2
2
2 /
2 /
) (
) (
D
Z Z
n
D n
Z Z
Z
n
D
Z
power D
D
power
D
power
o
o
o
o
o
o
+
=
= +
=
Therefore, need:
(9)(1.96+.84)
2
/1 =
70 people total
Where D=change
from test 1 to test
2. (difference)
Sample size for paired data:
2
2
/2
2
difference
) Z (
o 
o +
=
Z
n
d
.05) for (1.96 level ce significan tailed  two to s correspond Z
power) 80% (.84 power to s correspond Z
difference meaningful clinically e diffferenc
difference pair  within the of deviation standard
size sample n
: where
2 /
= =
= =
=
=
=
o
o
o

Paired data difference in
proportion: sample size:
2
2
2 /
2
) (
) ( 2
diff
Z Z
n
power o
o +
=
.05) for (1.96 level ce significan tailed  two to s correspond Z
power) 80% (.84 power to s correspond Z
s proportion dependent in difference meaningful clinically p
group 1 for size sample n
: where
2 /
2 1
= =
= =
=
=
o
o

p
2
2 1
2
2 /
) (
) )( 1 (
p p
Z Z p p
n
+
=
o 