Professional Documents
Culture Documents
FS 2017
Chapter 13: Samples and Surveys
Chapter_13_14.tns
1 / 32
Car of the Year
2 / 32
Car of the Year
2 / 32
Car of the Year
Results
All the people
from the
who bought
people who
a new car
completed
in 2011
a survey
2 / 32
Car of the Year
Results
All the people
from the
who bought
people who
a new car
completed
in 2011
a survey
2 / 32
Car of the Year
Results
All the people
from the
who bought
people who
a new car
completed
in 2011
a survey
2 / 32
Populations and Samples
3 / 32
Populations and Samples
3 / 32
Populations and Samples
3 / 32
Populations and Samples
3 / 32
Populations and Samples
4 / 32
Representative and Bias
4 / 32
Representative and Bias
4 / 32
Representative and Bias
4 / 32
Representative and Bias
4 / 32
Representative and Bias
5 / 32
Gallup Poll
Question: What went wrong?
I 1936 was the middle of the Great Depression.
5 / 32
Gallup Poll
Question: What went wrong?
I 1936 was the middle of the Great Depression.
I Not everyone was able to afford a telephone and a
magazine subscription.
5 / 32
Gallup Poll
Question: What went wrong?
I 1936 was the middle of the Great Depression.
I Not everyone was able to afford a telephone and a
magazine subscription.
I Therefore the sample was biased to wealthy people.
5 / 32
Gallup Poll
Question: What went wrong?
I 1936 was the middle of the Great Depression.
I Not everyone was able to afford a telephone and a
magazine subscription.
I Therefore the sample was biased to wealthy people.
I Wealthy people tended to vote Republican.
5 / 32
Gallup Poll
Question: What went wrong?
I 1936 was the middle of the Great Depression.
I Not everyone was able to afford a telephone and a
magazine subscription.
I Therefore the sample was biased to wealthy people.
I Wealthy people tended to vote Republican.
Question: What went right?
5 / 32
Gallup Poll
Question: What went wrong?
I 1936 was the middle of the Great Depression.
I Not everyone was able to afford a telephone and a
magazine subscription.
I Therefore the sample was biased to wealthy people.
I Wealthy people tended to vote Republican.
Question: What went right?
I George Gallup correctly predicted the election even with a
reduced sample size.
5 / 32
Gallup Poll
Question: What went wrong?
I 1936 was the middle of the Great Depression.
I Not everyone was able to afford a telephone and a
magazine subscription.
I Therefore the sample was biased to wealthy people.
I Wealthy people tended to vote Republican.
Question: What went right?
I George Gallup correctly predicted the election even with a
reduced sample size.
I He also correctly predicted the results of the Digest’s poll!
5 / 32
Gallup Poll
Question: What went wrong?
I 1936 was the middle of the Great Depression.
I Not everyone was able to afford a telephone and a
magazine subscription.
I Therefore the sample was biased to wealthy people.
I Wealthy people tended to vote Republican.
Question: What went right?
I George Gallup correctly predicted the election even with a
reduced sample size.
I He also correctly predicted the results of the Digest’s poll!
I Gallup is still going strong today.
5 / 32
An Experiment in Randomness
6 / 32
An Experiment in Randomness
6 / 32
An Experiment in Randomness
6 / 32
An Experiment in Randomness
6 / 32
Humans versus Machines!
Humans versus Machines!
randSamp(seq(i,i,1,42),6,1)
randSamp(seq(i,i,1,42),6,1)
randSamp(seq(i,i,1,42),6,1)
randSamp(seq(i,i,1,42),6,1)
7 / 32
Humans versus Machines!
randSamp(seq(i,i,1,42),6,1)
randSamp(seq(i,i,1,42),6,1)
randSamp(seq(i,i,1,42),6,1)
randSamp(seq(i,i,1,42),6,1)
7 / 32
Randomization
8 / 32
Randomization
8 / 32
Randomization
8 / 32
Randomization
8 / 32
Randomization
Sample
8 / 32
Randomization
Population Sample
8 / 32
Randomization
Population Sample
8 / 32
Randomization
8 / 32
Inferential Statistics
9 / 32
Inferential Statistics
9 / 32
Sample Sizes
10 / 32
Sample Sizes
10 / 32
Sample Sizes
10 / 32
Sample Sizes
10 / 32
Simple Random Sample
11 / 32
Simple Random Sample
11 / 32
Simple Random Sample
11 / 32
Simple Random Sample
11 / 32
Identifying the Sample Frame
12 / 32
Identifying the Sample Frame
What we What we
HAVE WANT
A list of
all the
people who
WILL vote
12 / 32
Identifying the Sample Frame
What we What we
HAVE WANT
A list of
all the
people who
WILL vote
The people who do vote tend not to form an SRS of the voter’s list.
12 / 32
Hypothetical Populations
13 / 32
Hypothetical Populations
+ ⇒
10% BIGGER
13 / 32
Hypothetical Populations
+ ⇒
10% BIGGER
13 / 32
Hypothetical Populations
+ ⇒
10% BIGGER
13 / 32
Estimating Parameters
14 / 32
Estimating Parameters
µ σ2 p
14 / 32
Estimating Parameters
µ σ2 p
x s2 p
b
14 / 32
Estimating Parameters
µ σ2 p
x s2 p
b
14 / 32
Estimating Parameters
µ σ2 p
x s2 p
b
I Given p
b, what can we say about p?
14 / 32
Estimating Parameters
µ σ2 p
x s2 p
b
I Given p
b, what can we say about p?
15 / 32
Sampling Variation
15 / 32
Sampling Variation
15 / 32
Sampling Variation
sample1:=randSamp(price,5,1)
mean(sample1)
sample2:=randSamp(price,5,1)
mean(sample2)
15 / 32
Sampling Variation
sample1:=randSamp(price,5,1)
mean(sample1)
sample2:=randSamp(price,5,1)
mean(sample2)
15 / 32
Sampling Distribution
16 / 32
Sampling Distribution
16 / 32
Sampling Distribution
16 / 32
Sampling Distribution
16 / 32
We Know Nothing!
17 / 32
We Know Nothing!
BUY
Data is stored: how
much, what etc.
17 / 32
We Know Nothing!
BUY
Data is stored: how
much, what etc.
NO BUY
We know nothing!
How many did not buy?
Why did they not buy?
I Nothing suitable
I wrong size/colour
17 / 32
We Know Nothing!
BUY
Data is stored: how
much, what etc.
NO BUY
We know nothing!
How many did not buy?
Why did they not buy?
I Nothing suitable
I wrong size/colour
18 / 32
Exit Surveys
18 / 32
Exit Surveys
18 / 32
Exit Surveys
18 / 32
Exit Surveys
Proportions for
Customers who do not buy
each of the reasons
18 / 32
Exit Surveys
Proportions for
Customers who do not buy
each of the reasons
18 / 32
Exit Surveys
Proportions for
Customers who do not buy
each of the reasons
18 / 32
Alternative Sampling Methods
19 / 32
Alternative Sampling Methods
19 / 32
Alternative Sampling Methods
19 / 32
Alternative Sampling Methods
19 / 32
Alternative Sampling Methods
19 / 32
Alternative Sampling Methods
19 / 32
Alternative Sampling Methods
20 / 32
Checklist for Surveys
20 / 32
Checklist for Surveys
20 / 32
Checklist for Surveys
20 / 32
Checklist for Surveys
20 / 32
Checklist for Surveys
20 / 32
Sampling Distribution of the Mean.
21 / 32
GPS Chips
GPS Chips
GPS Chips
GPS Chips
GPS Chips
GPS Chips
GPS Chips
Random sampling
to test the process
22 / 32
HALT Testing
23 / 32
HALT Testing
23 / 32
HALT Testing
23 / 32
HALT Testing
23 / 32
HALT Testing
A HALT test has 15 stages. When a chip fails a test, the stage
of the test is noted down. If a chip survives all tests it gets a
score of 16.
23 / 32
HALT Scores
24 / 32
HALT Scores
24 / 32
HALT Scores
24 / 32
HALT Scores
BUT
24 / 32
HALT Scores
BUT
24 / 32
HALT Scores
BUT
24 / 32
HALT Data, X
25 / 32
HALT Data, X
25 / 32
HALT Data, X
25 / 32
HALT Data, X
25 / 32
HALT Data, X
26 / 32
Sample Mean Data, X
26 / 32
Sample Mean Data, X
xbar:=seq mean iffn(day=i,halt,_) ,i,1,21
26 / 32
Sample Mean Data, X
xbar:=seq mean iffn(day=i,halt,_) ,i,1,21
I It’s expected that a single chip might score low, but it is
unlikely that a whole sample scores badly.
26 / 32
The Benefits of Averaging
The Benefits of Averaging
1. mean(halt)
mean(xbar)
The Benefits of Averaging
1. mean(halt) 6.94
mean(xbar)
The Benefits of Averaging
1. mean(halt) 6.94
mean(xbar) 6.94
The Benefits of Averaging
1. mean(halt) 6.94
mean(xbar) 6.94
27 / 32
The Benefits of Averaging
1. mean(halt) 6.94
mean(xbar) 6.94
2. stdevsamp(halt)
stdevsamp(xbar)
27 / 32
The Benefits of Averaging
1. mean(halt) 6.94
mean(xbar) 6.94
2. stdevsamp(halt) 4.24
stdevsamp(xbar)
27 / 32
The Benefits of Averaging
1. mean(halt) 6.94
mean(xbar) 6.94
2. stdevsamp(halt) 4.24
stdevsamp(xbar) 1.19
27 / 32
The Benefits of Averaging
1. mean(halt) 6.94
mean(xbar) 6.94
2. stdevsamp(halt) 4.24
stdevsamp(xbar) 1.19
27 / 32
The Benefits of Averaging
1. mean(halt) 6.94
mean(xbar) 6.94
2. stdevsamp(halt) 4.24
stdevsamp(xbar) 1.19
X is
Normal
Normality
X is X is
Normal normal
Normality
X is X is
Normal normal
X is Non-
normal
Normality
X is X is
Normal normal
X is Non- X is asym-
normal ptotically
normal
28 / 32
Normality
X is X is
Normal normal
28 / 32
Normality
X is X is
Normal normal
Samples of a lar-
ge enough size
28 / 32
Sample Sizes
29 / 32
Sample Sizes
29 / 32
Sample Sizes
29 / 32
Normal Models
1 1
X = X1 + · · · + Xn
n n
30 / 32
Normal Models
1 1
X = X1 + · · · + Xn
n n
30 / 32
Normal Models
1 1
X = X1 + · · · + Xn
n n
I E(X ) =
30 / 32
Normal Models
1 1
X = X1 + · · · + Xn
n n
1
I E(X ) = n E(X1 ) + · · · + n1 E(Xn ) = =
30 / 32
Normal Models
1 1
X = X1 + · · · + Xn
n n
1 n·µ
I E(X ) = n E(X1 ) + · · · + n1 E(Xn ) = =
n
30 / 32
Normal Models
1 1
X = X1 + · · · + Xn
n n
1 n·µ
I E(X ) = n E(X1 ) + · · · + n1 E(Xn ) = = µ
n
30 / 32
Normal Models
1 1
X = X1 + · · · + Xn
n n
1 n·µ
I E(X ) = n E(X1 ) + · · · + n1 E(Xn ) = = µ
n
I Var (X ) =
30 / 32
Normal Models
1 1
X = X1 + · · · + Xn
n n
1 n·µ
I E(X ) = n E(X1 ) + · · · + n1 E(Xn ) = = µ
n
1 1
I Var (X ) = n2
Var (X1 ) + ··· + n2
Var (Xn ) = =
30 / 32
Normal Models
1 1
X = X1 + · · · + Xn
n n
1 n·µ
I E(X ) = n E(X1 ) + · · · + n1 E(Xn ) = = µ
n
1 1 n · σ2
I Var (X ) = n2
Var (X1 ) + ··· + n2
Var (Xn ) = =
n2
30 / 32
Normal Models
1 1
X = X1 + · · · + Xn
n n
1 n·µ
I E(X ) = n E(X1 ) + · · · + n1 E(Xn ) = = µ
n
1 1 n · σ2 σ2
I Var (X ) = n2
Var (X1 ) + ··· + n2
Var (Xn ) = =
n2 n
30 / 32
Normal Models
1 1
X = X1 + · · · + Xn
n n
1 n·µ
I E(X ) = n E(X1 ) + · · · + n1 E(Xn ) = = µ
n
1 1 n · σ2 σ2
I Var (X ) = n2
Var (X1 ) + ··· + n2
Var (Xn ) = =
n2 n
I SD(X ) =
30 / 32
Normal Models
1 1
X = X1 + · · · + Xn
n n
1 n·µ
I E(X ) = n E(X1 ) + · · · + n1 E(Xn ) = = µ
n
1 1 n · σ2 σ2
I Var (X ) = n2
Var (X1 ) + ··· + n2
Var (Xn ) = =
n2 n
σ
I SD(X ) = √
n
30 / 32
Standard Error of the Mean
31 / 32
Standard Error of the Mean
Hence σX = √σ .
n
31 / 32
Standard Error of the Mean
Hence σX = √σ .
n
31 / 32
Standard Error of the Mean
Hence σX = √σ .
n
31 / 32
Sampling Distribution
2
If X ∼ (µ, σ 2 ), then X ∼ N µ, σn .
Sampling Distribution
2
If X ∼ (µ, σ 2 ), then X ∼ N µ, σn .