Professional Documents
Culture Documents
Name: ________________
page 2
A manufacturer uses two machines to drill holes in pieces of sheet metal used in engine
construction. The production manager is interested in knowing whether one machine produces
more defective drillings than the other. Here is the data from one day of production, which we
will treat as a random sample from all the parts produced by the machines.
Number
defective
Machine 1
Machine 2
31
38
Number not
defective
191
143
Whole sample
69
334
Total
403
222
181
1. Use the data to fill in the following contingency table which takes which machine as the
explanatory variable. The sum of each row should be 100%, except for possible rounding.
Machine 1
Defective
13.97
Not defective
86.03
Totals
100%
Machine 2
21
79
100%
17.12
82.88
100%
Whole sample
2. Based on these percentage calculations, we can see that in the sample, there is a difference
between the two machines. To determine whether this is a significant difference (that is, one
that suggests a difference in the entire population), you will carry out a 2-test using the
following steps.
a. Write the null hypothesis using the word correlation or the word association.
b. Write the null hypothesis using words such as dependent, independent, depends on.
Expected counts
Machine 1
Machine 2
Whole sample
Defective
38.0099
Not defective
183.99007
Totals
222
30.99007
150.0099256
181
334
403
69
page 3
d. Fill in the table showing the 2 calculation for each cell.
2 calculation
Machine 1
Machine 2
Defective
Not defective
1.29
.27
1.59
.33
e. Find the total 2 value, and use the StatCrunch chi-square calculator to calculate the
P-value.
f.
g. Fill in the blanks. At = 0.05, we _____________ (did / did not) find evidence that
there (is / is not) an association between which machine it was and whether the part
was defective.
h. Fill in the blanks. At = 0.05, we _____________ (did / did not) find evidence that
whether a part is defective ________ (is / is not) dependent on which machine made
the part.
1. You are doing a chi-square test with a 5 by 9 table of counts (that is, 5 rows and 9
columns).
a. What is the degrees of freedom?
4 or 8
b. Suppose you get 2 = 45.23. Calculate the p-value.
P=.00001
2. Use StatCrunch to calculate the test statistic (that is, the 2 value) and the P-value for this
contingency table, which shows the income level for a random sample of adults cross
classified by gender. (In order to answer question #3, have StatCrunch give you the
expected counts when it runs the hypothesis test.)
Low
Middle
High
Total
Male
Female
23
26
30
28
29
20
82
74
Total
49
56
49
156
X2 value = 1.4994
page 4
P value= .4725
3. By examining the StatCrunch output, give the table of expected counts for the data in
problem #2.
Low
Middle
High
Total
Male
25.76
30.15
25.76
82
Female
23.24
26.56
23.24
74
56
49
156
Total
49
4. ( YES / NO ) Based on the data in problem #2, there is a significant association between
gender and income level (use significance level 0.05).
5. ( YES / NO ) Based on the data in problem #2, there is a significant difference between
the income levels for the two groups (males, females) (use significance level 0.01).
In a large survey of U.S. college students, 1630 of the 7180 men self-identified as binge drinkers,
and 1684 of the 9916 women self-identified as binge drinkers.
1. Calculate the proportion for the men ( ) and the proportion for the women ( ).
Pmen=.227
pwomen= .169826543
2. Calculate a point estimate for the ratio of the proportions for the two populations, using
/ . Fill in the blank: In the sample, men were _________1.3368___ times as
likely to be binge drinkers as women (that is, ______5.72______ % more likely).
3. Calculate a point estimate for the difference between the proportions, using
.
.0572
4. Use StatCrunch to calculate a 95% confidence interval for the difference between the two
proportions. Tell what you put into StatCrunch, and write the confidence interval in a
form similar to (55.17%, 83.21%).
page 5
5. Based on the interval, is it feasible that the proportions are actually the same for the two
populations? (YES / NO)