Professional Documents
Culture Documents
Random Sampling
Example
Find the mean of the random sample whose
observations are 20, 27, and 25.
Solution
The second most useful statistic for measuring the center of the set of data is the median.
If X1, X2,……..,Xn represent a random sample of size n,
then the sample median is defined by the statistic
Example
Find the median for the random sample
whose observations are 8, 3, 9, 5, 6, 8, and 5.
Solution
Arranging the observations in order of magnitude, 3,5,5,6,8,8,9 gives the median as 6.
Example
Find the median for the random sample whose observations are 10, 8, 4, and 7.
Solution
Arranging the observations in order of magnitude 4, 7,8,10,
the median is the arithmetic mean of the two middle values.
Therefore the median is (7+8)/2 = 7.5.
• The third and final statistic for measuring the center of a random sample
is the mode.
• If X1, X2,……..,Xn, not necessarily different, represent a random sample of
size n, then the sample mode M is that value of sample that occurs more
often or with greatest frequency. The mode may not exist, and when it
does it is not necessarily unique.
• Example
• The mode of the random sample whose observations are
2,4,4,5,6,6,6,7,7, and 8 is 6.
• Example
• The observations 3,4,4,4,4,7,7,8,8,8,8, and 9 have two modes, 4 and 8
occur with the greatest frequency. The distribution of the sample is said
to be bimodal.
• The relative merits of the three (mean, median, mode) are
• (1) The mean is the most common used measure of
central tendency in statistics.
• (2) The only real disadvantage of the mean is that it may
be affected adversely by extreme values.
• (3) The median has the advantage being easy to compute.
It is not influenced by extreme values.
• (4) The mode is the least used measure of the three. For
small set of data its value is almost useless, if in fact it exist
at all. Its only advantage is that it requires no calculation.
Variability Measurement
Example
Find the variance of the sample whose observations are 3,4,5,6,6, and7.
Solution
∑xi2 = 171 i=1 to 6, ∑xi = 31, i=1 to 6, n=6. Hence
S2 = ((6)(171) – (31)2)/(6)(5) = 13/6
.
Hence
P(Z˃2.646) = 1- P(Z˂2.646)
= 1 – 0.9959 = 0.0041
Sampling Distribution of (n-1)S2/σ2
• If S2 is the variance of a random sample of size n taken from a normal population having a
variance σ2, then the random variable
• χ2 = (n-1)S2/σ2
• has a chi-square distribution with ν degrees of freedom.
• The probability that a random sample produces a χ2 value greater than some specified
value is equal to the area under the curve to the right of this value.
• Example
• A manufacturer of car batteries guarantees that his batteries will last, on the average, 3
years with standard deviation of 1 year. If five of these batteries have lifetimes of
1.9,2.4,3.0,3.5, and 4.2 years, is the manufacture still convinced that his batteries have
standard deviation of 1 year.
• Solution
• S2 = ((5)(48.26)-(15)2)/((5)(4)) = 0.815
• Then χ2 = (4)(0.815)/(1) = 3.26
• from tables 95% of χ2 fall between 0.484 and 11.143, then the computed value is
reasonable.
t Distribution
• One of the most important distribution in applied statistics is the F Distribution. The
statistic F is defined to be the ratio of two independent chi-square random
variables, each divided by their degrees of freedom. Hence
• F = (U/ν1)/(V/ν2)
• where U and V are independent random variables having chi-square distributions
with ν1 and ν2 degrees of freedom, respectively.
• Let us define fα to be a particular value f of the random variable F above which we
find an area equal to α.
• Writing fα (ν1,ν2) for fα with ν1 and ν2 degrees of freedom we obtain
• f1-α(ν1,ν2) = 1/ fα (ν2,ν1)
• f0.95(6,10) = 1/ f0.05 (10,6)=1/4.06=0.246
• If S12 and S22 are the variances of independent random samples of size n 1 and n2
taken from normal populations with variances σ12 and σ22 , respectively, then
• F = (S12/ σ12)/( S22/ σ22) =( σ22S12)/( σ12 S22)