Math Statistic

Finish Line & Beyond
STATISTICS
Mean of Grouped Data
The mean (or average) of observations, is the sum of the values of all the
observations divided by the total number of observations.
If x1, x2,. . ., xn are observations with respective frequencies f1, f2, . . ., fn, then
this means observation x1 occurs f1 times, x2 occurs f2 times, and so on.
ow, the sum of the values of all the observations ! f1x1 + f2x2 + . . . + fnxn,
and sum of the number of observations ! f1 + f2 + . . . + fn.
"o, the mean x of the data is given by
x
!
fn . . . f2 f1
fnxn . . . f2x2 f1x1
+ + +
+ + +
x
!
=
=
n
i
n
i
fi
fixi
1
1
Example The mar#s obtained by $% students of &lass ' of a certain school in a
(athematics paper, consisting of 1%% mar#s are presented in table below. )ind the
mean of the mar#s obtained by the students.
Mar!s "#tained
Xi
1% *% $+ ,% -% -+ +% .% .* /% // 0* 0-
$um#er of Students
fi
1 1 $ , $ * , , 1 1 * $ 1
Solution To find the mean mar#s, we require the product of each xi with
the corresponding frequency fi.
Mar!s "#tained
xi
$um#er of Students
fi
fixi
1% 1 1%
*% 1 *%
$+ $ 1%/
,% , 1+%
-% $ 1-%
-+ * 11*
+% , *,%
.% , */%
.* 1 .*
/% 1 /%
// * 1.+
0* $ *.+
0- 1 0-
Total

fi
!$%

fixi
!1..0
www.excellup.com %&''( send your )ueries to en)uiry*ex+ellup,+om
x
-
fi
fixi
-
30
1779
- -0.$
Dire+t Method of Findin. Mean
In most of our real life situations, data is usually so large that to ma#e a meaningful
study it needs to be condensed as grouped data. "o, we need to convert given
ungrouped data into grouped data and devise some method to find its mean.
1et us convert the ungrouped data of 2xample 1 into grouped data by forming
class3intervals of width, say 1-. 4emember that, while allocating frequencies to each
class3interval, students falling in any upper class3limit would be considered in the
next class, e.g., , students who have obtained ,% mar#s would be considered in the
classinterval ,%3-- and not in *-3,%. 5ith this convention in our mind, let us form a
grouped frequency distribution table.
Class Inter/als 1% 6 *- *- 6 ,% ,% 6 -- -- 6 .% .% 6 /- /- 6 1%%
$um#er of Students * $ . + + +
ow, for each class3interval, we require a point which would serve as the
representative of the whole class. It is assumed that the frequency of each class3
interval is centred around its mid3point. "o the mid3point (or class mar#) of each
class can be chosen to represent the observations falling in the class.
Class mar! -
2
limit class Lower limit class Upper +

Class Inter/al $um#er of Students
0fi)
Class Mar!
0xi)
Fixi
1% 3*- * 1..- $-.%
*- 6 ,% $ $*.- 0..-
,% 6 -- . ,..- $$*.-
-- 6 .% + +*.- $.-.%
.% 6 /- + ...- ,+-.%
/- 6 1%% + 0*.- ---.%
Total fi= 30 fixi ! 1/+%.%
x
!
fi
fixi
!
30
1860
! +* (ean
This method of finding the mean is #nown as the Dire+t Method.
7ere, -0.$ is the exact mean,
while +* is an approximate mean.
Assumed Mean Method
"ometimes when the numerical values of xi and fi are large, finding the product
of xi and fi becomes tedious and time consuming. "o, for such situations, let us thin#
of a method of reducing these calculations.
othing can be done with the fi8s, but each xi can be changed to a smaller number
to ma#e easier calculations.
The first step is to choose one among the xi8s as the assumed mean, and denote
it by 9a8. :lso, to further reduce our calculation wor#, we may ta#e 9a8 to be that xi
which lies in the centre of x1, x*, . . ., xn. "o, we can choose a ! ,..- or a ! +*.-.
1et us choose a - ,..-.
The next step is to find the difference di between a and each of the xi8s, that is,
the de/iation of 9a8 from each of the xi8s.
i.e., di ! xi 6 a ! xi 6 ,..-
The third step is to find the product of di with the corresponding fi, and ta#e the sum
of all the fi di8s.
Class
Inter/al
$um#er of
Students (fi)
Class Mar!
(xi)
di = xi 6 ,..- fidi
1% 6 *- * 1..- 3$% 3+%
*- 6 ,% $ $*.- 31- 3,-
,% 3-- . ,..- % %
-- 6 .% + +*.- 1- 0%
.% 6 /- + ...- $% 1/%
/- 6 1%% + 0*.- ,- *.%
Total

fi
! $%

fidi
! ,$-
"o, the mean of ;eviations
d
!
fi
fidi
!
30
435
! 1,.-
"ince d is obtained by subtracting a from xi so x can be obtaines as follows<
x
!
d
= a
! 1,.- = ,..- ! +*
This method is called :ssumed (ean (ethod
Step De/iation Method of Findin. Mean
In the above method all the values of di are multiples of 1- which is equivalent to
class si>e. "o to ma#e calculations more simple let us divide all the values of di by
1- to arrive at smaller numbers for fidi
Class
Inter/al
$um#er of
Students (fi)
Class
Mar! (xi)
di = xi 6
,..-
ui=
h
a xi fiui
1% 6 *- * 1..- 3$% 3* 3,
*- 6 ,% $ $*.- 31- 31 3$
,% 3-- . ,..- % % %
-- 6 .% + +*.- 1- 1 +
.% 6 /- + ...- $% * 1*
/- 6 1%% + 0*.- ,- $ 1/
Total

fi
! $%

fiui
!*0
u
!
fi
fiui
!
30
29
x
!a=h
u
! ,..- = 1-
30
29
! +*
(Since u was arrived at after dividing d by the class size so a+d =a+hu)
This method is called "tep ;eviation (ethod.
Important to ote<
The step3deviation method will be convenient to apply if all the di8s have a
common factor.
The mean obtained by all the three methods is the same.
The assumed mean method and step3deviation method are ?ust simplified
forms of the direct method.
Mode of Grouped Data
: mode is that value among the observations which occurs most often, that is, the
value of the observation having the maximum frequency. It is possible that more
than one value may have the same maximum frequency. In such situations, the data
is said to be multimodal.
(ode ! =

2 0 1 2
0 1
f f f
f f
h
where ! lower limit of the modal class,
h ! si>e of the class interval (assuming all class si>es to be equal),
f1 ! frequency of the modal class,
f0 ! frequency of the class preceding the modal class,
f2 ! frequency of the class succeeding the modal class.
1et us calculate the mode from the above table
-,%
h ! 1-
f1 ! .
f0 ! $
f2 ! +
"o, (ode ! ,%=
6 3 14
3 7
1- ! -*
Median of Grouped Data
(edian is a measure of central tendency which gives the value of the middle3most
observation in the data. )irst step to find the median is to group data in ascending
order.
If the middle observation is at odd place then median !
2
1 + n

If the middle observation is at the even place then median !
2
1
2 2
+ +
n n
That is, it is the average of the middle and successor of middle observations.
1et us ta#e data from the first table<
(ar#s @btained )requency &umulative frequency
1% 1 1
*% 1 *
$+ $ -
,% , 0
-% $ 1*
-+ * 1,
+% , 1/
.% , **
.* 1 *$
/% 1 *,
// * *+
0* $ *0
0- 1 $%
7ere number of @bservations is $%
"o, (edian will be average of the 1-
th
and the 1+
th
observations.
1-
th
@bservation is +%
1+
th
@bservation is .%
:verage ! +- (edian
The mean is the most frequently used measure of central tendency because it
ta#es into account all the observations, and lies between the extremes, i.e., the
largest and the smallest observations of the entire data. It also enables us to
compare two or more distributions.
7owever, extreme values in the data affect the mean.
In problems where individual observations are not important, and we wish to find
out a 9typical8 observation, the median is more appropriate, e.g., finding the typical
productivity rate of wor#ers, average wage in a country, etc. These are situations
where extreme values may be there. "o, rather than the mean, we ta#e the median
as a better measure of central tendency.
In situations which require establishing the most frequent value or most popular
item, the mode is the best choice, e.g., to find the most popular T.A. programme
being watched.
2emar!s
1. There is a empirical relationship between the three measures of central tendency <
3 Median - Mode 4 & Mean
may not be the same on both the axes.

Math Statistic

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Math Statistic

Uploaded by

Copyright:

Available Formats

Finish Line & Beyond

You might also like