Professional Documents
Culture Documents
LEARNING UNIT 5:
When doing research, we often work with large sets of data. A data set is a collection of all
the scores that were collected during a specific study and which are normally presented in
table format. These data sets, at first glance, would not make any sense to us at all. In
learning unit 4 you learnt that we can summarise or simplify a set of data by displaying it in
visual form. The set of data presented below is based on information collected from the New
Stars activities during the first week. You can have a look at how the contestants have fared
in the first week. When you read their profile at the beginning you may already have taken a
liking or a dislike to one or two of them. The competition is on!
Scores for the preparation activities were allocated using a 10-point scale, with 1 being the
lowest and 10 being the highest. The scores that were obtained for the different activities in
the week are indicated in separate columns in the table below. The last column reflects the
total score obtained by adding the scores for all five activities.
Week 1: Preparation activities scores
Contestants
Acting classes
for stage
performance
Vocal
training
Image
makeover
Interview
skills
course
Fans &
media
training
Total scores
for Week 1
1. Vuyo K
38
2. Lebo
24
3. Sasha
32
4. Susan
31
5. David
26
6. Tshepo
28
7. Indresan
31
8. Shelley
37
9. Cameron
34
35
10. Kerry
79
IOP2601/MO001/4/2016
In this learning unit measures of central tendency will be discussed as alternative methods
for describing a data set.
We look at three measures of central tendency: the mean, the mode and the median. These
three measures all indicate the centre of a data set. The diagram below represents the three
measures graphically. As you proceed through the learning unit, you will be expected to fill in
the relevant blocks.
CENTRAL TENDENCY
5.1
Study the introduction to this tutorial on pages 40 to 41 in Tredoux and Durrheim (2013),
where a brief introduction to each measure of central tendency is provided.
80
IOP2601/MO001/4/2016
5.2
Study pages 41 to 43 in Tredoux and Durrheim (2013). Note how they define the term and
how the formula is used.
On page 43 the authors indicate that one of the limitations of using the mean is that it does
not account for the influence of any outlier scores. You can read their discussion of the
trimmed mean on pages 43 to 44. The trimmed mean compensates for the effect of any
outlier scores in the mean value of a data set.
Although you should take note of the trimmed mean, you are not required to know any detail
for examination purposes.
5.3
1.
2.
See if you can compute the means of the following data sets correctly:
(Please note: if you are familiar with the calculation of the mean you dont need to do
all six exercises, just do two and move on to the next question.)
2.1
2.2
2.3
2.4
2.5
2.6
3.
The scores for the acting class activity for stage performance for all 10
contestants
The scores for the vocal training activity for all 10 contestants
Susan's scores across all five activities
The scores for the interview skills course activity for all 10 contestants
The scores for the fans and media training activity for all 10 contestants
Shelley's scores across all five activities
What is the average score for the preparation activities for Week 1 for all 10
contestants?
81
IOP2601/MO001/4/2016
1.
The mean is the sum of all scores divided by the number of scores.
2.1
2.2
2.3
2.4
2.5
2.6
3.
31,6
From what you have learnt above, you should now understand the mean and be able to
x
x
define it
compute it
The other two measures of central tendency may be new to you, but with a little common
sense and reflection on how the two words are used in everyday language you should not
find the concepts too difficult.
5.4
Work through pages 45 to 46 of Tredoux and Durrheim (2013) and make sure that you know
the definition of the term and how to compute the median.
82
IOP2601/MO001/4/2016
5.5
1.
2.
........
Compute the median for the following data sets:
(Please note: if you are familiar with the calculation of the median you dont need to do
all six exercises, just do two and move on to the next question.)
2.1
2.2
2.3
2.4
2.5
2.6
The scores for the acting class activity for stage performance for all 10
contestants
The scores for the vocal training activity for all 10 contestants
Susans scores across all five activities
The scores for the interview skills course activity for all 10 contestants
The scores for the fans and media training activity for all 10 contestants
Shelleys scores across all five activities
3.
What is the median of the total scores of the preparation activities for Week 1 for all 10
contestants?
1.
The median is the score which falls at the exact centre of a distribution if the scores are
arranged in numerical order. This score need not necessarily be present in the original
data set.
2.1
Step 1:
83
IOP2601/MO001/4/2016
Step 2:
5; 6; 6; 7; 7; 8; 8; 8; 8; 9
Step 3:
7 + 8 / 2 = 7, 5 (If the median location falls between scores, add these scores
and divide the total by 2. This will give the median.)
2.2
2.3
2.4
2.5
2.6
6,5
6
6
5,5
8
3.
31,5
You may wonder why you should learn how to calculate another measure of central tendency
if you already know how to calculate the mean. The median differs somewhat from the mean
and can at times be used as an alternative to the mean.
Below is information about the difference between the median and the mean:
x
The main difference between the mean and the median is that the mean reflects the
value of each score in a distribution, whereas the median is the central point in a
distribution and does not take account of the specific values of many of the scores.
Take a good look at the following table:
Score
Mean
Median
12345
1 2 3 4 50
12
1 2 3 4 100
22
Only the last number in each set of scores differs. The mean reflects these differences,
the median does not. The specific values of the extreme scores influence the mean but
not the median.
In summary, if your data set contained a few extreme outlier data points, you could calculate
the median instead of the mean, so that your measure of central tendency would not be
affected by the outlier scores.
You have now worked through the second type of central tendency. You should be able to
x
x
84
IOP2601/MO001/4/2016
5.6
Study pages 46 to 47 in Tredoux and Durrheim (2013). Make sure you know what the term
mode means and how to compute it.
1.
2.
See if you can compute the mode for the following data sets correctly: (Please note: if
you are familiar with the calculation of the mode you dont need to do all six exercises,
just do two and move on to the next question.)
2.1
2.2
2.3
2.4
2.5
2.6
The scores for the acting class activity for stage performance for all 10
contestants
The scores for the vocal training activity for all 10 contestants
Susan's scores across all five activities
The scores for the interview skills course activity for all 10 contestants
The scores for the fans and media training activity for all 10 contestants
Shelley's scores across all five activities
3.
What is the mode for the total scores of the preparation activities for Week 1 for all 10
contestants?
1.
The mode is the score which occurs most often, that is, which has the highest
frequency.
2.1
2.2
2.3
2.4
2.5
2.6
3.
31
85
IOP2601/MO001/4/2016
5.7
Complete the diagram at the beginning of this learning unit by filling in the three measures of
central tendency in the appropriate squares.
Now that you have completed the diagram, have you considered what central tendency
actually means? Why are industrial psychologists (and other people) interested in measures
of central tendency? Of what use is it, and what are its advantages and disadvantages?
Consider the concept of central tendency. Now try to define central tendency on the basis of
the knowledge that you have acquired in this learning unit so far.
The following figures show the relative positions of the mode, the median and the mean for
different distributions:
86
IOP2601/MO001/4/2016
87