Professional Documents
Culture Documents
12 March 2015
Overview
1
Objectives
Lecture Recap
Applying Techniques
Objectives
Objectives
Lecture Recap
Applying Techniques
3.
Lecture Recap
Objectives
1. Central Tendency
Lecture Recap
Applying Techniques
Lecture Recap
Objectives
1. Central Tendency
2. Quartiles and percentiles and
interquartile range,
3. Five-number summary and
box-plot
4. Variations and Shape
Lecture Recap
Applying Techniques
Sample Mean
The mean () is mathematically shown by:
sum of values
1 + 2 + +
=
=
=
number of observations
Where:
represents the mean
denotes the number of observations
represents the th observation of
=1
Lecture Recap
Objectives
1. Central Tendency
2. Quartiles and percentiles and
interquartile range,
3. Five-number summary and
box-plot
Lecture Recap
Applying Techniques
Sample Median
The median () is the middle value of an ordered array.
If is odd,
=
If is even,
=
1
2
+1
2
+1
2
Lecture Recap
Objectives
1. Central Tendency
2. Quartiles and percentiles and
interquartile range,
3. Five-number summary and
box-plot
4. Variations and Shape
Lecture Recap
Applying Techniques
Sample Mode
The mode is the observation that appears the most.
Lecture Recap
Objectives
1. Central Tendency
2. Quartiles and percentiles and
interquartile range,
3. Five-number summary and
box-plot
4. Variations and Shape
Lecture Recap
Applying Techniques
Lecture Recap
Objectives
1. Central Tendency
2. Quartiles and percentiles and
interquartile range,
3. Five-number summary and
box-plot
4. Variations and Shape
Lecture Recap
Applying Techniques
25%
25%
25%
Lecture Recap
Objectives
1. Central Tendency
2. Quartiles and percentiles and
interquartile range,
3. Five-number summary and
box-plot
4. Variations and Shape
Lecture Recap
Applying Techniques
Lecture Recap
Objectives
1. Central Tendency
2. Quartiles and percentiles and
interquartile range,
3. Five-number summary and
box-plot
4. Variations and Shape
Lecture Recap
Applying Techniques
+1
4
+1
ranked value
ranked value
2
3(+1)
4
ranked value
Lecture Recap
Objectives
1. Central Tendency
2. Quartiles and percentiles and
interquartile range,
3. Five-number summary and
box-plot
4. Variations and Shape
Lecture Recap
Applying Techniques
Interquartile Range
Measures the spread of data in the middle 50% of the data:
= 3 1
Benefit of this is that it is not influenced by extreme values or
outliers.
Lecture Recap
Objectives
1. Central Tendency
2. Quartiles and percentiles and
interquartile range,
Lecture Recap
Applying Techniques
Five-number Summary
Demonstrates the minimum, maxmimum and the three
quartiles:
, 1 , , 3 ,
25%
25%
25%
25%
Lecture Recap
Objectives
1. Central Tendency
2. Quartiles and percentiles and
interquartile range,
3. Five-number summary and
box-plot
4. Variations and Shape
Lecture Recap
Applying Techniques
Box-Plot
Graphical representation of the five-number summary.
Lecture Recap
Objectives
1. Central Tendency
2. Quartiles and percentiles and
interquartile range,
3. Five-number summary and
box-plot
4. Variations
Variation and Shape
Lecture Recap
Applying Techniques
Variation
Answers this question: How much does my data scatter
around the central value?
Common measures of variation:
Range
Variance
Standard Deviation
Interquartile Range
Lecture Recap
Objectives
1. Central Tendency
2. Quartiles and percentiles and
interquartile range,
3. Five-number summary and
box-plot
4. Variations and Shape
Lecture Recap
Applying Techniques
Range
Range is the difference between the largest and smallest
values in a data set.
=1
1
1
=1
Where:
represents the sample standard deviation.
2 represents the sample variance.
Lecture Recap
Objectives
1. Central Tendency
2. Quartiles and percentiles and
interquartile range,
Lecture Recap
Applying Techniques
Coefficient of Variation
Sometimes when you measure the variability of two sets of
data with different means, we can standardise the mean
Interpretation:
Higher represents greater variation.
Lower represents lesser variation.
Lecture Recap
Objectives
1. Central Tendency
2. Quartiles and percentiles and
interquartile range,
3. Five-number summary and
box-plot
4. Variations and Shape
Lecture Recap
Applying Techniques
Shape
It shows the pattern of distribution. Common measures:
Sample skewness
Sample Kurtosis
They are hard concepts that if you interpret incorrectly, you will
lose marks. All you need to know is how to differentiate these
shapes:
mean = median
Lecture Recap
Objectives
1. Central Tendency
2. Quartiles and percentiles and
interquartile range,
3. Five-number summary and
box-plot
Lecture Recap
Sample Covariance
Measures strength of linear relationship between two variables:
1
, =
1
=1
4. Variations
Variation and Shape
Applying Techniques
(, )
Lecture Recap
Objectives
1. Central Tendency
2. Quartiles and percentiles and
interquartile range,
3. Five-number summary and
box-plot
4. Variations
Variation and Shape
Lecture Recap
Applying Techniques
Z-Score
Transforms an observation into a standardised normal
distribution:
=
1. Exercise 1
2. Exercise 2
3. Exercise 3
Lecture Recap
Applying Techniques
Do you know:
How to find the mean, median, variance and standard
deviation using Excel and STATCRUNCH?
What is the significance of the mean and median being close
in value?
Do you know how to identify skewedness?
Remember:
Do not put in figures in your tables that you are not going to
discuss.
Keep decimal places consistent and reasonable. E.g. if you
are dealing with integers, do not exceed 2 decimal places.
1. Exercise 1
Lecture Recap
Do you know:
How to decide which summary measure to use?
2. Exercise 2
3. Exercise 3
Applying Techniques
Remember:
Think with common sense.
Exercise 1
Exercise 2
Exercise 3
Lecture Recap
(a)
(i) T
(ii) F
(iii) F
(b)
(i) F
(ii) T
(iii) T
(iv) T
(c)
F
Applying Techniques
Applying Techniques
Objectives
Exercise 4
Exercise 5
Exercise 6
Exercise 7
Exercise 8
Exercise 9
Lecture Recap
Applying Techniques
Question 12
The file SUV contains the overall miles per gallon (MPG) of
2013 small SUVs:
a) Compute the mean, median and mode
b) Compute the variance, standard deviation, range, coefficient
of variation, and Z scores
c) Are the data skewed? If so, how?
Applying Techniques
Objectives
Exercise 4
Exercise 5
Exercise 6
Exercise 7
Exercise 8
Exercise 9
Lecture Recap
Applying Techniques
Applying Techniques
Objectives
Exercise 4
Exercise 5
Exercise 6
Exercise 7
Exercise 8
Exercise 9
Lecture Recap
Applying Techniques
Question 15
Is there a difference in the variation of the yields of different
types of investments? The file CD Rate contains the yields for
one-year certificates of deposits (CDs) and five year (CDs) for 23
banks in the United States, as of March 20, 2013.
a) For one-year and five-year CDs, separately compute the
variance, standard deviation, range, and coefficient of
variation.
b) Based on the results of (a), do one-year CDs or five-year
CDs have more variation in the yields offered? Explain.
Applying Techniques
Objectives
Exercise 4
Exercise 5
Exercise 6
Exercise 7
Exercise 8
Exercise 9
Lecture Recap
Applying Techniques
Applying Techniques
Objectives
Exercise 4
Exercise 5
Exercise 6
Exercise 7
Exercise 8
Exercise 9
Lecture Recap
Applying Techniques
Question 48
College football is big business, with coaches total revenues in
millions of dollars. The file College Football contains pay and
revenues for college football in the 124 schools that are part of
the Division 1 Football Subdivision.
a) Compute the covariance
b) Compute the coefficient of correlation
c) Based on (a) and (b), what conclusions can you reach
regarding the relationship between coaches total pay and
revenue.?