(MS184303)
• Summarize Data
◦ Central Tendency (frequency distribution)
o Mean
o Median
o Mode
◦ Variation (Measure of Dispersion)
o Range
o Variance
o Standard Deviation
o Quartile
The larger the variance, the further the individual scores are from the mean.
The smaller the variance, the closer the individual scores are to the mean.
Table headings: Total Deviation; Sum of Squared Deviations
Thus:
• All non-zero variances are positive numbers.
• A large variance indicates that the numbers in the set are far from the
mean and from each other, while a small variance indicates the opposite.
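As a small illustration of these two points, the sketch below compares two made-up data sets that share the same mean. The values and variable names are assumptions chosen for the example. The population form of the variance (dividing by the number of values) is also an assumption, since the text does not say which form it uses.

```python
from statistics import mean, pvariance

# Two hypothetical data sets with the same mean (10) but different spread.
tight = [9, 10, 10, 11]
wide = [2, 6, 14, 18]

tight_var = pvariance(tight)  # population variance: mean of squared deviations
wide_var = pvariance(wide)

print("tight:", mean(tight), tight_var)
print("wide: ", mean(wide), wide_var)

# A variance is an average of squared deviations, so it can never be negative.
# It is zero only when every value equals the mean.
constant_var = pvariance([5, 5, 5])
print("constant:", constant_var)
```

The set whose values sit further from the mean gives the larger variance, and the constant set gives a variance of zero.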
Table headings: Total Deviation; Sum of Squared Deviations; Variance
REVIEW:
Deviation → Deviation Squared → Sum of Squared Deviations → Variance →
Standard Deviation
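The review chain can be traced step by step in code. This is a minimal sketch with hypothetical data values. It shows both the population divisor (n) and the sample divisor (n - 1) because the text does not state which one it intends.

```python
import math

# Hypothetical data values, chosen only to illustrate the steps.
data = [2, 4, 4, 4, 5, 5, 7, 9]

n = len(data)
mean = sum(data) / n

deviations = [x - mean for x in data]       # step 1: deviation from the mean
squared = [d ** 2 for d in deviations]      # step 2: deviation squared
sum_of_squares = sum(squared)               # step 3: sum of squared deviations

population_variance = sum_of_squares / n    # step 4: variance (divide by n)
sample_variance = sum_of_squares / (n - 1)  #         or divide by n - 1

population_sd = math.sqrt(population_variance)  # step 5: standard deviation
sample_sd = math.sqrt(sample_variance)

print(sum(deviations), sum_of_squares, population_variance, population_sd)
```

The raw deviations sum to zero, which is why they are squared before being added up.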
Definition:
A response variable measures an outcome of a study.
An explanatory variable may help explain or influence
changes in a response variable.
The most useful graph for displaying the relationship between two
quantitative variables is a scatterplot.
Definition:
A scatterplot shows the relationship between two quantitative
variables measured on the same individuals. The values of one
variable appear on the horizontal axis, and the values of the
other variable appear on the vertical axis. Each individual in
the data appears as a point on the graph.
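The definition can be sketched in code as pairing two measurements per individual and drawing each pair as one point. The numbers below are made up for illustration and are not data from the slides. The helper names `scatter_points` and `draw_scatterplot` are hypothetical, and matplotlib is assumed as the plotting library. Body weight is treated as the explanatory variable and pack weight as the response, which is also an assumption.

```python
# Made-up (body weight, pack weight) values in pounds, for illustration only.
body_weight = [110, 125, 140, 160, 175]  # explanatory variable: horizontal axis
pack_weight = [24, 27, 28, 32, 33]       # response variable: vertical axis


def scatter_points(x_values, y_values):
    """Pair the two measurements taken on each individual into (x, y) points."""
    if len(x_values) != len(y_values):
        raise ValueError("each individual needs one x value and one y value")
    return list(zip(x_values, y_values))


def draw_scatterplot(points, x_label, y_label):
    """Draw one dot per individual with matplotlib.

    The import sits inside the function so the pairing step above still
    runs where matplotlib is not installed.
    """
    import matplotlib.pyplot as plt

    xs, ys = zip(*points)
    fig, ax = plt.subplots()
    ax.scatter(xs, ys)
    ax.set_xlabel(x_label)
    ax.set_ylabel(y_label)
    return fig


points = scatter_points(body_weight, pack_weight)
print(points)
```

Calling `draw_scatterplot(points, "Body weight (lb)", "Pack weight (lb)")` returns a figure that can be shown or saved.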
Outlier
There is one possible outlier: the hiker with a body weight of 187 pounds
seems to be carrying relatively less weight than the other group members.
Stem-and-Leaf Plot
Test Scores
Stems | Leaves
7 | 5 9
8 | 3 4 6 6 8
9 | 1 4 9
Key: 4 | 0 means 40
Use the data values to find the mean: (40 + … + 94) ÷ 23 = 64.
To find the mode, look for the number that occurs most often in a row of
leaves, then identify its stem. The mode is 63.
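The same reading of stems and leaves can be sketched in code. The example uses the ten Test Scores readable from the plot with stems 7 to 9. It does not use the 23-value data set behind the mean and mode quoted above, because that data set is not reproduced in the text. The helper name `stem_and_leaf` is hypothetical.

```python
from collections import defaultdict
from statistics import mean, median, multimode

# The ten test scores readable from the plot (stems 7, 8 and 9).
scores = [75, 79, 83, 84, 86, 86, 88, 91, 94, 99]


def stem_and_leaf(values):
    """Group each value's last digit (leaf) under its leading digits (stem)."""
    plot = defaultdict(list)
    for value in sorted(values):
        stem, leaf = divmod(value, 10)
        plot[stem].append(leaf)
    return dict(plot)


plot = stem_and_leaf(scores)
for stem, leaves in plot.items():
    print(stem, "|", " ".join(str(leaf) for leaf in leaves))

score_mean = mean(scores)        # sum of the values divided by their count
score_median = median(scores)    # middle of the ordered values
score_modes = multimode(scores)  # most frequent value(s), as a list
print(score_mean, score_median, score_modes)
```

In the printed plot, the repeated leaf 6 on stem 8 marks the mode of these ten scores, matching the procedure described above.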
Line Plot, Relative Frequency Distribution, Histogram
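Box Plots
A box plot marks five values along a number line: lowest value, lower
quartile, median, upper quartile, and highest value. The box runs from the
lower quartile to the upper quartile, and a whisker extends from each end of
the box to the lowest and highest values.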
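The five values a box plot marks (lowest value, lower quartile, median, upper quartile, highest value) can be computed as in the sketch below. The data values and the helper name `five_number_summary` are hypothetical. Quartile conventions also differ between textbooks and software, so the default method of Python's `statistics.quantiles` is only one possible choice and may not match the slides.

```python
from statistics import quantiles

# Hypothetical data values, chosen only for illustration.
data = [4, 5, 6, 6, 7, 8, 8, 9, 10, 11, 12]


def five_number_summary(values):
    """Return the five values that a box plot marks."""
    ordered = sorted(values)
    # n=4 splits the data into quarters and returns the three cut points.
    lower_q, mid, upper_q = quantiles(ordered, n=4)
    return {
        "lowest": ordered[0],        # end of the left whisker
        "lower_quartile": lower_q,   # left edge of the box
        "median": mid,               # line inside the box
        "upper_quartile": upper_q,   # right edge of the box
        "highest": ordered[-1],      # end of the right whisker
    }


summary = five_number_summary(data)
print(summary)
```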
(a) Find the sample mean and sample median of the power-failure times.
(b) Find the sample standard deviation of the power-failure times.
(b) From the plot in (a), does it appear as if a relationship exists between wear and load?
(c) Suppose we look at the individual wear values for each of the four specimens at each load level
(see the data that follow). Plot the wear results for all specimens against the three load values.
(d) From your plot in (c), does it appear as if a clear relationship exists? If your answer is different
from that in (b), explain why.