Professional Documents
Culture Documents
2) Basic Symbols
What do all these symbols mean!
Symbols are used in many formulas that are needed to calculate different statistics.
These are some of the basics:
= Sum
x = each variable score
SS = Sum of squared deviations
sqrt = Square root
df = degrees of freedom
Symbols used to describe a Population
-- = mean
-- = standard Deviation
--= variance
--N = Total number of population scores
Symbols used to describe a Sample
-- M = mean
-- s = Standard Deviation
-- s = variance
-- n = Total number of sample scores
Normal Distribution
This type of distribution is seen when the variables are clustered together with gradual decrease on
either side of the distribution.
This type of distribution is used often in calculations assuming a normal population distribution. I will
discuss this type of distribution later in further detail.
It is also called a Gaussian Curve or Bell Curve.
Negative Skew
A negative skew is when the variables in a distribution are clustered together with a few outliers
which change the distribution. (The tail of the graph points to the negative end)
outliers: These are variables that fall outside the normal trends for the distribution.
(ex. Lets say the variable for the graph above is shoe size and most of the data falls within sizes 7 to
10 but if a few individuals had a shoe size of 4. That would skew the distribution negatively.)
tip: greater than 50% of the scores are above the mean
Postive Skew
A positive skew is when the variables in a distribution are clustered together with a few outliers that
change the distribution positively.(The tail of the graph points to the positive end)
(ex. In this case given the same information as the previous example the outlier would have a shoe
size of 13 instead of 4. Making the distribution positively skewed.)
tip: greater than 50% of the scores are below the mean
Central Tendency: A measurement that uses only one score to describe a distribution of scores.
These are a few ways to measure central tendency:
-Mean ( or M): The average (sum of scores/# of scores)
-------Ex. (5,4,3) 5+4+3= 12, 12/3=4, Mean = 4
-Median:The score which divides all scores in half when put into ascending order.
7) Understanding Variance
Variability: A measurement to show the degree to which the the scores or data are spread out or
clustered in a distribution.
---A good way to describe a distribution in terms of distance (ex. lets say that most adults are within
a foot (12") of 5'5" tall. Variability would show that distance for a normal height and would represent
the heights most likely for someone to fall into if they are a part of that population. There would be
people who are much taller such as basketball players and those that are much shorter but it is
much rarer and would be seen in this distribution as outlying values. )
In certain cases the values may be much closer to the mean or farther from the mean.
---If you look at weight verses height you may find that the range of values for weight will be much
larger than the range of the values for height. An adult could weight between *90lbs or 500lbs* while
the height of a person is much more limited *3'5" to 7'6"*
* these numbers are only used to represent a point not to show accurate representation of the actual
range of weights or height. *
8) Z test
So what is a Z test?
A "Z test" is a way to standardize each score in a distribution and then determine a relation between
all the scores. (A way to know how a certain score compares to the other scores)
You can use a Z test when:
-- estimating a population parameter
-- and there is only one sample group
-- and there is only one score per subject in the group
-- and is given or can be calculated
Once a Z test is calculated then:
all scores are between -4.00 and +4.00
=0
=1
A Z score:
1) replaces the original scores, mean and variance.
2) changes data to have a normal distribution
2) is + or - (+ is above the mean and - is below the mean)
3) is a number that represents distance from the mean (Z score mean = 0)
Calculating Z scores:
(X-)/
9) T Test
What is a T Test?
A "T Test" is very much like a "Z test" in that it standardizes scores in a distribution; However, in a "T
Test" instead of using the population variance() we use the sample variance (s)
You can use a T test when:
-- estimating a population parameter