Professional Documents
Culture Documents
Nagarajan Krishnamurthy
Introduction to Business Statistics for EPGP 2015-16 batch
Indian Institute of Management Indore
Types of Data
Quantitative Data: Data for which arithmetic operations
makes sense. E.g.: Age, Salary, Length.
Visualization
Interpreting a Histogram
Notation: x
Mean: Example
Notations: x
Median: Example 1
Median: Example 2
Mode: Example
Effect of an Outlier
Effect of an Outlier
Effect of an Outlier
Outlier pulls mean towards it but may not affect median and
mode.
Kurtosis
Leptokurtic
Platykurtic
Mesokurtic
Range: max-min
Range: max-min
Variance:
Pn
x )2
i=1 (xi
n1
Standard deviation:
q Pn
x)
i=1 (xi
n1
Standard Deviation
Ans. 43.38
Quartiles
Notation: Q1
Quartiles
Notation: Q3
Exercise
Find the first and third quartile of 8,7,1,4,6,6,4,5,7,6,3,0.
Exercise
Find the first and third quartile of 8,7,1,4,6,6,4,5,7,6,3,0.
Quartiles
Exercise
Exercise
Q3-Q1=6.5-3.5=3.
Minimum
First quartile
Median
Third quartile
Maximum
Boxplot
*We will create a box plot for the Cafe data set.
Shape:
*Bar Chart
*Pie Chart
Probability
Proportion sample
Probability population
Example
Workforce distribution in the United States.
Industry
Probability
Agriculture
0.130
Construction
0.147
Finance, Insurance, Real Estate 0.059
Manufacturing
0.042
Mining
0.002
Services
0.419
Trade
0.159
Transportation, Public Utilities 0.042
Sample Space
Simple Events
Event
Exercise
A bowl contains three red and two yellow balls. Two balls are
randomly selected and their colors recorded. Use a tree
diagram to list the 20 simple events in the experiment, keeping
in mind the order in which the balls are drawn.
Example
Workforce distribution in the United States.
Industry
Probability
Agriculture
0.130
Construction
0.147
Finance, Insurance, Real Estate 0.059
Manufacturing
0.042
Mining
0.002
Services
0.419
Trade
0.159
Transportation, Public Utilities 0.042
Probability
P(Agriculture)
Probability
P(Agriculture) = 0.13
P(Either Agriculture or Construction or both)
P(Agriculture Construction)
Probability
P(Agriculture) = 0.13
P(Either Agriculture or Construction or both)
P(Agriculture Construction) = 0.13+0.147=0.277.
P(Agriculture and Construction)
P(Agriculture Construction)
Probability
P(Agriculture) = 0.13
P(Either Agriculture or Construction or both)
P(Agriculture Construction) = 0.13+0.147=0.277.
P(Agriculture and Construction)
P(Agriculture Construction) =0.
P(Not in Agriculture) P(Agriculturec )
Probability
P(Agriculture) = 0.13
P(Either Agriculture or Construction or both)
P(Agriculture Construction) = 0.13+0.147=0.277.
P(Agriculture and Construction)
P(Agriculture Construction) =0.
P(Not in Agriculture) P(Agriculturec ) = 1-0.13=0.87.
Compound Events
If A and B are two events then
Union event is A B
Intersection event is A B
Complement event is Ac
S
A
AUB
Probability Rules
1
2
Mutually Exclusive
Mutually Exclusive
P(Tomato only)
P(Mushroom-Tomato)
P(Mushroom-Tomato) =2/8=1/4;
P(Mushroom-Tomato-Fish)
P(Mushroom-Tomato) =2/8=1/4;
P(Mushroom-Tomato-Fish)=1/8.
P(Mushroom-Fish)
P(Mushroom-Tomato) =2/8=1/4;
P(Mushroom-Tomato-Fish)=1/8.
P(Mushroom-Tomato) =2/8=1/4;
P(Mushroom-Tomato-Fish)=1/8.
Union Rule
Union Rule
Ans. 6/8=3/4
Intersection Rule
What is the probability that your slice will have tomato and
mushroom?
Intersection Rule
What is the probability that your slice will have tomato and
mushroom?
Ans. 3/8
Complement Rule
What is the probability that your slice will not have tomato?
Complement Rule
What is the probability that your slice will not have tomato?
Ans. 3/8
Conditional Probability
You have pulled out a slice of pizza that has tomato on it.
What is the probability that your slice will have mushrooms?
Ans. 3/5.
Conditional Probability
Notation: A|B
Multiplication rule
P(A B) = P(A)P(B|A)
P(A B) = P(B)P(A|B)
Statistical Independence
Statistical Independence
Exercise 1
Gender
Female Male
0.25
0.18
0.33
0.24
Statistical Independence
Exercise 2
A business group owns three five-star hotels (say, A, B, and C)
in India. By studying the past behavior of the revenue
obtained from the three hotels month by month, it has been
observed that the probability of increase in revenue of either B
or C or both of them is 0.5. If As revenue increases in a given
month, the probability of increase in Bs revenue is 0.7, the
probability of increase in Cs revenue is 0.6, and the probability
of increase in both B and Cs revenue is 0.5. However if As
revenue does not increase in a given month, the probability of
increase in Bs revenue is 0.2, the probability of increase in Cs
revenue is 0.3, and the probability of increase in both B and
Cs revenue is 0.1. What is the probability that the revenue of
all the three hotels, A, B, and C, increase in a given month?
Exercise 3
You are a physician. You think it is quite likely that one of your patients has strep
throat, but you are not sure. You take some swabs from the throat and send them to
a lab for testing. The test is (like nearly all lab tests) not perfect. If the patient has
strep throat, then 70% of the time the lab says YES but 30% of the time it says NO.
If the patient does not have strep throat, then 90% of the time the lab says NO but
10% of the time it says YES. You send five succesive swabs to the lab, from the same
patient. You get back these results, in order; YNYNY. What do you conclude?
These results are worthless.
It is likely that the patient does not have the strep throat.
It is slightly more likely than not, that patient does have the strep throat.
It is very much more likely than not, that patient does have the strep throat.
Bayes Rule
Exercise
Bibliography