You are on page 1of 22

STATISTICS FOR MANAGEMENT ASSIGNMENT - 1 MBA 1 st

Semester Subject Code: MB0040

1. (a) Statistics is the backbone of decision-making. Comment.(b) Statistics is as good as the user. Comment.Answer:(a) Statistics is the backbone of decision-making Due to advanced communication network, rapid changes in consumer behaviour, variedexpectations of variety of consumers and new market openings, modern managers have adifficult task of making quick and appropriate decisions. Therefore, there is a need for themto depend more upon quantitative techniques like mathematical models, statistics,operations research and econometrics.Decision making is a key part of our day-to-day life. Even when we wish to purchase atelevision, we like to know the price, quality, durability, and maintainability of variousbrands and models before buying one. As you can see, in this scenario we are collectingdata and making an optimum decision. In other words, we are using Statistics.Again, suppose a company wishes to introduce a new product, it has to collect data onmarket potential, consumer likings, availability of raw materials, feasibility of producingthe product. Hence, data collection is the back-bone of any decision making process.Many organisations find themselves data-rich but poor in drawing information from it.Therefore, it is important to develop the ability to extract meaningful information from rawdata to make better decisions. Statistics play an important role in this aspect.Statistics is broadly divided into two main categories. The two categories of Statistics aredescriptive statistics and inferential statistics.

Descriptive Statistics : Descriptive statistics is used to present the general description of data which is summarised quantitatively. This is mostly useful in clinical research, whencommunicating the results of experiments.

Inferential Statistics : Inferential statistics is used to make valid inferences from the datawhich are helpful in effective decision making for managers or professionals.Statistical methods such as estimation, prediction and hypothesis testing belong toinferential statistics. The researchers make deductions or conclusions from the collecteddata samples regarding the characteristics of large population from which the samples aretaken. So, we can say Statistics is the backbone of decision-making. (b) Statistics is as good as the user: Statistics is used for various purposes. It is used to simplify mass data and to makecomparisons easier. It is also used to bring out trends and tendencies in the data as well asthe hidden relations between variables. All this helps to make decision making much easier.Let us look at each function of Statistics in detail. 1. Statistics simplifies mass data The use of statistical concepts helps in simplification of complex data. Using statisticalconcepts, the managers can make decisions more easily. The statistical methods help inreducing the complexity of the data and consequently in the understanding of any hugemass of data.2. Statistics makes comparison easier Without using statistical methods and concepts, collection of data and comparison cannotbe done easily. Statistics helps us to compare data collected from different sources. Grandtotals, measures of central tendency, measures of dispersion, graphs and diagrams,coefficient of correlation all provide ample scopes for comparison.3. Statistics brings out trends and tendencies in the data After data is collected, it is easy to analyse the trend and tendencies in the data by usingthe various concepts of Statistics.4. Statistics brings out the hidden relations between variables Statistical analysis helps in drawing inferences on data. Statistical analysis brings out thehidden relations between variables.5. Decision making power becomes easier

With the proper application of Statistics and statistical software packages on the collecteddata, managers can take effective decisions, which can increase the profits in a business.Seeing all these functionality we can say Statistics is as good as the user 2. Distinguish between the following with example.(a) Inclusive and Exclusive limits. (b) Continuous and discrete data.(c) Qualitative and Quantitative data(d) Class limits and class intervals.Answer :a) Inclusive and Exclusive limits. Inclusive and exclusive limits are relevant from data tabulation and class intervals point of view.Inclusive series is the one which doesn't consider the upper limit, for example,00-1010-2020-3030-4040-50In the first one (00-10), we will consider numbers from 00 to 9.99 only. And 10 will beconsidered in 10-20. So this is known as inclusive series.Exclusive series is the one which has both the limits included, for example,00-0910-1920-2930-3940-49Here, both 00 and 09 will come under the first one (00-09). And 10 will come under thenext one. b)

Continuous and discrete data. All data that are the result of counting are called quantitative discrete data. These datatake on only certain numerical values. If you count the number of phone calls you receivefor each day of the week, you might get 0, 1, 2, 3, etc.All data that are the result of measuring are quantitative continuous data assuming thatwe can measure accurately. Measuring angles in radians might result in the numbers /6,/3, /2, /, 3/4, etc. If you and your friends carry backpacks with books in them toschool, the numbers of books in the backpacks are discrete data and the weights of thebackpacks are continuous data. c) Qualitative and Quantitative data Data may come from a population or from a sample. Small letters like x or y generallyare used to represent data values. Most data can be put into the following categories: Qualitative Quantitative ualitative data

Qualitative data are the result of categorizing or describing attributes of a population. Hair color, blood type, ethnic group, the car a person drives, and the street a person lives on areexamples of qualitative data. Qualitative data are generally described by words or letters. For instance, hair color might be black, dark brown, light brown, blonde, gray, or red. Blood typemight be AB+, O-, or B+. Qualitative data are not as widely used as quantitative data becausemany numerical techniques do not apply to the qualitative data. For example, it does notmake sense to find an average hair color or blood type. Quantitative data Quantitative data are always numbers and are usually the data of choice because there aremany methods available for analyzing the data. Quantitative data are the result of counting or measuring attributes of a population. Amount of money, pulse rate, weight, number of peopleliving in your town, and the number of students who take statistics are examples of quantitative data. Quantitative data may be either discrete or continuous.All data that are the result of counting are called quantitative discrete data. These data takeon only certain numerical values. If you count the number of phone calls you receive for eachday of the week, you might get 0, 1, 2, 3, etc.Example 2: Data Sample of Quantitative Continuous DataThe data are the weights of the backpacks with the books in it. You sample the same fivestudents. The weights (in pounds) of their backpacks are 6.2, 7, 6.8, 9.1, 4.3. Notice thatbackpacks carrying three books can have different weights. Weights are quantitativecontinuous data because weights are measured.Example 3: Data Sample of Qualitative DataThe data are the colors of backpacks. Again, you sample the same five students. One studenthas a red backpack, two students have black backpacks, one student has a green backpack,and one student has a gray backpack. The colors red, black, black, green, and gray arequalitative data. Inclusive and Exclusive Class Intervals Inclusive Class Interval :When the lower and the upper class limit is included, then it is an inclusive class interval .For example - 220 - 234, 235 - 249 ..... etc. are inclusive type of class intervals. Usually inthe case of discrete variate, inclusive type of class intervals are used. Exclusive Class Interval :When the lower limit is included, but the upper limit is excluded, then it is an exclusiveclass interval

. For example - 150 - 153, 153 - 156.....etc are exclusive type of classintervals. In the class interval 150 - 153, 150 is included but 153 is excluded.

Statistics For Management Usually in the case of continuous variate, exclusive type of class intervals are used.Consider the frequency table shown below

. In a management class of 100 students three languages are offered as an additionalsubject viz. Hindi, English and Kannada. There are 28 students taking Kannada, 26taking Hindi and 16 taking English. There are 12 students taking both Kannada andEnglish, 4 taking Hindi and English and 6 that are taking Hindi and Kannada. Inaddition, we know that 2 students are taking all the three languages. i) If a student is chosen randomly, what is the probability that he/she is not taking any of these three languages?ii) If a student is chosen randomly, what is the probability that he/ she is taking exactly onelanguage?Let students taking Kannada as language be S(K) = 28Let students taking Hindi as language be S(H) = 28Let students taking English as language be S(E) = 28Let students taking Kannada and English be S (K E ) = 12Let students

taking Hindi and English be S ( H E ) = 4Let students taking Hindi and Kannada be S ( H K ) = 6Let students taking all the three subjects be S (K H E ) = 2If a student is chosen randomly, probability that he/she is not taking any of these three languagesis P ( not taking any language)= [ 1 { S(K)+ S(H)+S(E)+ S (K E )+ S ( H E )+ S ( H K )+ S (K H E} / 100 ]= [1 {28+26+26+6+4+12+2} / 100]= 1 (94 / 100)= 1 0.94 = 0.06If a student is chosen randomly, probability that he/ she is taking exactly one language isP (taking exactly one language) = [ { S(K)+ S(H)+S(E) } ] / 100= [28+26+16] = 70= 70= {70/100}= 0.7 4. List down various measures of central tendency and explain the difference betweenthem? Measures of Central Tendency Several different measures of central tendency are defined below. 1 Arithmetic Mean The arithmetic mean is the most common measure of central tendency. It simply the sum of then u m b e r s d i v i d e d b y t h e n u m b e r o f numbers. The symbol m is used for the mean of a population. The symbol M is used for the mean of a sample. T h e f o r m u l a f o r m i s s h o w n below:Where X is the sum of all the numbers in the numbers in the sample and N is the number of numbers in the sample. As an example, the mean of the numbers 1 + 2 + 3 + 6 + 8 = 20/5 = 4regardless of whether the numbers constitute the entire population or just a sample from thepopulation.The table, Number of touchdown passes (Table 1: Number of touchdown passes), shows thenumber of touchdown (TD) passes thrown by each of the 31 teams in the National FootballLeague in the 2000 season.The mean number of touchdown passes thrown is 20.4516 as shown below.Number of touchdown passesAlthough the arithmetic mean is not the only "mean" (there is also a geometric mean), it is byfar the most commonly used. Therefore, if the term "mean" is used without specifying whether it is the arithmetic mean, the geometric mean, or some other mean, it is assumed to refer to thearithmetic mean. 2 Median The median is also a frequently used measure of central tendency. The median is the midpointof a distribution: the same number of scores are above the median as below it. For the data inthe table, Number of touchdown passes (Table 1: Number of touchdown passes), there are 31scores. The 16th highest score (which equals 20) is the median because there are 15 scoresbelow the 16th score and 15 scores above the 16th score. The median can also be thought of as

the 50th percentile3. Let's return to the made up example of the quiz on which you made athree discussed previously in the module Introduction to Central Tendency4 and shown inTable 2: Three possible datasets for the 5-point make-up quiz.Three possible datasets for the 5-point make-up quizFor Dataset 1, the median is three, the same as your score. For Dataset 2, the median is 4.Therefore, your score is below the median. This means you are in the lower half of the class.Finally for Dataset 3, the median is 2. For this dataset, your score is above

the median andtherefore in the upper half of the distribution.Computation of the Median: When there is an odd number of numbers, the median is simplythe middle number. For example, the median of 2, 4, and 7 is 4. When there is an even number o f n u m b e r s , t h e m e d i a n i s t h e m e a n o f t h e t w o m i d d l e n u m b e r s . T h u s , t h e m e d i a n o f t h e numbers 2, 4, 7, 12 is 4+7/2 = 5:5. 3 mode The mode is the most frequently occuring value. For the d a t a i n t h e t a b l e , N u m b e r o f touchdown passes (Table 1: Number of touchdown passes), the mode is 18 since more teams(4) had 18 touchdown passes than any other number of touchdown passes. With continuousdata such as response time measured to many decimals, the frequency of each value is ones i n c e n o t w o scores will be exactly the same (see discussion of continuous v a r i a b l e s 5 ) . T h e r e f o r e t h e m o d e o f c o n t i n u o u s d a t a i s n o r ma l l y c o m p u t e d f r o m a g r o u p e d f r e q u e n c y distribution. The Grouped frequency distribution (Table 3: Grouped frequency distribution)t a b l e shows a grouped frequency distribution for the target response t i m e d a t a . S i n c e t h e interval with the highest frequency is 600-700, the mode is the middle of that interval (650).Grouped frequency distribution

Proportions and Percentages When the focus is on the degree to which a population possesses a particular attribute, themeasure of interest is a percentage or a proportion. A proportion

refers to the fraction of the total that possesses a certain attribute. For example, we might ask what proportion of women in our sample weigh less than 135pounds. Since 3 women weigh less than 135 pounds, the proportion would be 3/5 or 0.60. A percentage is another way of expressing a proportion. A percentage is equal to theproportion times 100. In our example of the five women, the percent of the total whoweigh less than 135 pounds would be 100 * (3/5) or 60 percent. Notation Of the various measures, the mean and the proportion are most important. The notation used todescribe these measures appears below: X: Refers to a population mean. x: Refers to a sample mean. P: The proportion of elements in the population that has a particular attribute. p: The proportion of elements in the sample that has a particular attribute. Q: The proportion of elements in the population that does not have a specified attribute.Note that Q = 1 - P. q: The proportion of elements in the

sample that does not have a specified attribute. Notethat q = 1 - p.

5. Define population and sampling unit for selecting a random sample in each of thefollowing cases. a) Hundred voters from a constituencyb) Twenty stocks of National Stock Exchangec) Fifty account holders of State Bank of Indiad) Twenty employees of Tata motors.Statistical survey or enquiries deal with studying various characteristics of unit belonging to agroup. The group consisting of all the units is called Universe or PopulationSample is a finite subset of a population. A sample is drawn from a population to estimate thecharacteristics of the population. Sampling is a tool which enables us to draw conclusions aboutthe characteristics of the population.I n s a m p l i n g t h e r e a r e t w o t y p e s n a m e l y d i s c r e t e a n d t h e o t h e r i s t h e c o n t i n u o u s . D i s c r e t e sampling is that the data given are of the finite and their calculations are made easy. Continuoussampling is one where the data are of infinite form. Its intervals are indicated by <, >, greater than but lesser than, lesser than and greater than..The finite number of items in a sample is size. in practice samples greater than 30 are largesamples and if less it is small samples.A measure associated with the entire population is called as population parameter. Or just anparameter.Given a population, suppose we consider all possible samples of a certain size N that can bed r a w n from the population. For each sample suppose we compute a statistic such as m e a n , standard deviation etc. These sample vary from sample to sample. We group these differentstatistics according to their frequencies which is called as frequency distribution to formed soc a l l e d a s s a m p l i n g d i s t r i b u t i o n . , s t a n d a r d d e v i a t i o n o f a s a m p l i n g d i s t r i b u t i o n i s c a l l e d i t s standard error.Suppose we draw all possible samples of a certain size N from a population and find the meanof X bar of each of these samples. Frequency distribution of these means is called as samplingdistribution of means. If the population is infinite , then , , be the standard deviation andmean respectively then the standard deviation denoted by is given by= / sqrt of Nis used to calculate the standard normal variate for the population where its size is more than 30.

. What is a confidence interval, and why it is useful? What is a confidence level? Under a given hypothesis H the sampling distribution of a

statistic S is a normal distribution withthe mean and the standard deviation then Z = is the standard normalvariate associated with S so that for the

distribution of z the mean is zero and the standarddeviation is 1. Accordingly for z the Z% confidence level is ( -z c , zc) this means that we can

beZ% confident that if the hypothesis H is true than the value of z lie between zc and zc. This isequivalent saying that when H is true there is (100 Z )

%chance that the value of z lies outsidethe interval (-zc . zc)if we reject a true hypothesis H on the grounds that the value of z liesoutside the

interval (-z, zc) we would be making a type 1 error and the probability of making thiserror is (100-Z)% the level of significance.Confi dence level is very

much useful as we can predict any assumptions can be made so that itwill not lead us to the wrong way even if it doesnt be so great. As explained the

confidence levelis between zc to z and the peak is at 100% which is the best.In some cases we predict but do not consider it , and sometimes we

will not predict buthypothesis need it so this is called as the TYPE 1 errors and TYPE 2 errors.According to the levels of the Z the confidence is assured.. in the

above the field shaded portionis the critical region

You might also like