Group Members:
Abdul Sami
12223009
Submission Date:
16-Jan-2014
Acknowledgment
We begin this project in the name of Allah, the Most Beneficent, who created us from a clot of blood and who gave us the ability, talent and potential to accomplish this task and to gain command of this subject. Without Allah's permission it would have been impossible, so first of all we thank Him for helping us complete this project.
Then we thank our parents, who pay for our education and provide for our needs, and our instructor, the worthy and respected Mr. Zulfiqar Ahmed, who gave us his complete attention during the whole semester and taught us his skills in Basic Statistics & Mathematics, which is our subject. He kept helping us throughout this project and gave us a chance to apply the subject in our daily life through the valuable support and encouragement he offered. His words of wisdom will always be remembered, and we are convinced that the knowledge of Basic Statistics & Mathematics he has given us will help us all through our professional careers.
This report cannot be attributed solely to any one person's effort; it is the joint effort of all group members.
We would also like to thank all our friends and everyone who helped us complete the project work.
Objectives
The objective of this project is to apply different statistical tools to 2 variables, one dependent and one independent. The variables we selected are as follows:
Dependent Variable
Electricity
Independent Variable
Production
The project examines whether the selected variables are well suited to showing the growth of performance in the selected areas, and how these variables are correlated with each other.
We also learned how to apply these statistical tools.
Table of Contents
List of Tables
History
Al Meraj Fan Industry
Breeze Fan Industry
Statistical Tools Applied
Table of Companies
Presentation of graph (Histogram)
Introduction of Variables
Production
Electricity
Calculation of statistical tools
Descriptive Analysis
Comparison
Box Whisker
Variance & Standard Deviation
Crux of Study
Recommendations
Conclusion
History:
List of Tables
Descriptive analysis
Electricity bill(Y)
Production (x)
Descriptive analysis
Mean
Median
Quartile
Decile
Percentile
Variance
Standard deviation
Box Whisker
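Class Interval (Electricity):
$h = \dfrac{\text{Maximum} - \text{Minimum}}{\text{No. of classes}} = \dfrac{155985 - 101213}{6} \approx 9129$
Class Interval (Production):
$h = \dfrac{\text{Maximum} - \text{Minimum}}{\text{No. of classes}} = \dfrac{1518 - 1007}{6} \approx 85$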
Table (Production)
Class Interval   Frequency (f)   x      C.b             fx            C.f
1007-1091        1               1049   1006.5-1091.5   1049          1
1092-1176        3               1134   1091.5-1176.5   3402          4
1177-1261        3               1219   1176.5-1261.5   3657          7
1262-1346        7 (f1)          1304   1261.5-1346.5   9128          14
1347-1431        9 (fm)          1389   1346.5-1431.5   12501         23
1432-1516        6 (f2)          1474   1431.5-1516.5   8844          29
1517-1601        1               1559   1516.5-1601.5   1559          30
Total            Σf = 30                                Σfx = 40140
Graphical Representation
Class Interval:
$h = \dfrac{\text{Maximum} - \text{Minimum}}{\text{No. of classes}} = \dfrac{200378 - 108745}{6} \approx 15272$
Table (Electricity)
Class Interval    Frequency (f)   x          fx          C.b                 C.f
108745-124016     4               116380.5   465522      108744.5-124016.5   4
124017-139288     5               131652.5   658262.5    124016.5-139288.5   9
139289-154560     5               146924.5   734622.5    139288.5-154560.5   14
154561-169832     4 (f1)          162196.5   648786      154560.5-169832.5   18
169833-185104     7 (fm)          177468.5   1242279.5   169832.5-185104.5   25
185105-200376     4 (f2)          192740.5   770962      185104.5-200376.5   29
200377-215648     1               208012.5   208012.5    200376.5-215648.5   30
Total             Σf = 30                    Σfx = 4728447
No. of classes = 1 + 3.33 log n
= 1 + 3.33 log(30)
= 5.92, approximately 6 classes.
Class Interval:
$h = \dfrac{\text{Maximum} - \text{Minimum}}{\text{No. of classes}} = \dfrac{2000 - 1202}{6} = 133$
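The class-count rule and the class interval can be checked with a few lines of Python. This is only a sketch: the function names are illustrative, and it assumes the logarithm is base 10 and that the interval is rounded to the nearest whole number, which reproduces the figures of 133 and 85 quoted in this report.

```python
import math


def number_of_classes(n: int) -> float:
    """Rule used in the report: k = 1 + 3.33 * log10(n)."""
    return 1 + 3.33 * math.log10(n)


def class_interval(maximum: float, minimum: float, k: int) -> int:
    """Class width: the range divided by the number of classes, rounded."""
    return round((maximum - minimum) / k)


k_raw = number_of_classes(30)   # 30 monthly observations
k = round(k_raw)                # rounded to a whole number of classes

# Production extremes taken from the monthly data tables of the two companies.
width_breeze_production = class_interval(2000, 1202, k)
width_almeraj_production = class_interval(1518, 1007, k)

print(round(k_raw, 2), k, width_breeze_production, width_almeraj_production)
```

The frequency tables themselves use 7 classes, because the last observation falls just beyond the sixth class.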
Table (Production)
Class Interval   Frequency (f)   x        fx        C.B             C.f
1202-1334        6               1268     7608      1201.5-1334.5   6
1335-1467        6               1401     8406      1334.5-1467.5   12
1468-1600        1 (f1)          1534     1534      1467.5-1600.5   13
1601-1733        7 (fm)          1667     11669     1600.5-1733.5   20
1734-1866        6 (f2)          1800     10800     1733.5-1866.5   26
1867-1999        3               1933     5799      1866.5-1999.5   29
2000-2133        1               2066.5   2066.5    1999.5-2133.5   30
Total            Σf = 30                  Σfx = 47882.5
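The frequency table above can be rebuilt from the raw monthly figures. The sketch below is illustrative only: the helper name `frequency_table` is hypothetical, the production values are the 30 monthly Breeze Fan figures from the report's data table, and the class limits are copied from the table above.

```python
# Breeze Fan monthly production, Nov-07 to Apr-10 (from the report's data table).
production = [
    1650, 1732, 1800, 1400, 1234, 1789, 2000, 1842, 1432, 1234,
    1690, 1329, 1880, 1790, 1349, 1321, 1202, 1301, 1423, 1967,
    1758, 1673, 1689, 1460, 1479, 1389, 1934, 1756, 1731, 1649,
]

# Class limits as printed in the table (both ends inclusive).
classes = [
    (1202, 1334), (1335, 1467), (1468, 1600), (1601, 1733),
    (1734, 1866), (1867, 1999), (2000, 2133),
]


def frequency_table(data, class_limits):
    """Return rows of (lower, upper, f, midpoint x, fx, cumulative frequency)."""
    rows = []
    cumulative = 0
    for lower, upper in class_limits:
        f = sum(lower <= value <= upper for value in data)
        cumulative += f
        midpoint = (lower + upper) / 2
        rows.append((lower, upper, f, midpoint, f * midpoint, cumulative))
    return rows


rows = frequency_table(production, classes)
for row in rows:
    print(row)
print("sum of fx =", sum(row[4] for row in rows))
```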
Graphical Representation
Introduction of Variables
The following variables are discussed in our study.
Dependent variable:
Electricity Bill:
We measure company performance with the help of the independent variable discussed below. This lets us judge whether the company is performing well or not.
Independent variable:
Production:
We measure how much the company has produced, since the electricity bill depends on production.
(Al Meraj Fan Industry)
Sr. No.   Month    Electricity Bill   Productions
1         Nov-07   139,210            1421
2         Dec-07   144,818            1454
3         Jan-08   155,431            1510
4         Feb-08   138,760            1357
5         Mar-08   129,171            1218
6         Apr-08   136,165            1365
7         May-08   132,815            1355
8         Jun-08   143,891            1389
9         Jul-08   121,876            1321
10        Aug-08   101,213            1007
11        Sep-08   121,239            1187
12        Oct-08   134,730            1368
13        Nov-08   129,377            1290
14        Dec-08   139,716            1367
15        Jan-09   131,810            1342
16        Feb-09   117,718            1132
17        Mar-09   121,916            1149
18        Apr-09   133,181            1345
19        May-09   144,718            1447
20        Jun-09   132,183            1343
21        Jul-09   147,516            1460
22        Aug-09   141,871            1453
23        Sep-09   149,169            1475
24        Oct-09   137,162            1413
25        Nov-09   138,718            1422
26        Dec-09   122,819            1267
27        Jan-10   123,391            1256
28        Feb-10   133,661            1331
29        Mar-10   112,188            1233
30        Apr-10   155,985            1518
(Breeze Fan Industry)
Sr. No.   Month    Electricity Bill   Productions
1         Nov-07   150,871            1650
2         Dec-07   163,391            1732
3         Jan-08   178,209            1800
4         Feb-08   145,372            1400
5         Mar-08   133,078            1234
6         Apr-08   177,879            1789
7         May-08   200,378            2000
8         Jun-08   186,984            1842
9         Jul-08   121,000            1432
10        Aug-08   116,899            1234
11        Sep-08   185,013            1690
12        Oct-08   126,865            1329
13        Nov-08   178,763            1880
14        Dec-08   169,469            1790
15        Jan-09   156,012            1349
16        Feb-09   132,831            1321
17        Mar-09   108,745            1202
18        Apr-09   122,672            1301
19        May-09   141,371            1423
20        Jun-09   190,121            1967
21        Jul-09   187,641            1758
22        Aug-09   167,131            1673
23        Sep-09   179,293            1689
24        Oct-09   132,219            1460
25        Nov-09   139,325            1479
26        Dec-09   129,290            1389
27        Jan-10   198,021            1934
28        Feb-10   178,325            1756
29        Mar-10   172,021            1731
30        Apr-10   152,349            1649
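Mean
(The mean, or average, of a list of numbers is the sum of the entire list divided by the number of items in the list.)
Mean of Production (Al Meraj Fan Industry):
$\bar{x} = \dfrac{\sum fx}{\sum f} = \dfrac{40140}{30} = 1338$
Mean of Production (Breeze Fan Industry):
$\bar{x} = \dfrac{\sum fx}{\sum f} = \dfrac{47882.5}{30} \approx 1596$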
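As a quick check of the grouped mean, the sketch below divides the sum of fx by the sum of f using the class midpoints and frequencies from the production frequency tables. The function name is illustrative, and where a frequency is not legible in a table it is assumed to be fx divided by x.

```python
def grouped_mean(midpoints, frequencies):
    """Mean of grouped data: sum(f * x) / sum(f)."""
    total_fx = sum(f * x for f, x in zip(frequencies, midpoints))
    total_f = sum(frequencies)
    return total_fx / total_f


# Al Meraj Fan production: class midpoints and frequencies.
almeraj_mean = grouped_mean(
    [1049, 1134, 1219, 1304, 1389, 1474, 1559],
    [1, 3, 3, 7, 9, 6, 1],
)

# Breeze Fan production: class midpoints and frequencies.
breeze_mean = grouped_mean(
    [1268, 1401, 1534, 1667, 1800, 1933, 2066.5],
    [6, 6, 1, 7, 6, 3, 1],
)

print(almeraj_mean, round(breeze_mean))
```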
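Median
(The median is the numeric value separating the higher half of a sample, a population, or a probability distribution from the lower half.)
$\text{Median} = l_b + \dfrac{h}{f}\left(\dfrac{n}{2} - c\right)$
where $l_b$ is the lower boundary of the median class, $h$ the class interval, $f$ the frequency of the median class, $n$ the number of observations and $c$ the cumulative frequency of the class before the median class.
Median of Electricity (Al Meraj Fan):
$= 128599.5 + \dfrac{h}{f}(15 - 8) = 134990.5$
Median of Production (Al Meraj Fan):
$= 1346.5 + \dfrac{h}{f}(15 - 14) = 1355.5$
Median of Electricity (Breeze Fan):
$= 154560.5 + \dfrac{15272}{4}(15 - 14) = 158378.5$
Median of Production (Breeze Fan):
$= 1600.5 + \dfrac{133}{7}(15 - 13) = 1638.5$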
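The grouped-median formula can be sketched in Python as below. The function and argument names are illustrative; the inputs are the lower class boundary, class interval, class frequency and preceding cumulative frequency read from the Breeze Fan frequency tables.

```python
def grouped_median(lower_boundary, class_width, class_frequency,
                   cumulative_before, n):
    """Median = lb + (h / f) * (n / 2 - c) for a grouped frequency table."""
    position = n / 2 - cumulative_before
    return lower_boundary + class_width / class_frequency * position


# Breeze Fan production: median class 1601-1733 (f = 7, C.f before it = 13).
breeze_production_median = grouped_median(1600.5, 133, 7, 13, 30)

# Breeze Fan electricity: median class 154561-169832 (f = 4, C.f before it = 14).
breeze_electricity_median = grouped_median(154560.5, 15272, 4, 14, 30)

print(breeze_production_median, breeze_electricity_median)
```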
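Mode
(The mode is the value that occurs most frequently in a data set or a probability distribution.)
$\text{Mode} = l_b + \dfrac{(f_m - f_1)\,h}{(f_m - f_1) + (f_m - f_2)}$
where $f_m$ is the frequency of the modal class, $f_1$ the frequency of the class before it, $f_2$ the frequency of the class after it, and $h$ the class interval.
Mode of Electricity (Al Meraj Fan):
$= 128599.5 + \dfrac{(f_m - f_1)\,h}{(f_m - f_1) + (f_m - f_2)} = 135120.5$
Mode of Production (Al Meraj Fan):
$= 1346.5 + \dfrac{(9 - 7) \times 85}{(9 - 7) + (9 - 6)} = 1380.5$
Mode of Electricity (Breeze Fan):
$= 169832.5 + \dfrac{(7 - 4) \times 15272}{(7 - 4) + (7 - 4)} = 177468.5$
Mode of Production (Breeze Fan):
$= 1600.5 + \dfrac{(7 - 1) \times 133}{(7 - 1) + (7 - 6)} = 1714.5$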
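The grouped-mode formula can be sketched in Python as follows. The function name is illustrative; the frequencies fm, f1 and f2 and the class intervals are those marked in the frequency tables, and the lower boundary in each call is that of the class with the highest frequency.

```python
def grouped_mode(lower_boundary, fm, f1, f2, class_width):
    """Mode = lb + (fm - f1) * h / ((fm - f1) + (fm - f2))."""
    return lower_boundary + (fm - f1) * class_width / ((fm - f1) + (fm - f2))


# Al Meraj Fan production: modal class 1347-1431.
almeraj_production_mode = grouped_mode(1346.5, 9, 7, 6, 85)

# Breeze Fan production: modal class 1601-1733.
breeze_production_mode = grouped_mode(1600.5, 7, 1, 6, 133)

# Breeze Fan electricity: modal class 169833-185104.
breeze_electricity_mode = grouped_mode(169832.5, 7, 4, 4, 15272)

print(almeraj_production_mode, breeze_production_mode, breeze_electricity_mode)
```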
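Quartile
The quartiles of a set of values are the three points that divide the data set into four equal groups, each representing a fourth of the population being sampled.
$Q_k = l_b + \dfrac{h}{f}\left(\dfrac{kn}{4} - c\right)$
Quartile of Electricity (Al Meraj Fan):
$= 128599.5 + \dfrac{h}{f}\left(\dfrac{kn}{4} - 8\right) = 134990.5$
(We use quartiles to divide the data into four equal parts; the electricity charged by WAPDA over 30 months is divided into four equal parts.)
Quartile of Production (Al Meraj Fan):
$= 1346.5 + \dfrac{h}{f}\left(\dfrac{kn}{4} - 14\right) = 1423$
Quartile of Electricity (Breeze Fan):
$= 169832.5 + \dfrac{h}{f}\left(\dfrac{kn}{4} - 18\right) = 179651.5$
(We use quartiles to divide the data into four equal parts; the electricity charged by WAPDA over 30 months is divided into four equal parts.)
Quartile of Production (Breeze Fan):
$= 1600.5 + \dfrac{h}{f}\left(\dfrac{kn}{4} - 13\right) = 1638.5$
(We use quartiles to divide the data into four equal parts; the production made by the company over 30 months is divided into four equal parts.)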
Decile
The decile is a statistical tool that divides the data into ten equal parts.
$D_k = l_b + \dfrac{h}{f}\left(\dfrac{kn}{10} - c\right)$
Decile of Electricity (Al Meraj Fan):
$= 137728.5 + \dfrac{h}{f}\left(\dfrac{kn}{10} - 18\right) = 141151.5$
(We use deciles to divide the data into ten equal parts; the electricity charged by WAPDA over 30 months is divided into ten equal parts.)
Decile of Production (Al Meraj Fan):
$= 1261.5 + \dfrac{h}{f}\left(\dfrac{kn}{10} - 7\right) = 1285.5$
(We use deciles to divide the data into ten equal parts; the production made by the Al Meraj Fan Industry over 30 months is divided into ten equal parts.)
Decile of Electricity (Breeze Fan):
$D_2 = 124016.5 + \dfrac{15272}{5}(6 - 4) = 130125.3$
(We use deciles to divide the data into ten equal parts; the electricity charged by WAPDA over 30 months is divided into ten equal parts.)
Decile of Production (Breeze Fan):
$= l_b + \dfrac{h}{f}\left(\dfrac{kn}{10} - 6\right) = 200.5$
(We use deciles to divide the data into ten equal parts; the production made by the Breeze Fan Industry over 30 months is divided into ten equal parts.)
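Percentile
The percentile is a statistical tool that divides the data into a hundred equal parts.
$P_k = l_b + \dfrac{h}{f}\left(\dfrac{kn}{100} - c\right)$
Percentile of Electricity (Al Meraj Fan):
$= 119470.5 + \dfrac{h}{f}\left(\dfrac{kn}{100} - 3\right) = 126043.5$
(We use percentiles to divide the data into a hundred equal parts; the electricity charged by WAPDA over 30 months is divided into a hundred equal parts.)
Percentile of Production (Al Meraj Fan):
$= 1431.5 + \dfrac{h}{f}\left(\dfrac{kn}{100} - 23\right) = 1475$
(We use percentiles to divide the data into a hundred equal parts; the production made by the Al Meraj Fan Industry over 30 months is divided into a hundred equal parts.)
Percentile of Electricity (Breeze Fan):
$P_{55} = 154560.5 + \dfrac{15272}{4}(16.5 - 14) = 164105.5$
(We use percentiles to divide the data into a hundred equal parts; the electricity charged by WAPDA over 30 months is divided into a hundred equal parts.)
Percentile of Production (Breeze Fan):
$= l_b + \dfrac{h}{f}\left(\dfrac{kn}{100} - 6\right) = 160.9$
(We use percentiles to divide the data into a hundred equal parts; the production made by the Breeze Fan Industry over 30 months is divided into a hundred equal parts.)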
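Quartiles, deciles and percentiles of grouped data all use the same interpolation, only with a different number of parts. The sketch below is a hypothetical helper, not the report's own procedure: it locates the class automatically and is applied to the Breeze Fan electricity table. It assumes the decile in question is the second and the percentile is the 55th, which reproduces the figures 130125.3 and 164105.5.

```python
def grouped_quantile(boundaries, frequencies, k, parts):
    """k-th of `parts` quantiles: lb + (h / f) * (k * n / parts - c)."""
    n = sum(frequencies)
    target = k * n / parts          # position of the quantile in the data
    cumulative = 0                  # cumulative frequency before the class
    for (lower, upper), f in zip(boundaries, frequencies):
        if f > 0 and cumulative + f >= target:
            return lower + (upper - lower) / f * (target - cumulative)
        cumulative += f
    raise ValueError("k must lie between 0 and parts")


# Breeze Fan electricity: class boundaries and frequencies.
boundaries = [
    (108744.5, 124016.5), (124016.5, 139288.5), (139288.5, 154560.5),
    (154560.5, 169832.5), (169832.5, 185104.5), (185104.5, 200376.5),
    (200376.5, 215648.5),
]
frequencies = [4, 5, 5, 4, 7, 4, 1]

median = grouped_quantile(boundaries, frequencies, 1, 2)
decile_2 = grouped_quantile(boundaries, frequencies, 2, 10)
percentile_55 = grouped_quantile(boundaries, frequencies, 55, 100)

print(median, round(decile_2, 1), percentile_55)
```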
Descriptive Analysis
In descriptive analysis we apply measures of average. We now apply all these statistical tools to the data and compare the results, using measures of central tendency to find the averages. For this purpose we have 2 variables:
Electricity (y)
Production (x)
(Electricity Bills)   Al Meraj Fan   Breeze Fan
Mean                  172630         157615
Median                134990.5       158378.5
Mode                  135120.5       177468.5
Quartile              134990.5       179651.5
Decile                141151.5       130125.3
Percentile            126043.5       164105.5
Interpretation of Mean
The arithmetic mean is the most frequently used measure of central tendency. It is obtained by dividing the sum of all values by the number of data values. The sample mean is denoted by x̄ and the population mean by μ.
The mean value for Al Meraj Fan is 172630, which shows that its average monthly electricity bill is 172630, whereas the mean value for Breeze Fan is 157615, which shows its average consumption of electricity.
Interpretation of Median
The median is the "middle value" of the data. To find the median, the numbers have to be listed in numerical order, so the list may have to be rewritten first. It is denoted by x̃.
The median of the Al Meraj Fan data is 134990.5, which indicates the central value of the given data; in other words, 134990.5 is the middle value of our data. The median of the Breeze Fan data is 158378.5.
Interpretation of Quartile
The quartiles of a set of values are the three points that divide the data set into four equal groups, each representing a fourth of the population being sampled. The quartile of Al Meraj Fan is 134990.5 and the quartile of Breeze Fan is 179651.5. Quartiles divide the whole data into 4 equal parts.
Interpretation of Decile
The deciles of a set of values are the nine points that divide the data set into ten equal groups, each representing a tenth of the population being sampled. The decile of Al Meraj is 141151.5 and the decile of Breeze is 130125.3. Deciles divide the whole data into 10 equal parts.
Interpretation of Percentile
The percentiles of a set of values are the ninety-nine points that divide the data set into a hundred equal groups, each representing a hundredth of the population being sampled. The percentile of Al Meraj is 126043.5 and the percentile of Breeze is 164105.5. Percentiles divide the whole data into 100 equal parts.
Production (x)   Al Meraj Fan   Breeze Fan
Mean             1338           1596
Median           1355.5         1638.5
Mode             1380.5         1714.5
Quartile         1423           1638.5
Decile           1285.5         200.5
Percentile       1475           160.9
Interpretation of Mean
The arithmetic mean is the most frequently used measure of central tendency. It is obtained by dividing the sum of all values by the number of data values. The sample mean is denoted by x̄ and the population mean by μ.
The mean value of Al Meraj production is 1338, which shows the company's average monthly production, whereas the mean value for Breeze Fan is 1596, which shows that company's average production.
Interpretation of Median
The median is the "middle value" of the data. To find the median, the numbers have to be listed in numerical order, so the list may have to be rewritten first. It is denoted by x̃.
The median of the Al Meraj data is 1355.5, which indicates the central value of the given data; in other words, 1355.5 is the middle value of our data. The median of the Breeze data is 1638.5.
Interpretation of Decile
The deciles of a set of values are the nine points that divide the data set into ten equal groups, each representing a tenth of the population being sampled. The decile of Al Meraj is 1285.5 and the decile of Breeze is 200.5. Deciles divide the whole data into 10 equal parts.
Interpretation of Percentile
The percentiles of a set of values are the ninety-nine points that divide the data set into a hundred equal groups, each representing a hundredth of the population being sampled. The percentile of Al Meraj is 1475 and the percentile of Breeze is 160.9. Percentiles divide the whole data into 100 equal parts.
Box whisker:
In descriptive statistics, a box plot or boxplot is a convenient way of graphically depicting
groups of numerical data through their quartiles. Box plots may also have lines extending
vertically from the boxes (whiskers) indicating variability outside the upper and lower
quartiles, hence the terms box-and-whisker plot and box-and-whisker diagram. Outliers may
be plotted as individual points.
(Production, Al Meraj Fan Industry)
1007, 1132, 1149, 1187, 1218, 1233, 1256, 1267, 1290, 1321, 1331, 1342, 1343, 1345, 1355,
1357, 1365, 1367, 1368, 1389, 1413, 1421, 1422, 1447, 1453, 1454, 1460, 1475, 1510,
1518
Median = (1355 + 1357) / 2 = 1356
Q1 = 1267
Q3 = 1422
Interquartile Range (IQR) = Q3 - Q1
= 1422 - 1267
= 155
1.5 * IQR = 1.5 * 155 = 232.5
Lower fence = Q1 - 232.5
= 1267 - 232.5
= 1034.5
Upper fence = Q3 + 232.5
= 1422 + 232.5
= 1654.5
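The fence calculation can be sketched in Python as below. The function names are illustrative, and the quartiles are passed in as quoted in this report rather than recomputed, since different quartile conventions give slightly different values. Any observation outside the fences would be drawn as an individual outlier point.

```python
def fences(q1, q3, k=1.5):
    """Lower and upper fences: Q1 - k * IQR and Q3 + k * IQR."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def find_outliers(data, q1, q3):
    """Observations lying outside the fences."""
    lower, upper = fences(q1, q3)
    return [x for x in data if x < lower or x > upper]


# Al Meraj Fan monthly production in ascending order (from the data table).
almeraj_production = [
    1007, 1132, 1149, 1187, 1218, 1233, 1256, 1267, 1290, 1321,
    1331, 1342, 1343, 1345, 1355, 1357, 1365, 1367, 1368, 1389,
    1413, 1421, 1422, 1447, 1453, 1454, 1460, 1475, 1510, 1518,
]

almeraj_fences = fences(1267, 1422)
almeraj_outliers = find_outliers(almeraj_production, 1267, 1422)
breeze_fences = fences(1389, 1789)   # Breeze Fan quartiles quoted in the report

print(almeraj_fences, almeraj_outliers, breeze_fences)
```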
[Figure: box-and-whisker plot of Al Meraj Fan production, marking the first quartile (Q1), the median, the third quartile (Q3) and the lower and upper fences, on an axis from 1000 to 1600.]
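(Production, Breeze Fan Industry)
1202, 1234, 1234, 1301, 1321, 1329, 1349, 1389, 1400, 1423, 1432, 1460, 1479, 1649,
1650, 1673, 1689, 1690, 1731, 1732, 1756, 1758, 1789, 1790, 1800, 1842, 1880, 1934,
1967, 2000
Median = (1650 + 1673) / 2 = 1661.5
Q1 = 1389
Q3 = 1789
Interquartile Range (IQR) = Q3 - Q1
= 1789 - 1389
= 400
1.5 * IQR = 1.5 * 400 = 600
Lower fence = Q1 - 600
= 1389 - 600
= 789
Upper fence = Q3 + 600
= 1789 + 600
= 2389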
[Figure: box-and-whisker plot of Breeze Fan production, marking the first quartile (Q1), the median, the third quartile (Q3) and the lower and upper fences, on an axis from 1000 to 2100.]
(Electricity, Al Meraj Fan Industry)
Sr. no   X        (X - X̄)   (X - X̄)²
1        139210   -33420    1116896400
2        144818   -27812    773507344
3        155431   -17199    295805601
4        138760   -33870    1147176900
5        129171   -43459    1888684681
6        136165   -36465    1329696225
7        132815   -39815    1585234225
8        143891   -28739    825930121
9        121876   -50754    2575968516
10       101213   -71417    5100387889
11       121239   -51391    2641034881
12       134730   -37900    1436410000
13       129377   -43253    1870822009
14       139716   -32914    1083331396
15       131810   -40820    1666272400
16       117718   -54912    3015327744
17       121916   -50714    2571909796
18       133181   -39449    1556223601
19       144718   -27912    779079744
20       132183   -40447    1635959809
21       147516   -25114    630712996
22       141871   -30759    946116081
23       149169   -23461    550418521
24       137162   -35468    1257979024
25       138718   -33912    1150023744
26       122819   -49811    2481135721
27       123391   -49239    2424479121
28       133661   -38969    1518582961
29       112188   -60442    3653235364
30       155985   -16645    277056025
Σ(X - X̄)² = 49785398840
Variance:
$S^2 = \dfrac{\sum (X - \bar{X})^2}{n - 1} = \dfrac{49785398840}{30 - 1} = 1716737891$
Standard deviation:
$S = \sqrt{\dfrac{\sum (X - \bar{X})^2}{n - 1}} = \sqrt{\dfrac{49785398840}{30 - 1}} = \sqrt{1716737891} = 41433.53$
(Production, Al Meraj Fan Industry)
Sr. no   X      (X - X̄)   (X - X̄)²
1        1421   83        6889
2        1454   116       13456
3        1510   172       29584
4        1357   19        361
5        1218   -120      14400
6        1365   27        729
7        1355   17        289
8        1389   51        2601
9        1321   -17       289
10       1007   -331      109561
11       1187   -151      22801
12       1368   30        900
13       1290   -48       2304
14       1367   29        841
15       1342   4         16
16       1132   -206      42436
17       1149   -189      35721
18       1345   7         49
19       1447   109       11881
20       1343   5         25
21       1460   122       14884
22       1453   115       13225
23       1475   137       18769
24       1413   75        5625
25       1422   84        7056
26       1267   -71       5041
27       1256   -82       6724
28       1331   -7        49
29       1233   -105      11025
30       1518   180       32400
Σ(X - X̄)² = 409931
Variance:
$S^2 = \dfrac{\sum (X - \bar{X})^2}{n - 1} = \dfrac{409931}{30 - 1} = 14135.55$
Standard Deviation:
$S = \sqrt{\dfrac{\sum (X - \bar{X})^2}{n - 1}} = \sqrt{\dfrac{409931}{30 - 1}} = \sqrt{14135.55} = 118.9$
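The variance and standard deviation of Al Meraj Fan production can be reproduced with the short sketch below. It assumes, as the deviation table does, that deviations are taken from the rounded mean of 1338; using the exact mean of the raw figures would give slightly different values. The variable names are illustrative.

```python
import math

# Al Meraj Fan monthly production, Nov-07 to Apr-10 (from the data table).
production = [
    1421, 1454, 1510, 1357, 1218, 1365, 1355, 1389, 1321, 1007,
    1187, 1368, 1290, 1367, 1342, 1132, 1149, 1345, 1447, 1343,
    1460, 1453, 1475, 1413, 1422, 1267, 1256, 1331, 1233, 1518,
]

mean_used = 1338  # assumption: the rounded grouped mean used in the table

# Sum of squared deviations from the mean.
sum_squared_deviations = sum((x - mean_used) ** 2 for x in production)

# Sample variance divides by n - 1, as in the formula S^2 above.
variance = sum_squared_deviations / (len(production) - 1)
standard_deviation = math.sqrt(variance)

print(sum_squared_deviations, round(variance, 2), round(standard_deviation, 1))
```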
(Electricity, Breeze Fan Industry)
Sr. no   X         (X - X̄)   (X - X̄)²
1        150,871   -6,744    45481536
2        163,391   5,776     33362176
3        178,209   20,594    424112836
4        145,372   -12,243   149891049
5        133,078   -24,537   602064369
6        177,879   20,264    410629696
7        200,378   42,763    1828674169
8        186,984   29,369    862538161
9        121,000   -36,615   1340658225
10       116,899   -40,716   1657792656
11       185,013   27,398    750650404
12       126,865   -30,750   945562500
13       178,763   21,148    447237904
14       169,469   11,854    140517316
15       156,012   -1,603    2569609
16       132,831   -24,784   614246656
17       108,745   -48,870   2388276900
18       122,672   -34,943   1221013249
19       141,371   -16,244   263867536
20       190,121   32,506    1056640036
21       187,641   30,026    901560676
22       167,131   9,516     90554256
23       179,293   21,678    469935684
24       132,219   -25,396   644956816
25       139,325   -18,290   334524100
26       129,290   -28,325   802305625
27       198,021   40,406    1632644836
28       178,325   20,710    428904100
29       172,021   14,406    207532836
30       152,349   -5,266    27730756
Σ(X - X̄)² = 20726436668
Variance:
$S^2 = \dfrac{\sum (X - \bar{X})^2}{n - 1} = \dfrac{20726436668}{30 - 1} = 714704713$
Standard Deviation:
$S = \sqrt{\dfrac{\sum (X - \bar{X})^2}{n - 1}} = \sqrt{\dfrac{20726436668}{30 - 1}} = \sqrt{714704713} = 26733.96$
(Production, Breeze Fan Industry)
Sr. no   X      (X - X̄)   (X - X̄)²
1        1650   54        2916
2        1732   136       18496
3        1800   204       41616
4        1400   -196      38416
5        1234   -362      131044
6        1789   193       37249
7        2000   404       163216
8        1842   246       60516
9        1432   -164      26896
10       1234   -362      131044
11       1690   94        8836
12       1329   -267      71289
13       1880   284       80656
14       1790   194       37636
15       1349   -247      61009
16       1321   -275      75625
17       1202   -394      155236
18       1301   -295      87025
19       1423   -173      29929
20       1967   371       137641
21       1758   162       26244
22       1673   77        5929
23       1689   93        8649
24       1460   -136      18496
25       1479   -117      13689
26       1389   -207      42849
27       1934   338       114244
28       1756   160       25600
29       1731   135       18225
30       1649   53        2809
Σ(X - X̄)² = 1673025
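Variance:
$S^2 = \dfrac{\sum (X - \bar{X})^2}{n - 1} = \dfrac{1673025}{30 - 1} = 57690.5172$
Standard Deviation:
$S = \sqrt{\dfrac{\sum (X - \bar{X})^2}{n - 1}} = \sqrt{\dfrac{1673025}{30 - 1}} = \sqrt{57690.5172} \approx 240.19$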
Al Meraj Fan Industry:
                      Production   Electricity
Variance              14135.55     1716737891
Standard Deviation    118.9        41433.53
Breeze Fan Industry:
                      Production   Electricity
Variance              57690.5172   714704713
Standard Deviation    240.19       26733.96
Crux of study
Throughout all phases of this statistical project we applied many statistical tools, which tell us the average performance of the companies, the deviation in the variables, and the relation between the variables. For these we used the mean, median, quartiles, deciles, percentiles, variance, standard deviation and box-and-whisker plots. By working on this report we came to know how to use these tools in practical life and in decision making. Every statistical tool tells us something new and gives a different interpretation; for example, the mean and median describe the average of the data.
In our report we chose 2 companies, and we are now able to conclude which company is better than the other and likely to earn a better reputation in future. Using descriptive analysis, we interpret that the data of the companies are not normal and contain outliers, which shows that our data is negatively skewed; we could say that the data include some extreme results that are totally different from the others.
We can also make better decisions with an eye on the future by forecasting with a statistical model, and we came to know about the relation between the variables.
After a thorough analysis of the performance of the 2 companies (Al Meraj Fan Industry and Breeze Fan Industry), we obtained different results regarding the variables discussed in the study.
Recommendations
After all the findings and analysis, we are now able to give some suggestions and recommendations to the companies in light of the findings and calculations.
Conclusion
In light of the above findings and analysis, we conclude that Al Meraj Fan Industry has more goodwill than Breeze Fan Industry, and that the overall performance of the two companies is good.