Probability distributions and sampling formulas

Session 1, 2011
MATH2089
Page 6
Formula sheet
1
1.1
Probability distributions
The Binomial distribution
Assume X Bin(n, )
domain of variation :
SX = {0, 1, . . . , n}
probability mass function (pmf ) :

n x
p(x) =
(1 )nx ,
x
for x SX
cumulative distribution function (cdf ) :

x
X
n k
(1 )nk
F (x) =
k
k=0
(where x denotes the integer part of x).

expectation :
E(X) = n
variance :
1.2
Var(X) = n(1 )
The Poisson distribution
Assume X P()
SX = {0, 1, 2, . . .}
probability mass function (pmf ) :
x
,
for x SX
x!
p(x) = e
F (x) = e
x k
X
k=0
k!
(where x denotes the integer part of x).

expectation :
E(X) =
variance :
Var(X) =
Please see over . . .
Session 1, 2011
1.3
MATH2089
Page 7
The Uniform distribution
Assume X U[,]
SX = [, ]
probability density function (pdf ) :

f (x) =
1
,
for x SX

F (x) =
expectation :
E(X) =
variance :
1.4
x
,
Var(X) =
for x SX
+
2
( )2
12
The Exponential distribution
Assume X Exp()
SX = [0, +)

f (x) = ex ,
for x SX

F (x) = 1 ex ,
expectation :
variance :
E(X) =
Var(X) =
for x SX
1
1
2
Session 1, 2011
MATH2089
1.5
The Normal distribution
Page 8
Assume X N (, )
SX = (, +)
f (x) =
2
1 (x)
1
e 2 2 ,
2
for x SX

Z x
2
1 (y)
1
F (x) =
e 2 2 dy,
2
(no closed form)

expectation :
E(X) =
variance :
1.6
for x SX
Var(X) = 2
The Standard Normal distribution
Assume X N (0, 1)
SX = (, +)
1 2
1
(x) = e 2 x ,
2
for x SX

Z x
1 2
1
e 2 y dy,
for x SX
(x) =
2
(no closed form, see also the attached Standard Normal table)
expectation :
E(X) = 0
variance :
Var(X) = 1
standardisation : If X N (, ), then
Z=
X
N (0, 1)
Session 1, 2011
1.7
MATH2089
Page 9
The Students t distribution
Assume X t
SX = (, +)

+1
2
+1
x2
2
1+
f (x) =
,
for x SX

F (x) =

+1
2
+1
y2
2
1+
dy,
for x SX
(no closed form, see also the attached t-distribution critical values table)
expectation :
E(X) = 0
variance :
1.8
Var(X) =
for > 2
The 2 -distribution
Assume X 2
SX = [0, +)

f (x) =
1
2/2 (/2)
x/21 ex/2 ,

Z x
1
F (x) =
y /21 ey/2 dy,
/2 (/2)
2
for x SX
for x SX
(no closed form, see also the attached 2 -distribution critical values table)
expectation :
E(X) =
variance :
Var(X) = 2
Session 1, 2011
1.9
MATH2089
Page 10
The Fisher F-distribution
Assume X Fd1 ,d2

SX = [0, +)

f (x) =
((d1 + d2 )/2)(d1 /d2 )d1 /2 xd1 /21

,
(d1 /2)(d2 /2)((d1 /d2 )x + 1)(d1 +d2 )/2
for x SX

Z x
((d1 + d2 )/2)(d1 /d2 )d1 /2 y d1 /21
dy,
F (x) =
(d1 +d2 )/2
(d1 /2)(d2 /2)((d1 /d2 )y + 1)
for x SX
(no closed form, see also the attached F -distribution critical values table)
expectation :
d2
for d2 > 2
E(X) =
d2 2
variance :
Var(X) =
2d22 (d1 + d2 2)
d1 (d2 2)2 (d2 4)
for d2 > 4
Sampling distributions
2.1
Sample mean
2.1.1
known variance
be the sample average from a random sample of size n from a population

Let X
with mean and standard deviation . Under appropriate conditions,

X
Z= n
N (0, 1)
(exact result if the population distribution is normal, approximate result if the

population distribution is not normal but n > 30)
2.1.2
unknown variance
and S be the sample average and standard deviation from a random

Let X
sample of size n from a normal population with mean . Under appropriate
conditions,

X
T = n
tn1
S
If the population is not normal but n is large enough (n > 40), we can also
write

X
N (0, 1)
T = n
S
approximately
Session 1, 2011
2.2
MATH2089
Page 11
Sample proportion
Let p be the sample proportion of successes where the number of trials is n

and the true probability of a success is . Under appropriate conditions,
p
np
N (0, 1)
(1 )
approximately when n(1 ) > 5
2.3
Sample variance
Let S 2 be the sample variance from a random sample of size n from a normal
population with variance . Under appropriate conditions,
(n 1)S 2
2n1
2
2.4
2.4.1
Difference in sample means

variances 12 and 22 known
For two independent samples of size n1 and n2 from two populations with
i be the
means 1 and 2 and standard deviations 1 and 2 respectively, let X
sample average of sample i for i = 1 and 2. Under appropriate conditions,
1 X
) (1 2 )
(X
q2 2
N (0, 1)
22
1
n1 + n2
(exact result if both population distributions are normal, approximate result

if they are not but n1 , n2 > 30)
2.4.2
variances 12 and 22 unknown; 12 = 22
For two independent samples of size n1 and n2 from two normal populations
i
with means 1 and 2 respectively and common standard deviation , let X
and Si be the sample average and sample standard deviation of sample i for
i = 1 and 2. Under appropriate conditions,
1 X
2 ) (1 2 )
(X
q
tn1 +n2 2 ,
Sp n11 + n12
where Sp is the pooled sample standard deviation,

s
(n1 1)S12 + (n2 1)S22
.
Sp =
n1 + n2 2
Session 1, 2011
2.4.3
MATH2089
Page 12
variances 12 and 22 unknown; 12 6= 22
For two independent samples of size n1 and n2 from two normal populations
i
with means 1 and 2 and standard deviations 1 and 2 respectively, let X
and Si be the sample average and sample standard deviation of sample i for
i = 1 and 2. Under appropriate conditions,
1 X
) (1 2 )
(X
q2 2
t ,
S1
S22
+
n1
n2
where
(s21 /n1 + s22 /n2 )2

(s21 /n1 )2
n1 1
(s22 /n2 )2
n2 1
(rounded down to the nearest integer)
2.5
Ratio of sample variances
Let S12 and S22 be the sample variances from two independent random samples
of size n1 and n2 from normal populations with variances 12 and 22 respectively. Under appropriate conditions,
S12 /12
Fn1 1,n2 1
S22 /22
Simple linear regression
Consider the simple linear regression model

Y = 0 + 1 X +
where N (0, )
The least squares estimators 0 and 1 of 0 and 1 are
SXY
1 =
SXX
0 = Y 1 X
where
SXY =
X
i
i Y )
(Xi X)(Y
SXX =
X
i
2.
(Xi X)
An estimator of is
S=
i (Yi
0 1 Xi )2
.
n2
Under the simple linear regression model :

1 1
tn2
S/ SXX
Session 1, 2011
MATH2089
Page 13
0
q0
tn2
2
S n1 + SXXX
Let x0 denote the predictor value for a response yet to be observed :

i) a 100(1 )% confidence interval for the mean response at x0 is
s
)2
1 (x0 x
+
y(x0 ) s tn2;1/2
n
sxx
where y(x0 ) = b0 + b1 x0 ;
ii) and a 100(1 )% prediction interval for the response at x0 is
s
1 (x0 x
)2
y(x0 ) s tn2;1/2 1 + +
n
sxx
ANOVA
Total sum of squares :

ni
k X
X
SSTot =
i=1 j=1
2
(Xij X)
Treatment sum of squares :

SSTr =
k
X
i=1
2
i X)
ni (X
Error sum of squares :

SSEr =
ni
k X
X
i=1 j=1
i )2
(Xij X
Under the assumption of equality of means in each of the k groups of a

one-way Analysis of Variance,
F =
MSTr
Fk1,nk ,
MSEr
where n is the total number of observations.

Probability distributions and sampling formulas

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Probability distributions and sampling formulas

Uploaded by

Copyright:

Available Formats

Session 1, 2011

cumulative distribution function (cdf ) :

(where x denotes the integer part of x).

The Poisson distribution

(where x denotes the integer part of x).

Please see over . . .

The Uniform distribution

probability density function (pdf ) :

cumulative distribution function (cdf ) :

The Exponential distribution

probability density function (pdf ) :

cumulative distribution function (cdf ) :

Please see over . . .

The Normal distribution

cumulative distribution function (cdf ) :

(no closed form)

The Standard Normal distribution

cumulative distribution function (cdf ) :

Please see over . . .

The Students t distribution

cumulative distribution function (cdf ) :

probability density function (pdf ) :

cumulative distribution function (cdf ) :

Please see over . . .

The Fisher F-distribution

Assume X Fd1 ,d2

probability density function (pdf ) :

((d1 + d2 )/2)(d1 /d2 )d1 /2 xd1 /21

cumulative distribution function (cdf ) :

be the sample average from a random sample of size n from a population

(exact result if the population distribution is normal, approximate result if the

and S be the sample average and standard deviation from a random

Please see over . . .

Let p be the sample proportion of successes where the number of trials is n

approximately when n(1 ) > 5

Difference in sample means

(exact result if both population distributions are normal, approximate result

variances 12 and 22 unknown; 12 = 22

where Sp is the pooled sample standard deviation,

Please see over . . .

variances 12 and 22 unknown; 12 6= 22

(s21 /n1 + s22 /n2 )2

(rounded down to the nearest integer)

Ratio of sample variances

Simple linear regression

Consider the simple linear regression model

Under the simple linear regression model :

Please see over . . .

Let x0 denote the predictor value for a response yet to be observed :

Total sum of squares :

Treatment sum of squares :

Error sum of squares :

Under the assumption of equality of means in each of the k groups of a

where n is the total number of observations.

You might also like