Professional Documents
Culture Documents
[Read Ch. 5]
[Recommended exercises: 5.2, 5.3, 5.4]
Sample error, true error
Condence intervals for observed hypothesis
error
Estimators
Binomial distribution, Normal distribution,
Central Limit Theorem
Paired t tests
Comparing learning methods
74
n x2S
75
independently
2. Variance: Even with unbiased S , errorS (h) may
still vary from errorD (h)
76
Example
Hypothesis h misclassies 12 of the 40 examples in
12 = :30
errorS (h) = 40
What is errorD (h)?
77
Estimators
Experiment:
1. choose sample S of size n according to
distribution D
2. measure errorS (h)
errorS (h) is a random variable (i.e., result of an
experiment)
errorS (h) is an unbiased estimator for errorD (h)
Given observed errorS (h) what can we conclude
about errorD (h)?
78
Then
With approximately 95% probability, errorD(h)
lies in interval
vuu
uut errorS (h)(1 ? errorS (h))
errorS (h) 1:96
n
79
Then
With approximately N% probability, errorD(h)
lies in interval v
uuu error (h)(1 ? error (h))
S
S
errorS (h) zN ut
n
where
N %: 50% 68% 80% 90% 95% 98% 99%
zN : 0.67 1.00 1.28 1.64 1.96 2.33 2.58
80
0.14
0.12
P(r)
0.1
0.08
0.06
0.04
0.02
0
0
10
15
20
25
30
35
40
n
!
P (r) = r!(n ? r)! errorD (h)r (1 ? errorD (h))n?r
81
0.14
0.12
P(r)
0.1
0.08
0.06
0.04
0.02
0
0
10
15
20
25
30
35
40
83
p(x) = p 1 2 e? ( x? )
2
The probability that X will fall into the interval
(a; b) is given by
Zb
a p(x)dx
Expected, or mean value of X , E [X ], is
E [X ] =
Variance of X is
V ar(X ) = 2
Standard deviation of X , X , is
X =
-3
-2
-1
1
2
84
-2
-1
85
Then
With approximately 95% probability, errorS (h)
lies in interval
vuu
uut errorD (h)(1 ? errorD (h))
errorD (h) 1:96
n
which is approximately
vuu
uut errorS (h)(1 ? errorS (h))
errorS (h) 1:96
86
n i=1
87
88
d^
errorS1 (h1)(1
errorS1 (h1 ))
n1
errorS2 (h2 ))
n2
89
k i=1
vuu
uu 1 Xk
s t k(k ? 1) i=1(i ? )2
Note i approximately Normally distributed
90
91
Si fD0 ? Tig
hA LA(Si)
hB LB (Si)
i errorTi(hA) ? errorTi(hB )
3. Return the value , where
1 Xk i
k i=1
92
93