
Kalman Filter and Parameter Identification

Florian Herzog
2013
Continuous-time Kalman Filter
In this chapter, we shall use stochastic processes with independent increments $w_1(\cdot)$ and $w_2(\cdot)$ at the input and the output, respectively, of a dynamic system. We shall switch back and forth between the mathematically precise description of these (normalized) Brownian motions by their increments and the sloppy description by the corresponding white noises $v(\cdot)$ and $r(\cdot)$, respectively, which is preferred in engineering circles:

$$Q^{1/2}(t)\,dw_1(t) = [v(t) - u(t)]\,dt \qquad (1)$$
$$R^{1/2}(t)\,dw_2(t) = [r(t) - \bar{r}(t)]\,dt\,, \qquad (2)$$

where $Q^{1/2}(t)$ and $R^{1/2}(t)$ are the volatility parameters.
We assume a linear dynamical system, where the inputs u(t) and the measurements
y(t) are disturbed by noise.
Consider the following linear time-varying dynamic system of order $n$ which is driven by the $m$-vector-valued white noise $v(\cdot)$. Its initial state $x(t_0)$ is a random vector, and its $p$-vector-valued output $y(\cdot)$ is corrupted by the additive white noise $r(\cdot)$:
System description in the mathematically precise form:
$$dx(t) = A(t)x(t)\,dt + B(t)\,dv(t) = [A(t)x(t) + B(t)u(t)]\,dt + B(t)Q^{1/2}(t)\,dw_1(t) \qquad (3)$$
$$x(t_0) = \xi \qquad (4)$$
$$y(t)\,dt = C(t)x(t)\,dt + dr(t) = [C(t)x(t) + \bar{r}(t)]\,dt + R^{1/2}(t)\,dw_2(t)\,, \qquad (5)$$
System description in the engineer's form:

$$\dot{x}(t) = A(t)x(t) + B(t)v(t) \qquad (7)$$
$$x(t_0) = \xi \qquad (8)$$
$$y(t) = C(t)x(t) + r(t)\,, \qquad (9)$$

where

$$\xi : N(x_0, \Sigma_0) \qquad (10)$$
$$v(t) : N(u(t), Q(t)) \qquad (11)$$
$$r(t) : N(\bar{r}(t), R(t))\,. \qquad (12)$$

Note that in the last two lines, we have deleted the factor $\frac{1}{dt}$ which ought to multiply $Q(t)$ and $R(t)$.
The filtering problem is stated as follows: find the filter with the state vector $\hat{x}(t)$ which is optimal in the following sense:

- The state estimation is bias-free: $E\{x(t) - \hat{x}(t)\} \equiv 0$.
- The error of the state estimate has infimal covariance matrix: $\Sigma_{\mathrm{opt}}(t) \leq \Sigma_{\mathrm{subopt}}(t)$.
With the above-mentioned assumptions that $\xi$, $v$, and $r$ are mutually independent and Gaussian, the optimal filter is the following linear dynamic system:

$$\dot{\hat{x}}(t) = A(t)\hat{x}(t) + B(t)u(t) + H(t)[y(t) - \bar{r}(t) - C(t)\hat{x}(t)]$$
$$\qquad\; = [A(t) - H(t)C(t)]\hat{x}(t) + B(t)u(t) + H(t)[y(t) - \bar{r}(t)] \qquad (13)$$
$$\hat{x}(t_0) = x_0 \qquad (14)$$

with

$$H(t) = \Sigma(t)C^{T}(t)R^{-1}(t)\,, \qquad (15)$$

where $\Sigma(t)$ is the covariance matrix of the error $x(t) - \hat{x}(t)$ of the state estimate $\hat{x}(t)$.
The error covariance matrix $\Sigma(t)$ satisfies the following matrix Riccati differential equation:

$$\dot{\Sigma}(t) = A(t)\Sigma(t) + \Sigma(t)A^{T}(t) - \Sigma(t)C^{T}(t)R^{-1}(t)C(t)\Sigma(t) + B(t)Q(t)B^{T}(t) \qquad (16)$$
$$\Sigma(t_0) = \Sigma_0\,. \qquad (17)$$

The estimation error $e(t) = x(t) - \hat{x}(t)$ satisfies the differential equation

$$\dot{e} = \dot{x} - \dot{\hat{x}}$$
$$\;\; = Ax + Bu + B(v - u) - [A - HC]\hat{x} - Bu - HCx - H(r - \bar{r})$$
$$\;\; = [A - HC]e + B(v - u) - H(r - \bar{r})\,. \qquad (18)$$
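In practice, the filter equation (13) and the Riccati equation (16) are integrated forward in time together. The following is a minimal sketch, assuming a hypothetical time-invariant scalar system and a simple forward-Euler discretization; it only illustrates the structure of (13), (15) and (16) and is not meant as a production implementation.

```python
import numpy as np

# Hypothetical time-invariant matrices (illustrative values, not from the slides)
A = np.array([[-0.5]])
B = np.array([[1.0]])
C = np.array([[1.0]])
Q = np.array([[0.1]])       # input-noise intensity
R = np.array([[0.2]])       # measurement-noise intensity
r_bar = np.zeros(1)         # mean of the measurement noise
dt = 1e-3                   # integration step

def kalman_step(x_hat, Sigma, y, u):
    """One forward-Euler step of the filter (13) and the Riccati equation (16)."""
    H = Sigma @ C.T @ np.linalg.inv(R)                           # gain (15)
    dx_hat = A @ x_hat + B @ u + H @ (y - r_bar - C @ x_hat)     # (13)
    dSigma = (A @ Sigma + Sigma @ A.T
              - Sigma @ C.T @ np.linalg.inv(R) @ C @ Sigma
              + B @ Q @ B.T)                                     # (16)
    return x_hat + dt * dx_hat, Sigma + dt * dSigma

x_hat, Sigma = np.zeros(1), np.array([[1.0]])    # initialization (14), (17)
x_hat, Sigma = kalman_step(x_hat, Sigma, y=np.array([0.4]), u=np.zeros(1))
```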
The covariance matrix $\Sigma(t)$ of the state estimation error $e(t)$ is governed by the matrix differential equation

$$\dot{\Sigma} = [A - HC]\Sigma + \Sigma[A - HC]^{T} + BQB^{T} + HRH^{T} \qquad (19)$$
$$\;\; = A\Sigma + \Sigma A^{T} + BQB^{T} - \Sigma C^{T}R^{-1}C\Sigma + [H - \Sigma C^{T}R^{-1}]\,R\,[H - \Sigma C^{T}R^{-1}]^{T}$$
$$\;\; \geq A\Sigma + \Sigma A^{T} + BQB^{T} - \Sigma C^{T}R^{-1}C\Sigma\,. \qquad (20)$$

Obviously, $\dot{\Sigma}(t)$ is infimized at all times $t$ for

$$H(t) = \Sigma(t)C^{T}(t)R^{-1}(t)\,. \qquad (21)$$
Given (19) for the covariance matrix $\Sigma(t)$:

$$\dot{\Sigma} = [A - HC]\Sigma + \Sigma[A - HC]^{T} + BQB^{T} + HRH^{T}\,.$$

Its first differential with respect to $H$, at some value of $H$, with increment $dH$ can be formulated as follows:

$$d\dot{\Sigma}(H, dH) = \frac{\partial \dot{\Sigma}(H)}{\partial H}\,dH$$
$$\;\; = -\,dH\,C\Sigma - \Sigma C^{T}dH^{T} + dH\,RH^{T} + HR\,dH^{T}$$
$$\;\; = U\{[HR - \Sigma C^{T}]\,T\,dH\}\,, \qquad (22)$$

where $T$ is the matrix transposing operator and $U$ is the operator which adds the transposed matrix to its matrix argument.
Consider the following stochastic system of first order with the random initial condition $\xi : N(x_0, \Sigma_0)$ and the uncorrelated white noises $v : N(0, Q)$ and $r : N(0, R)$:

$$\dot{x}(t) = ax(t) + b[u(t) + v(t)]$$
$$x(0) = \xi$$
$$y(t) = x(t) + r(t)\,.$$

Find the time-invariant Kalman filter! The resulting time-invariant Kalman filter is described by the following equations:

$$\dot{\hat{x}}(t) = -\sqrt{a^{2} + b^{2}\tfrac{Q}{R}}\;\hat{x}(t) + b\,u(t) + \left(a + \sqrt{a^{2} + b^{2}\tfrac{Q}{R}}\right)y(t)$$
$$\hat{x}(0) = x_0\,.$$
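The gain of this filter is obtained from the steady-state solution of the Riccati equation (16). A small numerical cross-check is sketched below; the values of $a$, $b$, $Q$ and $R$ are hypothetical, chosen only for illustration.

```python
import numpy as np

# Hypothetical scalar parameters (illustration only)
a, b, Q, R = -1.0, 2.0, 0.5, 0.1

# Steady-state Riccati equation: 0 = 2*a*Sigma - Sigma**2/R + b**2*Q, positive root:
Sigma_inf = R * (a + np.sqrt(a**2 + b**2 * Q / R))
H_inf = Sigma_inf / R                        # H = Sigma * C^T * R^{-1} with C = 1

# Closed-loop pole and measurement gain as written on the slide:
pole = -np.sqrt(a**2 + b**2 * Q / R)         # coefficient of x_hat(t)
gain = a + np.sqrt(a**2 + b**2 * Q / R)      # coefficient of y(t)

assert np.isclose(H_inf, gain)               # the two gain expressions agree
assert np.isclose(a - H_inf, pole)           # a - H equals the filter pole
```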
Continuous/discrete-time Kalman Filter
The continuous-time measurement (9), which is corrupted by the white noise error $r(t)$ with infinite covariance,

$$y(t) = C(t)x(t) + r(t)\,,$$

must be replaced by an averaged discrete-time measurement

$$y_k = C_k x(t_k) + r_k \qquad (23)$$
$$r_k : N(\bar{r}_k, R_k) \qquad (24)$$
$$\mathrm{Cov}(r_i, r_j) = R_i\,\delta_{ij}\,. \qquad (25)$$
At any sampling time $t_k$ we denote by $\hat{x}(t_k|t_{k-1})$ and $\Sigma(t_k|t_{k-1})$ the state estimate and the error covariance matrix before the new measurement $y_k$ has been processed, and by $\hat{x}(t_k|t_k)$ and $\Sigma(t_k|t_k)$ the state estimate and the corresponding error covariance matrix after the new measurement $y_k$ has been processed.

As a consequence, the continuous-time/discrete-time Kalman filter alternatingly performs update steps at the sampling times $t_k$, processing the latest measurement information $y_k$, and open-loop extrapolation steps between the times $t_k$ and $t_{k+1}$ with no measurement information available:
The Continuous-Time/Discrete-Time Kalman Filter
Update at time $t_k$:

$$\hat{x}(t_k|t_k) = \hat{x}(t_k|t_{k-1}) + \Sigma(t_k|t_{k-1})C_k^{T}\left[C_k\Sigma(t_k|t_{k-1})C_k^{T} + R_k\right]^{-1}\left[y_k - \bar{r}_k - C_k\hat{x}(t_k|t_{k-1})\right] \qquad (26)$$
$$\Sigma(t_k|t_k) = \Sigma(t_k|t_{k-1}) - \Sigma(t_k|t_{k-1})C_k^{T}\left[C_k\Sigma(t_k|t_{k-1})C_k^{T} + R_k\right]^{-1}C_k\Sigma(t_k|t_{k-1}) \qquad (27)$$

or, equivalently:

$$\Sigma^{-1}(t_k|t_k) = \Sigma^{-1}(t_k|t_{k-1}) + C_k^{T}R_k^{-1}C_k\,. \qquad (28)$$
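The equivalence of (27) and (28) is easy to check numerically. Below is a minimal sketch with hypothetical matrices (two states, one measurement; the values are assumptions for illustration only).

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical a-priori covariance, output matrix and measurement-noise covariance
M = rng.standard_normal((2, 2))
Sigma_prior = M @ M.T + np.eye(2)          # Sigma(t_k | t_{k-1}), positive definite
C_k = np.array([[1.0, 0.0]])
R_k = np.array([[0.5]])

# Covariance update in the form (27)
S = C_k @ Sigma_prior @ C_k.T + R_k
Sigma_post = Sigma_prior - Sigma_prior @ C_k.T @ np.linalg.inv(S) @ C_k @ Sigma_prior

# Information form (28)
Sigma_post_info = np.linalg.inv(
    np.linalg.inv(Sigma_prior) + C_k.T @ np.linalg.inv(R_k) @ C_k
)

assert np.allclose(Sigma_post, Sigma_post_info)
```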
Continuous-time extrapolation from $t_k$ to $t_{k+1}$ with the initial conditions $\hat{x}(t_k|t_k)$ and $\Sigma(t_k|t_k)$:

$$\dot{\hat{x}}(t|t_k) = A(t)\hat{x}(t|t_k) + B(t)u(t) \qquad (29)$$
$$\dot{\Sigma}(t|t_k) = A(t)\Sigma(t|t_k) + \Sigma(t|t_k)A^{T}(t) + B(t)Q(t)B^{T}(t)\,. \qquad (30)$$

Assuming that the Kalman filter starts with an update at the initial time $t_0$, this Kalman filter is initialized at $t_0$ as follows:

$$\hat{x}(t_0|t_{-1}) = x_0 \qquad (31)$$
$$\Sigma(t_0|t_{-1}) = \Sigma_0\,. \qquad (32)$$
Discrete-time Kalman Filter
We consider the following discrete-time stochastic system:

$$x_{k+1} = F_k x_k + G_k v_k \qquad (33)$$
$$x_0 = \xi \qquad (34)$$
$$y_k = C_k x_k + r_k \qquad (35)$$
$$\xi : N(x_0, \Sigma_0) \qquad (36)$$
$$v_k : N(u_k, Q_k) \qquad (37)$$
$$\mathrm{Cov}(v_i, v_j) = Q_i\,\delta_{ij} \qquad (38)$$
$$r_k : N(\bar{r}_k, R_k) \qquad (39)$$
$$\mathrm{Cov}(r_i, r_j) = R_i\,\delta_{ij} \qquad (40)$$
$$\xi,\, v,\, r:\ \text{mutually independent.} \qquad (41)$$
We have the following (approximate) zero-order-hold-equivalence correspondences to the continuous-time stochastic system:

$$F_k = \Phi(t_{k+1}, t_k) \quad \text{(transition matrix)} \qquad (42)$$
$$G_k = \int_{t_k}^{t_{k+1}} \Phi(t_{k+1}, t)B(t)\,dt \quad \text{(zero-order-hold equivalence)} \qquad (43)$$
$$C_k = C(t_k) \qquad (44)$$
$$Q_k = \frac{Q(t_k)}{t_{k+1} - t_k} \qquad (45)$$
$$R_k = \frac{R(t_k)}{\Delta t_k}\,, \quad \Delta t_k = \text{averaging time at time } t_k\,. \qquad (46)$$
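For a time-invariant system, $\Phi(t_{k+1}, t_k) = e^{A\,\Delta t}$ and the integral in (43) can be obtained from a single matrix exponential of an augmented matrix. A minimal sketch follows; the matrices $A$, $B$, $Q$ and the sampling interval are hypothetical values chosen for illustration.

```python
import numpy as np
from scipy.linalg import expm

# Hypothetical time-invariant continuous-time matrices
A = np.array([[0.0, 1.0],
              [-2.0, -0.7]])
B = np.array([[0.0],
              [1.0]])
Q = np.array([[0.3]])          # intensity of the continuous-time input noise
dt = 0.1                       # sampling interval t_{k+1} - t_k

# Augmented-matrix trick: expm([[A, B], [0, 0]] * dt) contains both F_k and G_k
n, m = A.shape[0], B.shape[1]
M = np.zeros((n + m, n + m))
M[:n, :n] = A
M[:n, n:] = B
Phi = expm(M * dt)

F_k = Phi[:n, :n]              # transition matrix (42)
G_k = Phi[:n, n:]              # zero-order-hold input matrix (43)
Q_k = Q / dt                   # discrete-time noise covariance (45)
```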
The Discrete-Time Kalman Filter
Update at time $t_k$:

$$\hat{x}_{k|k} = \hat{x}_{k|k-1} + \Sigma_{k|k-1}C_k^{T}\left[C_k\Sigma_{k|k-1}C_k^{T} + R_k\right]^{-1}\left[y_k - \bar{r}_k - C_k\hat{x}_{k|k-1}\right] \qquad (47)$$
$$\Sigma_{k|k} = \Sigma_{k|k-1} - \Sigma_{k|k-1}C_k^{T}\left[C_k\Sigma_{k|k-1}C_k^{T} + R_k\right]^{-1}C_k\Sigma_{k|k-1} \qquad (48)$$

or, equivalently:

$$\Sigma^{-1}_{k|k} = \Sigma^{-1}_{k|k-1} + C_k^{T}R_k^{-1}C_k\,. \qquad (49)$$
Extrapolation from $t_k$ to $t_{k+1}$:

$$\hat{x}_{k+1|k} = F_k\hat{x}_{k|k} + G_k u_k \qquad (50)$$
$$\Sigma_{k+1|k} = F_k\Sigma_{k|k}F_k^{T} + G_k Q_k G_k^{T} \qquad (51)$$

Initialization at time $t_0$:

$$\hat{x}_{0|-1} = x_0 \qquad (52)$$
$$\Sigma_{0|-1} = \Sigma_0\,. \qquad (53)$$
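Equations (47)–(53) translate almost literally into code. The following is a minimal sketch, assuming time-invariant $F$, $G$, $C$ and a short hypothetical measurement sequence (all values are assumptions for illustration).

```python
import numpy as np

def kf_update(x_pred, Sigma_pred, y, C, R, r_bar):
    """Measurement update (47)-(48)."""
    S = C @ Sigma_pred @ C.T + R
    K = Sigma_pred @ C.T @ np.linalg.inv(S)
    x_filt = x_pred + K @ (y - r_bar - C @ x_pred)
    Sigma_filt = Sigma_pred - K @ C @ Sigma_pred
    return x_filt, Sigma_filt

def kf_predict(x_filt, Sigma_filt, u, F, G, Q):
    """Extrapolation (50)-(51)."""
    x_pred = F @ x_filt + G @ u
    Sigma_pred = F @ Sigma_filt @ F.T + G @ Q @ G.T
    return x_pred, Sigma_pred

# Hypothetical matrices and data (illustration only)
F = np.array([[1.0, 0.1], [0.0, 1.0]])
G = np.array([[0.005], [0.1]])
C = np.array([[1.0, 0.0]])
Q = np.array([[0.1]])
R = np.array([[0.2]])
r_bar = np.zeros(1)

x, Sigma = np.zeros(2), np.eye(2)          # initialization (52)-(53)
for y, u in zip([np.array([0.1]), np.array([0.3])], [np.zeros(1), np.zeros(1)]):
    x, Sigma = kf_update(x, Sigma, y, C, R, r_bar)
    x, Sigma = kf_predict(x, Sigma, u, F, G, Q)
```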
Extended Kalman Filter
The linearized problem may not be a good approximation, so we stick to the following philosophy:

- Where it does not hurt: analyze the nonlinear system.
- Where it hurts: use linearization.

We consider the following nonlinear stochastic system:

$$\dot{x}(t) = f(x(t), u(t), t) + B(t)v(t) \qquad (54)$$
$$x(t_0) = \xi \qquad (55)$$
$$y(t) = g(x(t), t) + r(t) \qquad (56)$$
$$\xi : N(x_0, \Sigma_0) \qquad (57)$$
$$v : N(\bar{v}(t), Q(t)) \qquad (58)$$
$$r : N(\bar{r}(t), R(t))\,. \qquad (59)$$
The extended Kalman filter

Dynamics of the state estimation:

$$\dot{\hat{x}}(t) = f(\hat{x}(t), u(t), t) + B(t)\bar{v}(t) + H(t)[y(t) - \bar{r}(t) - g(\hat{x}(t), t)] \qquad (60)$$
$$\hat{x}(t_0) = x_0 \qquad (61)$$

with $H(t) = \Sigma(t)C^{T}(t)R^{-1}(t)$. The error covariance matrix $\Sigma(t)$ must be calculated in real-time using the following matrix Riccati differential equation:

$$\dot{\Sigma}(t) = A(t)\Sigma(t) + \Sigma(t)A^{T}(t) - \Sigma(t)C^{T}(t)R^{-1}(t)C(t)\Sigma(t) + B(t)Q(t)B^{T}(t) \qquad (62)$$
$$\Sigma(t_0) = \Sigma_0\,. \qquad (63)$$
The matrices $A(t)$ and $C(t)$ correspond to the dynamics matrix and the output matrix, respectively, of the linearization of the nonlinear system around the estimated trajectory:

$$A(t) = \frac{\partial f(\hat{x}(t), u(t), t)}{\partial x} \qquad (64)$$
$$C(t) = \frac{\partial g(\hat{x}(t), t)}{\partial x}\,. \qquad (65)$$
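If analytic derivatives of $f$ and $g$ are inconvenient, the Jacobians (64)–(65) can be approximated by finite differences along the estimated trajectory. A minimal sketch with a hypothetical nonlinear example (not from the slides):

```python
import numpy as np

def jacobian(fun, x, eps=1e-6):
    """Forward-difference approximation of d fun / d x at the point x."""
    fx = np.atleast_1d(fun(x))
    J = np.zeros((fx.size, x.size))
    for i in range(x.size):
        dx = np.zeros_like(x)
        dx[i] = eps
        J[:, i] = (np.atleast_1d(fun(x + dx)) - fx) / eps
    return J

# Hypothetical nonlinear dynamics and output (illustration only)
f = lambda x: np.array([x[1], -np.sin(x[0]) - 0.1 * x[1]])
g = lambda x: np.array([x[0] ** 2])

x_hat = np.array([0.3, -0.1])
A_t = jacobian(f, x_hat)     # approximates (64) at fixed u(t), t
C_t = jacobian(g, x_hat)     # approximates (65)
```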
Because the dynamics of the error covariance matrix $\Sigma(t)$ correspond to the linearized system, we face the following problems:

- This filter is not optimal.
- The state estimate will be biased.
- The reference point for the linearization is questionable.
- The matrix $\Sigma(t)$ is only an approximation of the state error covariance matrix.
- The state vectors $x(t)$ and $\hat{x}(t)$ are not Gaussian even if $\xi$, $v$, and $r$ are.
Again, the continuous-time measurement corrupted with the white noise $r$ with infinite covariance

$$y(t) = g(x(t), t) + r(t) \qquad (66)$$

must be replaced by an averaged discrete-time measurement

$$y_k = g(x(t_k), t_k) + r_k \qquad (67)$$
$$r_k : N(\bar{r}_k, R_k) \qquad (68)$$
$$\mathrm{Cov}(r_i, r_j) = R_i\,\delta_{ij}\,. \qquad (69)$$
Continuous-time/discrete-time extended Kalman filter
Update at time $t_k$:

$$\hat{x}(t_k|t_k) = \hat{x}(t_k|t_{k-1}) + \Sigma(t_k|t_{k-1})C_k^{T}\left[C_k\Sigma(t_k|t_{k-1})C_k^{T} + R_k\right]^{-1}\left[y_k - \bar{r}_k - g(\hat{x}(t_k|t_{k-1}), t_k)\right] \qquad (70)$$
$$\Sigma(t_k|t_k) = \Sigma(t_k|t_{k-1}) \qquad (71)$$
$$\qquad\quad -\ \Sigma(t_k|t_{k-1})C_k^{T}\left[C_k\Sigma(t_k|t_{k-1})C_k^{T} + R_k\right]^{-1}C_k\Sigma(t_k|t_{k-1}) \qquad (72)$$

Extrapolation between $t_k$ and $t_{k+1}$:

$$\dot{\hat{x}}(t|t_k) = f(\hat{x}(t|t_k), u(t), t) + B(t)\bar{v}(t) \qquad (73)$$
$$\dot{\Sigma}(t|t_k) = A(t)\Sigma(t|t_k) + \Sigma(t|t_k)A^{T}(t) + B(t)Q(t)B^{T}(t) \qquad (74)$$
Again, the matrices $A(t)$ and $C_k$ are the dynamics matrix and the output matrix, respectively, of the nonlinear system linearized around the estimated trajectory:

$$A(t) = \frac{\partial f(\hat{x}(t), u(t), t)}{\partial x} \qquad (75)$$
$$C_k = \frac{\partial g(\hat{x}(t_k|t_{k-1}), t_k)}{\partial x} \qquad (76)$$

Initialization at $t_0$:

$$\hat{x}(t_0|t_{-1}) = x_0 \qquad (77)$$
$$\Sigma(t_0|t_{-1}) = \Sigma_0\,. \qquad (78)$$
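A minimal sketch of the update step (70)–(72) follows, assuming a hypothetical nonlinear measurement function $g$ (a range measurement of a planar position) and its Jacobian $C_k$ evaluated at the a-priori estimate; none of the numerical values come from the slides.

```python
import numpy as np

def ekf_update(x_pred, Sigma_pred, y, g, C_k, R_k, r_bar):
    """Extended Kalman filter measurement update (70)-(72)."""
    S = C_k @ Sigma_pred @ C_k.T + R_k
    K = Sigma_pred @ C_k.T @ np.linalg.inv(S)
    x_filt = x_pred + K @ (y - r_bar - g(x_pred))       # (70)
    Sigma_filt = Sigma_pred - K @ C_k @ Sigma_pred      # (71)-(72)
    return x_filt, Sigma_filt

# Hypothetical example: range measurement of a 2-d position (illustration only)
g = lambda x: np.array([np.sqrt(x[0] ** 2 + x[1] ** 2)])
x_pred = np.array([1.0, 2.0])
C_k = np.array([x_pred / np.linalg.norm(x_pred)])       # Jacobian of g at x_pred
Sigma_pred = 0.5 * np.eye(2)
x_filt, Sigma_filt = ekf_update(x_pred, Sigma_pred, np.array([2.4]), g,
                                C_k, np.array([[0.01]]), np.zeros(1))
```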
Kalman Filter and Parameter Identification
The system and the measurement equations in discrete time are given by

$$x_{k+1} = F_k x_k + G_k u_k + \left[G_k Q_k^{\frac{1}{2}}\right]w_k^{(1)} \qquad (79)$$
$$y_k = C_k x_k + \bar{r}_k + \left[R_k^{\frac{1}{2}}\right]w_k^{(2)} \qquad (80)$$

The update equations of the Kalman Filter are

$$\hat{x}_{k|k} = \hat{x}_{k|k-1} + \Sigma_{k|k-1}C_k^{T}\left[C_k\Sigma_{k|k-1}C_k^{T} + R_k\right]^{-1}v_k \qquad (81)$$
$$v_k = \left[y_k - \bar{r}_k - C_k\hat{x}_{k|k-1}\right] \qquad (82)$$
$$\Sigma_{k|k} = \Sigma_{k|k-1} - \Sigma_{k|k-1}C_k^{T}\left[C_k\Sigma_{k|k-1}C_k^{T} + R_k\right]^{-1}C_k\Sigma_{k|k-1}\,, \qquad (83)$$

where $v_k$ denotes the prediction error.
The prediction equations are given by

$$\hat{x}_{k|k-1} = F_{k-1}\hat{x}_{k-1|k-1} + G_{k-1}u_{k-1} \qquad (84)$$
$$\Sigma_{k|k-1} = F_{k-1}\Sigma_{k-1|k-1}F_{k-1}^{T} + G_{k-1}Q_{k-1}G_{k-1}^{T} \qquad (85)$$

The algorithm is carried out in two distinct parts:

Prediction Step
The first step consists of forming an optimal predictor of the next observation, given all the information available up to time $t_{k-1}$. This results in a so-called a priori estimate for time $t_k$.

Updating Step
The a priori estimate is updated with the new information arriving at time $t_k$, combined with the already available information from time $t_{k-1}$. The result of this step is the filtered estimate, also called the a posteriori estimate.
The prediction error is defined by

$$v_k = y_k - \hat{y}_{k|k-1} = y_k - E[y_k|\mathcal{F}_{k-1}] = y_k - \bar{r}_k - C_k\hat{x}_{k|k-1}$$

and its covariance matrix can be calculated by

$$K_{k|k-1} = \mathrm{Cov}(v_k|\mathcal{F}_{k-1}) = E[v_k v_k^{T}|\mathcal{F}_{k-1}]$$
$$\quad = C_k\,E[\tilde{x}_{k|k-1}\tilde{x}_{k|k-1}^{T}|\mathcal{F}_{k-1}]\,C_k^{T} + E\!\left[R_k^{\frac{1}{2}}w_k^{(2)}\left(R_k^{\frac{1}{2}}w_k^{(2)}\right)^{T}\Big|\mathcal{F}_{k-1}\right]$$
$$\quad = C_k\Sigma_{k|k-1}C_k^{T} + R_k\,,$$

where $\tilde{x}_{k|k-1} = x_k - \hat{x}_{k|k-1}$ is the state prediction error. This result is important for forming the log-likelihood estimator.
We start by examining the appropriate log-likelihood function within our framework of the Kalman Filter. Let $\theta \in \mathbb{R}^{d}$ be the vector of unknown parameters, which belongs to the admissible parameter space $\Theta$. The various system matrices of the state space model, as well as the variances of the stochastic processes given in the previous section, depend on $\theta$. The likelihood function of the state space model is given by the joint density of the observational data $y = (y_N, y_{N-1}, \ldots, y_1)$:

$$l(y, \theta) = p(y_N, y_{N-1}, \ldots, y_1; \theta)\,, \qquad (86)$$

which reflects how likely it would have been to have observed the data if $\theta$ were the true values of the parameters.
Using the definition of conditional probability and employing Bayes's theorem recursively, we now write the joint density as a product of conditional densities:

$$l(y, \theta) = p(y_N|y_{N-1}, \ldots, y_1; \theta)\cdots p(y_k|y_{k-1}, \ldots, y_1; \theta)\cdots p(y_1; \theta)\,, \qquad (87)$$

where we approximate the initial density function $p(y_1; \theta)$ by $p(y_1|y_0; \theta)$. Furthermore, since we deal with a Markovian system, future values of $y_l$ with $l > k$ depend on $(y_k, y_{k-1}, \ldots, y_1)$ only through the current value $y_k$, so the expression in (87) can be rewritten to depend only on the last observed values, and it reduces to

$$l(y, \theta) = p(y_N|y_{N-1}; \theta)\cdots p(y_k|y_{k-1}; \theta)\cdots p(y_1; \theta)\,. \qquad (88)$$
For the purpose of estimating the parameter vector $\theta$ we express the likelihood function in terms of the prediction error. Since the variance of the prediction error $v_k$ is the same as the conditional variance of $y_k$, i.e.,

$$\mathrm{Cov}[v_k|\mathcal{F}_{k-1}] = \mathrm{Cov}[y_k|\mathcal{F}_{k-1}]\,, \qquad (89)$$

we are able to state the density function. The density function $p(y_k|y_{k-1}; \theta)$ is a Gaussian (normal) distribution with conditional mean

$$E[y_k|\mathcal{F}_{k-1}] = \bar{r}_k + C_k\hat{x}_{k|k-1} \qquad (90)$$

and conditional covariance matrix

$$\mathrm{Cov}[y_k|\mathcal{F}_{k-1}] = K_{k|k-1}\,. \qquad (91)$$
The density function of an $n$-dimensional normal distribution can be written in matrix notation as

$$\frac{1}{(2\pi)^{\frac{n}{2}}\sqrt{|\Sigma|}}\;e^{-\frac{1}{2}(x-\mu)^{T}\Sigma^{-1}(x-\mu)}\,,$$

where $\mu$ denotes the mean value and $\Sigma$ the covariance matrix. In our case $x - \mu$ is replaced by $y_k - \bar{r}_k - C_k\hat{x}_{k|k-1} = v_k$ and the covariance matrix is $K_{k|k-1}$. The probability density function thus takes the form

$$p(y_k|y_{k-1}; \theta) = \frac{1}{(2\pi)^{\frac{p}{2}}\sqrt{|K_{k|k-1}|}}\;e^{-\frac{1}{2}v_k^{T}K_{k|k-1}^{-1}v_k}\,, \qquad (92)$$

where $v_k \in \mathbb{R}^{p}$.
Taking the logarithm of (92) yields

$$\ln(p(y_k|y_{k-1}; \theta)) = -\frac{p}{2}\ln(2\pi) - \frac{1}{2}\ln|K_{k|k-1}| - \frac{1}{2}v_k^{T}K_{k|k-1}^{-1}v_k\,, \qquad (93)$$

which gives us the whole log-likelihood function as the sum

$$L(y, \theta) = -\frac{1}{2}\sum_{k=1}^{N}\left(p\ln(2\pi) + \ln|K_{k|k-1}| + v_k^{T}K_{k|k-1}^{-1}v_k\right)\,. \qquad (94)$$

In order to estimate the unknown parameters from the log-likelihood function in (94), we employ an appropriate optimization routine which maximizes $L(y, \theta)$ with respect to $\theta$.
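A minimal sketch of how (94) could be evaluated and maximized is given below, assuming a hypothetical scalar model $x_{k+1} = F x_k + w_k$, $y_k = C x_k + r_k$ in which the unknown parameter vector is $\theta = (F, Q, R)$ (so $p = 1$); the data and optimizer settings are purely illustrative. Maximizing $L(y, \theta)$ is done by minimizing its negative.

```python
import numpy as np
from scipy.optimize import minimize

def neg_log_likelihood(theta, y, C=1.0, x0=0.0, Sigma0=1.0):
    """-L(y, theta) from (94) for the scalar model x_{k+1} = F x_k + w, y_k = C x_k + r."""
    F, Q, R = theta
    x_pred, Sigma_pred = x0, Sigma0
    nll = 0.0
    for yk in y:
        v = yk - C * x_pred                      # prediction error
        K = C * Sigma_pred * C + R               # innovation covariance K_{k|k-1}
        nll += 0.5 * (np.log(2 * np.pi) + np.log(K) + v * v / K)
        # update (81), (83) and prediction (84), (85), scalar case
        gain = Sigma_pred * C / K
        x_filt = x_pred + gain * v
        Sigma_filt = Sigma_pred - gain * C * Sigma_pred
        x_pred = F * x_filt
        Sigma_pred = F * Sigma_filt * F + Q
    return nll

y = np.array([0.2, 0.1, -0.3, 0.4, 0.0])         # hypothetical observations
res = minimize(neg_log_likelihood, x0=np.array([0.5, 0.1, 0.1]), args=(y,),
               bounds=[(-0.99, 0.99), (1e-6, None), (1e-6, None)])
theta_ml = res.x                                  # maximizes L(y, theta), cf. (95)
```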
Given the simplicity of the innovation form of the likelihood function, we can find the values of the conditional expectation of the prediction error and the conditional covariance matrix for every given parameter vector $\theta$ using the Kalman Filter algorithm. Hence, we are able to calculate the value of the conditional log-likelihood function numerically. The parameters are estimated based on the optimization

$$\hat{\theta}_{ML} = \arg\max_{\theta\in\Theta} L(y, \theta)\,, \qquad (95)$$

where $\Theta$ denotes the parameter space. The optimization is either an unconstrained optimization problem if $\Theta = \mathbb{R}^{d}$ or a constrained optimization problem if the parameter space $\Theta \subset \mathbb{R}^{d}$.