You are on page 1of 2

Copyright C IFAC ModeliDa and Comrol in Biomedical

Syaans. Galveam. Teua. USA. 1994

Regularization Networks for System Identification and Prediction


of Glucose Kinetics

Z. Trajanoskitl, W. Regittnig§, P. Ko~o§, F. Skrabal§, and P. Wach l

*Institute of Biomedical Engineering, Graz University of Technology. Inffeldgasse 18. 8010 Graz
§Deparunent of Internal Medicine. Krankenhaus der Barmherzigen Brtider. Marschallgasse 12. 8020 Graz. Austria

Abstract - A neural network for non-linear system NARMA model. The exact form of the function f
identification and prediction is presented. A describing a real-world system is usually unknown. Hence.
regularization network which is equivalent to radial the uknown function f had to be recovered from data set
basis function neural network was used for obtained by sampling.
identification of a Non-linear AutoRegressive Moving
Avarage model with/without eXogenous inputs B. Regularization Network
(NARMAXINARMA). NARMAX system identification
The inverse problem of recovering the function f . or an
and validation was performed from simulated data from
a publisbed model for the dynamics of blood glucose in a estimate of it from a sparse data set is clearly an ill-posed
diabetic state. A NARMA model was used for short- one. Hence, in order to reconstruct the function f one
term prediction of subcutaneous tissue glucose time- needs some a priori knowledge about it. In many cases the
series from healthy SUbjects. a priori assumption about the function f is that it is
smooth. Thus, it can be shown either in the context of
I. IN1RODUCTlON functional analysis [10] or in a probabilistic framework [7]
that the solution of the ill-posed problem can be obtained
In recent years. both theory and praxis of artificial neural by minimizing the functional:
networks and non-linear systems have advanced N
considerably. Feed forward multi-layered neural networks
have been increasingly employed for dynamic system
H[J] = L (J(xJ- y} +Njl[J] (2)
;=1
modelling and identification of systems which exhibit where cp[J] is a smoothness functional or stabilizer and A.
complex non-linear characteristics.
is the regularization parameter which controls the
In the present study we have applied regularization
compromise between the degree of smoothness of the
networks for identification of the glucose metabolism in a
solution and its closeness to the data. Depending on the
diabetic state and for modeling subcutaneous tissue glucose
choice of the stabilizer (which corresponds to different
time-series from healthy subjects.
assumptions of smoothness) different solutions can be
obtained [9]. For the class of radial stabilizers
11. MErnODS
cp[J(x)] = cp[f(Rx)] for any rotation matrix R the well
A. System Representation: the NARMAX -Model known radial basis function neural network [3, 8] is
recovered. It can be shown, that a radial stabilizer of the
The Non-linear AutoRegressive Moving Avarage model form :
with eXogenous inputs (NARMAX) is a concise
description of a system in terms of a non-linear functional (3)
expansion of past inputs. outputs. and prediction errors [2]:

y(t) = f(y{t -I) .... y{t - nJ,u{t -I) ... .u(t - nJ, -~
(1) where ](s) is the Fourier transform of f{x) , G(s) = e ~
E(t -1) .... E(t - nE) + E(t)
and ~ is a fixed parameter, leads to the approximation:
where y(t).U(t).E(t) are the system output (glucose), input n
(insulin), and prediction error. or residual respectively. and f(x}= Lc;G(lIx-t;ID (4)
f( x) is some non-linear function. For nu = 0 the ;=1
relationship (I) represents a non-linear time-series or a
where n < N , t; are some appropriately chosen centers.
tEmail : uajanoS@ibmLw-graz.ac.at G(r) =e-r' , and c; are constant coefficients.

303
In many practical applications, the components of the The identified networks have succesfully passed 11
input vector are of different type and the norm Ix - I
tj must validation tests for non-linear systems [6]. However, iJ
be considered as a weighted norm: contrast to out previuos worK [11], with the presentcG
regularization networK it was not necessary to use movin!
(5) average terms in the equation (1).

since the scale of the components would be otherwise IV. DISCUSSION


arbitrary. In this study, a diagonal matrix W was used, in
which case the elements of the matrix are equivalent to the In the case of system identification with neural networks,
inverse of the variance ~ of each component of the it is obvious that different prior assumptions about Ihe
multidimensional Gaussian in equation (4). smoothness of the function which should be reconstructcG
lead to different network architectures. The presented stOOy
C.ldentification and Validation . demonstrates that a regUlarization network can be obtained
for NARMAX identification of glucose kinetics. The resuJlS
Learning of the network (4), i.e. identification of the
of this worK encourage further studies of netwoR
NARMAX-Model is a non-linear optimization problem:
application for an adaptive closed loop control of glucose (I'
given a set of data, fmd the coefficients Cj' the coordinates
hypoglycemia prediction in patients with diabetes mellitus.
of the centers tj and the elements of the matrix W. The
functional H[t] is in general non-convex and a stochastic ACKNOWLEDGEMENT
gradient descent can be applied [9]. However, for a given
This worK was supported by the Austrian Science
matrix W and for known , centers tj the network (4) is
Foundation, Grant No. S4906 and P09142-Med.
linear in the parameters and the coefficients Cj can be
determined from the equation [9]: REfERENCES
c=(GTG+).gtGTy (6) [l] BERGMAN R.N., Y2. iDER, c.R. BOWDEN, AND C. COBEut
"Quultitative estimation of insulin sensitivity". Am J Physioi, 236:
with the n x n matrix gall = G( ta; t~). and the N x n matrix E667-E677, 1979
G,n = G(x,;t a ). Here we have used the following approach: [2] CHEN S. AND S.A. Bn..LINGs, "Representation of non-linear systems: die
1) use a priori knowledge for determining the matrix W, NARMAX model",lnll Control, 49 : 1013-1042, 1989.
2) select the significant centers, and 3) calculate the [3] CHEN. S .• C.F.N. COWAN, AND P.M. GRANT, "Orthogonalleast squares
coefficients Cj from equation (6). For the elements of the learning algorithm for radial basis function networks" , IEEE TraIlS
Neural Networks. 2: 302-309,1991.
scaling matrix W the modified indices of insulin
sensitivity and glucose effectiveness [1] were used. The [4] COB ELL! c., G. FEDERSPIL. G. PACINI, A. SALVAN, AND C.
SCANDELLARl; "An integrated mathematical model of the dynamics cl
centers can be selected from the available data points by the
blood glucose and its hormonal control". Math Biosci, 58:27-60. 1982
lC-means clustering technique (for on-line identification)
(5) CRAVEN P. ANDG. WAHBA: "Smoothing noisy data with splinefunctioru
[8] or the orthogonal least squares method (for off-line
- estimating the correct degree of smoothing by the method of generalized
identification) [3]. The regularization parameter A was
cross-validation ", Numer Math, 31 : 377-403, 1979
determined by the generalized cross-validation method [5].
(6) LEONTARITlS l.J. AND SA. Bn..LINGS. "Model selection and validatioo
After identification, the identified model was supplied to
methods for non-linear systems".lnt J Control. 45: 311-341, 1987.
validation tests for non-linear systems [6].
[7] MARROQulN J.L., S. MrrrER. AND T. POGGIO: "Probabilistic solution cl
ill-posed problems in computational vision. J Amer Stat Assoc, 82:76-
III. REsULTS
89, 1987
A published model [4] , adjusted to a diabetic state was (8) MOODY J. AND C. DARKEN, "Fast learning in networks of locaIly·tuned
used to generate the data for identification and validation. processing writs", Neural Computation. I: 281-294,1989.
The simulated data were corrupted by additive white noise [9] POGGIO T., AND F. GJRQ5I; "A theory of networks for approximation and
of zero mean and 2 % coefficient of variation and were learning", A.1. Memo 1140, Artificial Intellegence Laboralory,
then submitted to the network for identification. 300 data Massachusetts Institute of Technology, 1990
points from a simulated intravenous glucose tolerance test (10)TIKHONOV A.N. AND V.Y. ARsENIN: Solutions of ill.posed problenu,
and 200 data points from an oral glucose tolerance test New York. Wiley. 1977
were used for identification and validation, respectively. [1 I]TRAJANOSKI Z.• P. WArn. R. GROBLER. H. KONTSCHIEDER, P.
Additionally, subcutaneously measured glucose time-series KOTANKO. AND F. SKRABAL: "NARMA modeling of subcutaneous
from healthy subjects were used for NARMA identification tissue glucose time-series using neural networks", Proc 15th Ann 1nl
and one step a-head prediction as previously shown [11]. ConflEEE Eng Med Bioi Soc, San Diego, USA. 538-539,1993

304

You might also like