Professional Documents
Culture Documents
Introduction
of c, or the bestapproximation
of the originalc. Then,the
0043-1397/99/1998WR900093509.00
1220
KOUTSOYIANNIS:
OPTIMAL
DECOMPOSITION
OF COVARIANCE
MATRICES
random
variables
and lowercase
2.
Stochastic
Model
Y = aZ + bV
(1)
(3)
haveY -= Xt andZ = [(Xt-1)T, (Xt-2)r] r (wheretheexponent T denotesthe transposeof a matrix or vector), so that
rn = 2n. Similar is the situationfor the PAR(2) model. In
anotherexample,Valenciaand Schaake's[1972, 1973] disaggregationmodel,Y representsthe n = 12m monthlyhydrologicvariablesat rn sites,whereasZ representsthe rn annual
valuesat the samelocations.Someother disaggregation
models can be also reduced in the form (1) after appropriate
assignments
of the variablesY and Z.
The model parametersa and b are typicallydeterminedby
the moment
estimators
(4)
(5)
E{(,-
E[,])( r - E[]r)}
with E[
] denotingex-
(6)
(7)
KOUTSOYIANNIS:
OPTIMAL
DECOMPOSITION
3[V] = (b(3))-l{/.l,3[Y]-/3[aZ]}
(8)
OF COVARIANCE
MATRICES
1221
ticefollowed
inAR models
(e.g.,whenY -=Xt andZ -=Xt- 1),
whichmay causeinconsistencies
evenin the caseof usingthe
completeform of a and havingno missingdata: This occurs
covariance
matrices(e.g.,Cov[Xt,
vector, thatis,/3[] := E { ( - E[]) 3} andb(3)denotes whenthe contemporaneous
the matrixwhoseelementsare the cubesof b. (Throughoutthis
laggedcovariance
matrices
(e.g.,Coy[Xt, Xt- 1]) are
paperwewillextendthisnotation,
thatis,u for anymatrix whereas
estimatedusing(unavoidably)smallerlengths(e.g.,k - 1).
or vectoru and anypowerk.) Equation(8) is a generalization
Given all the aboveequationsfor the modelparameterswe
of those given by Matalas and Wallis [1976, p. 64], Todini
will focuson the estimationof b, whichalsoaffectsbothE[V]
[1980], and Koutsoyiannis
and Manetas [1996]. Moments of
and/3[V], as followsfrom (6)-(9).
order greaterthan 3 are not used,nor cantheybe estimatedin
a similar manner.
/.I,
3[V] -- (b(3))-l{/.i,3[Y]
- a(3)/.i,3[Z]}
(9)
is,
(12)
h := diag(1/f11,
' ' ' , 1/Cnn)
be replacedby a simplified
form so that if a/ is the only
nonzero element for row i, then
(13)
so that
k =j
aik = O
k - j
c' := hch
(10)
(14)
Cov[Xn,Xn-1]/Var
[XSn-1])
(11)
qo:= h(3){/3[Y
] - /3[aZ])
(8)
bbr. We mustemphasize
that the model(1) with parameter
(17)
----(b'(3))-lqo
(19)
1222
Equations
(16) and(19) constitute
thebasisfor theproposed late derivatives,
asis the casein the procedure
that we will
method.Since(16) doesnot alwayshavea real solution,we set develop.However,we recall that if
d := b'b'- c'
(20)
I111'=
(26)
.=
possible.
Thisrequirement
canbeexpressed
mathematically
as
n
minimize
I[dl[
2'= d
i--1
isthepth normof
(21)
j=l
<27>
timization
procedure
isfasterfor Ildll"
thanfor Ildll.
a constrained
optimization
problem
withthreeelements:
(1)
In addition,we mustseta restriction
that all diagonalele- theobjective
hnction(21)to be minimized,
(2) theequali
mentsof d shouldbe exactlyzero.To justifythisrequirement, constraint
(23), and (3) the inequaliconstraint
(27). Note
we observethatif the diagonalelementsof d are zero,thenall that(27)isrelated
tob' through
(19).In termsoftheLagrangdiagonal
elements
of bbr willequalthoseof c.In turn,thiswill ian method,the objectivefunctionand the constraintscan be
resultin preservation
of thediagonal
elements
of Cov[Y,Y] combined
intotheunconstrained
minimization
problem
(asimpliedfrom(5) and(12)), thatis,in preservation
of the
variancesof all elementsof Y. The preservation
of the vari-
minimize02(b' ,
facc) (28)
=
formeris relatedto thepreservation
of the marginaldistributionfunctions
of thecomponents
ofY. To express
thisrequire- whereX2andX3areLagrangian
multipliers
for the equali
mentmathematically,
we introducethe diagonalmatrix
constraint
(23)andtheinequaliconstraint
(27),respectively.
A generaltheoreticalframeworkfor solvingthis problem
d* '= diag (dll , ''', dnn
)
(22)
mightconsistof o steps:In the firststepwe assume
that
and demand that
X3 = 0 (i.e.,ignoretheinequaliW
constraint)
anddetermine
b'
IId*l[ = 0
byminimizing
f(b', X2,0). (Thisstepmaybeavoided
if c' is
(23) positivedefinite,in whichcaseb' canbe determined
by the
pical decomposition
methods.)
If thesolution
b' of thisstep
Anotherrestriction
arises
whenweconsider
thepreservation satisfies
(27), thenthissolutionis the optimalone.Otheise,
weproceed
to thesecond
step,wherewe performtheoptiminessof V (i.e., ) that preservethe coefficients
of skewness
of zationof the complete
function(28) (i.e.,with X3 0) to
Y are thoseobtainedby (19). However,(19) mayresultin simultaneously
determine
b', X2,andX3.For thisoptimization
arbitrarilyhighelements
of if no relevantprovision
for b is theKuhn-Tucker
optimaliconditions
(see,forexample,
Marsought.
Nevertheless,
in themodelapplication
phase(i.e.,the low [1993,p. 271]) mustbe satisfied.
generation
of synthetic
data),an arbitrarilyhighcoefficient
of
Apparently,
dueto thecomplehof theproblemequations,
skewness
ishardlyachievable.
Specifically,
it iswellknownthat an analyticalsolutionof equationsis not attainable,and a
of the coefficients of skewnessof Y. The coefficients of skew-
numericaloptimizationschememustbe established.
initial
bounded
[Wallis
etal.,1974;Kirby,1974;Todini,1980]between attemptto establish
a numerical
procedure
for (28) (using
--ub and {ub,where
pical available
solvers
for multivariate
constrained
optimNation)indicated
thatit isnoteasyto optimize
it simultaneously
k-2
for the parameters
b' andthe Lagrangemultipliers.
Particularly,it wasobseedthat the initiallyassumed
valuesof the
multipliers
do not advance
towardan optimalsoluIn particular,
a seriesof generated
valuesVr(r = 1, "', k) Lagrange
tion
but
remain
almost
constant
through
theconsecutive
steps
will haveskewness
q-ubonlyif all but onevalueof the series
of
numerical
optimization.
At
the
same
time,
the
objective
areequal.Apparently,
thisis not an accepted
arrangement
of
trajectothroughtheconsecthe generatedseries,and thusan acceptable
coefficient
of functionitselfhasa decreasing
owingto theappropriate
modification
of parameskewness
accmustbe muchlessthanub(e.g.,acc= 0'5ub; utivesteps,
ters
b'.
is
indicates
that
the
formulation
of
the
problem
in
fork = 1000,acc= 0.5X 10000'5
16).Consequently,
we
termsof (28) is not effectivein practice.
mustsetthe restriction
thatall n auxiliary
variables
Vi (i =
A directalternative
is the introduction
of penalties
for the
1, ..., n) havecoefficients
of skewness
lessthanacc,thatis,
constraints
[e.g.,Piece,1986,pp.21,333-340].is alternative
ub
=x/k x/
(24)
objective
functionand the constraints
into a singleunconThe handlingof (25) in an optimization
problemis not strainedobjectivefunction;aswe will see,thisfunctioncanbe
mathematically
convenient,
especially
whenwe wishto calcu- ve similarto (28). The essentialdifferencebeeen the o
KOUTSOYIANNIS:
formulations
OPTIMAL
DECOMPOSITION
the coeffi-
OF COVARIANCE
MATRICES
1223
nents
of 02withrespect
to theunknown
parameters
bi' This
may seema difficulttask to accomplishanalytically.However,
it is not at all intractable.
of the reader we
termxlld*ll
, whereX2is assigned
a largevaluesothatevena
slightdeparture
of IId*11
fromzeroresults
in a significant
"pen-
doz
ab[
abe2
Oc
Ooz
ab[n
ab. abe2
ab
(30)
x311llintotheobjective
function
(21),whereX3isaweighting
Ob'nl
Ob'n2
Obnn
'
enough
whenIIll -<
Theresulting
objective
function
in for the matrix of its partial derivativeswith respectto all b'ij,
this case is
of 02 are
(29)
Here,wehavedividedIldll
byn2, thenumberof elements
of d,
to convert
thetotal"square
error"IId[I
intoanaverage
square
errorfor eachelementof d; similarly,
wehavedividedIId*11
by
n becaused* is diagonal,so that the number of nonzero elementsisn. The coefficienthi in the firstterm of the right-hand
side of (29) may be consideredequal to 1 unlessspecified
otherwise(see the casestudyin section5). We observethat
(29) is similarto (28) from a practicalpoint of view:As discussedbefore,the theoreticaldifferencethat the parametersX
are presetconstantsin (29) hasno practicalimplication;also
= 4db'
db'
--=
db'
4d*b'
--db' = -611g
I}-w
(31)
(32)
(33)
the omission
in (29) of (2cc,
whichis a constant,
apparently
(34)
A reasonable
choicefor X2to ensurethatIId*11
will be very and is a vector definedby
closetozero(sothatconstraint
(23)willhold)isX2= 103;this
.= [(b,(3))-]rt(p-)
value assigns
a weightof the averagesquareerror of the diagonal elements3 orders of magnitudehigher than that of the
off-diagonalones.For the choiceof an appropriatevalueof X3
letusassume
thatanacceptable
valueof Ildll/n2 (thefirstterm
of theright-hand
sideof (29))willbe of theorderof 10-3
(recallthat c' is standardizedso that its off-diagonalelements
arelessthan1), whereas
an acceptable
valueof x311ll},
as
resultsfrom (24) andthe subsequent
discussion,
will be (.3k/
4), wherek is the sizeof the syntheticrecordto be generated.
Assumingthat the acceptable
valuesof the first and the third
term are of the sameorder of magnitude,so that both terms
haveaboutthe samesignificance
in the objectivefunction(29),
(35)
Thustherequired
matrixof derivatives
of 02withrespect
to
theunknown
parameters
bo is
t
dO2
431
432
d-= n2rib'+
d*b'-6x311ll?w (36)
It is apparentfrom the objectivefunction(29) and its derivatives(36) that we havea typicalnonlinearoptimizationproblem,whosesolutioncanbe achievedby iterations,startingwith
an initial matrix b '[1.In the/th iteration we start with a known
b andwefindan"improved"
matrixb'[+ll; werepeatthis
we conclude
that h.3k/4 10-3 or ,3 4 x 10-3/k. We procedureuntil the solutionconverges.Severalalgorithmsare
in theliterature
foradvancing
fromb'[q tob'[+11(see
may assumethat in typicalgenerationproblemsof stochastic known
hydrologyk does exceed40, so that a maximumvalue of X3 Maysand Tung[1996,p. 6.12], amongothers,who summarize
mustbe about10-4. We mayusea smallervalueof X3(e.g., the most commonones).Among them, the most suitablefor
10-5-10-6) eitherif the samplesizeis of a higherorderof our caseare the steepestdescentand Fletcher-Reevesconjumagnitudeor if we wish to give a greater importanceto the gate gradientmethods.These are chosenfor their mathematpreservationof covariances
rather than to that of the coeffi- ical simplicityand convenienceand their low memoryrequirecients of skewness.
ments. Specifically,the mathematical formulation of both
The establishment
of a numericalprocedure
for minimizing methods,which typicallyassumea vector arrangementof the
unknown parameters,can be directly adapted for our case,
the objectivefunction(29) is givenin the next section.
which
uses
a matrix
arrangement
oftheunknown
parameters.
4.
Optimization Procedure
1224
Studyfor Subperiods
s = 8 (May)ands - 1 = 7 (April)
i=1
i=2
i=3
i=4
i=5
i=6
Location
Means
97.3
53.1
111.2
69.5
59.8
43.2
100.4
65.3
20.1
9.4
31.5
19.3
35.0
20.2
57.6
30.8
17.4
18.8
53.6
26.5
10.7
7.2
27.1
14.4
Standard deviations
StDev[X,S.]
StDev[X]
Coefficients of skewness
cs[xf
cs[xf]
Cross-correlation
0.72
0.76
1.04
0.89
0.58
0.87
1.07
0.81
1.06
1.53
1.72
1.49
1.00
0.76
0.85
0.56
0.22
0.16
0.76
1.00
0.67
0.90
0.30
0.43
0.85
0.67
1.00
0.58
0.69
0.14
0.56
0.90
0.58
1.00
0,37
0.50
0.22
0.30
0.69
0.37
1.00
0.83
0.16
0.43
0.14
0.50
0.83
1.00
1.00
0.54
0.76
0.53
0.57
0.49
0.54
1.00
0.29
0.81
0.15
0.67
0.76
0.29
1.00
0.20
0.75
0.23
0.53
0.81
0.20
1.00
0.27
0.45
0.57
0.15
0.75
0.27
1.00
0.22
0.49
0.67
0.23
0.45
0.22
1.00
0.60
0.78
0.80
coefficients
Corr[X-, Xj- ]
j=
1
J=2
J=3
J=4
j=5
J=6
Co[xf,
j=
1
j=2
j=3
j=4
j=5
j=6
Autocorrelation
coefficients
Corr[X,X- i]
b'[1
istostartdecomposing
thematrix
c' bytriangular
decom-
position,
whichis the simplest
of all procedures
[Press
et al.,
Boththesemethodsaredescribed
bythefollowingcommon 1992,p. 97], and occasionally
performingsomecorrections
expression,
adaptedfrom the typicalexpression
givenin the when this procedurefails. This will be demonstratedmore
literaturefor vector-arranged
variables
[e.g.,Maysand Tung, clearlyin thecasestudybelow(section5).
1996,p. 6.12;Presset al., 1992,p. 422],
db']
q-
5. Case Study
dbt/[l
111 (37)To illustratethe developedmethod,we presenta partof a
realworldcasestudy.Thiscasestudyaimsat thegeneration
of
simultaneousmonthly rainfall and runoff at three basins,
[/-1] 2
namely,Evinos,Mornos,andYliki, supplying
waterto Athens,
Greece[Nalbantis
andKoutsoyiannis,
1997].Thisconstitutes
a
multivariate
generation
problemwithsixlocations
(twovarifor the Fletcher-Reeves
method,and[/+ 1] is a scalarwhose ablesx threebasins).
As a general
framework
for thegenervalueis obtainedby a line searchalgorithmfor eachiteration ation, the simpledisaggregation
model [Koutsoyiannis
and
l. For l = 0 (firststep)the Fletcher-Reeves
methodcannotbe Manetas,1996]was adopted.Specifically,
as a first stepthe
where3,[/1-- 0 forthesteepest
descent
method,
3,[l]:11(d02
2/ d02
II
38>
usedbecause
(dO2/db')
[-1 is not definedandthuswe must annual
variables
foralllocations
aregenerated
using
a multiusethesteepest
descent
method(i.e.,3,[1= 0). Fortheother variateAR(1) model.Thentheannualquantities
aredisaggre-
steps,the numerical
applications
haveindicated
thatthe gatedinto monthlyones,usinga combinationof a multivariate
Fletcher-Reeves
methodis fasterand thuspreferable.However,it wasobserved
that in someinstances
duringthe iteration procedurethe Fletcher-Reeves
methodmaybecometoo
slow;in suchinstances
theprocedure
is accelerated
if weperform oneiterationwith the steepestdescentmethodandthen
contemporaneous
seasonal
AR(1) (or PAR(i)) modeland an
accurateadjustingprocedure,as describedby Koutsoyiannis
theprevious
monthXs- . Themodelparameters
aregivenby
(11),(5), (6), (7), and(9), in whichY andZ mustbe replaced
b'[1.
Thisisnotsoimportant
because,
asdemonstrated
in the withX andX- , respectively.
casestudybelow(section5), the methodisveryfastin the first
In Table1 we display
the samplestatistics
of the monthly
phaseof the iterationprocedure.Evenif we startwith a b'[1
leadingto an unrealistically
highvalueof the objective
function, thisvalueis dramaticallyreducedin the first few iterations.The generalideafor the construction
of the initial matrix
rainfallandrunoffdatathat arenecessary
for estimating
the
contemporaneous
PAR(i) model parametersfor subperiod
(month)s = 8 (May;notethat the hydrologic
yearstartsin
October).Theseare the first threemarginalmomentsat all
KOUTSOYIANNIS:
OPTIMAL
DECOMPOSITION
OF COVARIANCE
MATRICES
1225
1226
KOUTSOYIANNIS:
OPTIMAL
DECOMPOSITION
OF COVARIANCE
'61
MATRICES
56
(c) 'I-i
-?1.0 (d)
--
0.5
....
_
.....
.....
::
... -P':i0.0
0.5 '''
(e),
o
:.
(fi
.o
b' u
.:-t..
."'-.-.,.(
'"'.!.:,
Figure 1. Plotsof the elementsof matricesb' for the varioussolutionsexaminedin the casestudy:(a) initial
solution0; (b) initialsolution0a;(c) finalsolution1; (d) finalsolutionla; (e) finalsolution2; (f) finalsolution
3 (seetext).
As a further investigation,we also assessedthe dltimate
To acquirean indicationof how low the error in preserving
the crosscorrelationscould ultimately be if we ignore com- lowest
valueof thecoefficient
of skewness
maxj({j)byignorpletely the preservationof skewness,
we performedanother ing the error in preservingcrosscovariances.Thus we peroptimization,setting3 = 0 in the objectivefunction.The formedanotherminimizationof the objectivefunction,setting
finalsolution
3
resultingfinal solution2 is showngraphicallyin Figure le, and = 0, 2 = 103,and3 = 10-s-Theresulting
the relevant values of the objectivefunction and its compo- is showngraphicallyin Figure lf, andthe relevantvaluesof the
nents are shown in Table 2. We observe that the further reobjectivefunctionand its componentsare shownin Table 2.
(Also,in Comparison
Withmaxj({)) fortheInitialandFinalSolutions
of theCaseStudy
Examined
Initial Solution 0
Initial Solution 0a
Final Solution 27
Final Solution 35
Ila*ll
Ilall
IIll
0.3238
0.3297
29,089.50
0.0012
o.6394
18.31
0.0000
o.1404
5.50
0.0000
o.1386
7.94
0.0000
o.8136
4.58
max(j)
29,089.50
18.31
5.37
7.19
4.57
4.1806
0.1076
0.0292
*Optimum
forthecombination
of IlallandI111
(-1= 1, '2 = 103,X3= 10-s).
?Optimum
for Ilall(-1-- 1, '2 -- 103,X3= 0).
$Optimum
forI111,
(x - 0, '2 -- 103,X3= 10-s).
0.0231
0.0145
KOUTSOYIANNIS:
OPTIMAL
DECOMPOSITION
OF COVARIANCE
MATRICES
IId*11,
Ildll,e
1227
I1{11
tl........
tS'utinO
S'utinOa/
10[[?!!!!5!!!
......
!!!!:.!!!!!
......
!!!!!!!:.!:.
.....
t : IId*ll---e'--IId*11
1000
100
-:::
10
12
14
:;;;;
16
18
7; r..
20
22
24
26
28
...........
30
32
10
36
Iteraaon
38
40
Final
number
observe that all final solutions 1, la, 2, and 3 are similar to each
methods.
coefficient
coefficients.
Table3. Cross-Correlation
Coefficients
Corr[X,Xj] Resulting
fromb' of theFinal
Solution
la
i=1
j=
1
j=2
J=3
j=4
j---5
J-6
1.00(0.00)
0.51(-0.03)
0.77(0.01)
0.54(0.01)
0.54(-0.03)
0.49(0.00)
i=2
0.51(-0.03)
1.00(0.00)
0.28(-0.01)
0.80 (-0.01)
0.17 (0.02)
0.67 (0.00)
i=3
0.77(0.01)
0.28 (-0.01)
1.00(0.00)
0.20 (0.00)
0.74(-0.01)
0.23 (0.00)
i=4
0.54(0.01)
0.80(-0.01)
0.20(0.00)
1.00(0.00)
0.26(-0.01)
0.45(0.00)
i=5
0.54(-0.03)
0.17(0.02)
0.74(-0.01)
0.26(-0.01)
1.00(0.00)
0.22(0.00)
Thevalues
in parentheses
arethedifferences
fromthecorresponding
values
of Table1.
i=6
0.49(0.00)
0.67 (0.00)
0.23 (0.00)
0.45 (0.00)
0.22 (0.00)
1.00(0.00)
1228
KOUTSOYIANNIS:
OPTIMAL
DECOMPOSITION
OF COVARIANCE
MATRICES
E E 2d,l
=2E E d,l(b
kjli
q-bjki)
011dll'
2 dlabj
+2 dilblj (A8)
lldll'
4 d,$q
(A9)
2d- - 2 dk(b
jS
+bj8ki)
= 4 d, jJki
= 4di,
bJj
(A10)
value (e.g.,givenas an outputof anothermodel).Obviously,
k=l
thisimpliesa linear constraintfor the elementsof a singlerow
of the decomposed
matrix.This constraintcan be easilyincorwhichproves(32).
poratedin the optimizationframework,the simplestwaybeing
To determine
thepartialderivatives
of I111,wemustfirst
the appendingof a penaltyterm into the objectivefunction.
find the partial derivativesof g. We denote
Appendix: Proof of (31)-(33)
g := (b'(3))-
(A11)
g = gqo
(A12)
dtcl
= E brbr-Cl
(A1)
r=l
Og
Og
,=
,
obo obo
so that
(a3)
gb'(3)= I
(A14)
Clearly,
0g
Obr
, = 8i8
Obo
0b(3)
, = o
b' (3)+
g Obo
(A3)
(a5)
where
thus
8o:=0
ij
8o-= 1
i=j
0g
(a4)
Therefore,
0b' (3)
0b' (3)
=
, g
=, (b'))-
-g Obo
Ob$ g Ob
o
(A16)
Odkl
---r -
bkrlirjq-
Obij r=l
0g
blrSkiSrj
r=l
....
obo
,
(AS)
-g
0b(3)
obo
, g
0b(3)
, g
gobo
(a7)
Consequently,
which results in
Odk
t
obo
- bj81iq-bjSki
__ n n Ob
;r3
Ob
i. r=s=
t9
k --EEgks
19%---t
r (A18)
----7--
(A6)
Thepartialderivative
of Ildll
2withrespect
tobi willbe
and since
KOUTSOYIANNIS:
OPTIMAL
Obtsr3
-- = 3bsr
'2 sisrj
obij
DECOMPOSITION
(A19)
we get
OF COVARIANCE
MATRICES
1229
Obo
-3 Z
r=l
srSsiSrjr
(A20)
s=l
or
Ok
b'2
= --3Z gki
ir rjr
Ob
ij
(A21)
r=l
Koutsoyiannis,
D., and A. Manetas,Simpledisaggregation
by accurate
adjustingprocedures,WaterResour.Res.,32, 2105-2117, 1996.
Lane, W. L., Applied stochastictechniques,user'smanual, Eng. and
and finally,
, = -3g:ib
abij
(A22)
York, 1993.
IIll. Assuming
thatp iseven,(26)canbewritten
Ilgllg
=
(A23)
=
so that
, = _6IlgllL"
ijj Z k-lgki
0ab
IIgllg
ij
k=l
1986.
(A25)
Press,W. H., S. A. Teukolsky,W. T. Vetterling, and B. P. Flannery,
NumericalRecipesin C, CambridgeUniv. Press,New York, 1992.
Rasmussen,P. F., J. D. Salas,L. Fagherazzi,J.-C. Rassam,and B.
Bob6e, Estimation and validation of contemporaneousPARMA
models for streamflow simulation, Water Resour. Res., 32, 31513160, 1996.
E -lg:i= i
(A26)
k=l
Salas,J. D., Analysisand modelingof hydrologictime series,in Handbookof Hydrology,edited by D. Maidment, chap. 19, McGraw-Hill,
New York, 1993.
Colo., 1988.
(A27)
k=l
so that, finally,
--ab = -611gll-w/
(A28)
whichproves(33).
P. Rasmussen
E. Todini
for
D. Koutsoyiannis,
Departmentof Water Resources,Facultyof Civil
Engineering,National TechnicalUniversity,Heroon Polytechniou5,
GR-157 80 Zografou,Athens,Greece.(dk@hydro.ntua.gr)