Professional Documents
Culture Documents
Mathematical Geology [mg] pp642-matg-453416 October 21, 2002 16:47 Style file version June 25th, 2002
Kriging with external drift allows one to estimate a target variable, accounting for a densely sam-
pled auxiliary variable. Contrary to cokriging, kriging with external drift does not make explicit the
structural link between target variable and auxiliary variable, for the latter is considered to be de-
terministic. In this paper, we show that kriging with external drift assumes implicitly an absence of
spatial dependence between the auxiliary variable and the residual of the linear regression of target
variable on auxiliary variable at same point. This is the simple model with orthogonal residual, where
cokriging is collocated and coincides with kriging of the residual. In this model, the cross-structure is
proportional to the structure of the auxiliary variable, and the linear regression of target variable on
auxiliary variable does not depend on the support.
INTRODUCTION
Kriging with external drift is a popular geostatistical method (e.g. Goovaerts, 1997;
Wackernagel, 1995). It allows one to estimate a target variable, accounting for a
densely sampled auxiliary variable. A very detailed documentation of kriging with
external drift, including its origin, can be found in Chiles and Delfiner (1999). Here
are the elements required for the present purpose. The target variable is supposed
to be sampled at some locations x (indexed by a Greek letter, e.g., ) and is
represented by a Random Function Z (x). By contrast, the auxiliary variable f (x)
is supposed to be known at all desired points xi (say, all points x and all target
points, e.g. on a grid, as usually in practice), and is considered as deterministic. It
is generally also very regular. We assume a linear relation between the variables,
defining the drift E[Z (x)] externally through f (x):
797
0882-8121/02/1000-0797/1
C 2002 International Association for Mathematical Geology
P1: ZBU
Mathematical Geology [mg] pp642-matg-453416 October 21, 2002 16:47 Style file version June 25th, 2002
798 Rivoirard
P
(or more generally l al f l (x), e.g. f 0 (x) = 1 and f 1 (x) = f (x) in present case):
the shape of the drift is given by f (x), its amplitude by a, and its level is then
finally given by b. Hence we can write
Let
X
Z 0 = Z + 0
where, for instance, f 0 is short for f (x0 ). The estimator is unbiased if we have
X X
E() = E(Z 0 Z 0 ) = a f0 f +b 1 0 = 0
X X X
Z 0 = Z + a f 0 f +b 1 (3)
On the Structural Link Between Variables in Kriging With External Drift 799
Z 0 = a f 0 + b + R0 (4)
with
X
R0 = R
Consequently, if both amplitude a and level b of the drift are known, the best linear
estimator of Z 0 corresponds to the simple kriging of the residual:
Z 0RSK = a f 0 + b + R0SK
If the amplitude a of the drift is supposed known but not its level b, this unknown
coefficient can be filtered out of the expected error by imposing the condition
X
= 1
Then the best linear estimator of Z 0 corresponds to the ordinary kriging of the
residual. In this case the residual needs only be defined up to an additive constant,
e.g. Z (x) a f (x), and we have
X X
Z 0ROK = a f 0 + (Z 0 a f 0 )OK = Z + a f 0 f
Finally if only the shape of the drift is known, both a and b being unknown, these
can be filtered out of the estimation error by imposing the conditions
X X
= 1 and f = f 0 (5)
or more generally
X
f l = f 0l whatever l
when is unknown. Then minimizing the estimation variance yields kriging with
external drift:
X
Z 0EDK = Z
P1: ZBU
Mathematical Geology [mg] pp642-matg-453416 October 21, 2002 16:47 Style file version June 25th, 2002
800 Rivoirard
The weights and the Lagrange parameters l introduced to fulfil the constraints
are the solution of the system made of
X X
cov(R , R ) + l f l = cov(R , R0 ) whatever
l
and of the constraints (5). Thus, kriging with external drift allows one to estimate
Z when the coefficients of the drift are unknown, e.g. varying with a moving neigh-
borhood. One difficulty comes from the estimation requiring the spatial structure
of the residual: as they depend on these coefficients, the exact values of the residual
at data points are not known. A cross-validation is then often used to estimate or
to refine the residual structure. The very subject of the inference of the residual
structure has not been developed in this paper. Note, however, that when using
an erroneous structure, kriging with external drift is no longer optimal but is still
unbiased.
Just like the estimation by kriging the residual, kriging with external drift
apparently only necessitates the residual structure. But a fundamental question
arises. The auxiliary variable being known everywhere, we have chosen to consider
it as deterministic. However, we could also have represented it by a Random
Function, and modeled its structural link with the target Random Function, in the
manner of cokriging for linear estimation. Is considering the auxiliary variable as
deterministic rather than probabilistic sufficient to evacuate a possible structural
link between the variables, when estimating the target by kriging the residual or
kriging with external drift? Or is there an implicit structural link in these methods?
While kriging with external drift is a very popular method, this problem does not
seem to have been worked out. Note that the auxiliary variable was also considered
as probabilistic by Chiles and Delfiner (1999), but in the purpose of accounting
for a lack of information on it.
Having presented kriging of the residual together with kriging with external
drift was intentional as it makes things simpler. As a matter of fact, let us randomize
the auxiliary variable f (x) in (1), i.e. represent it by a Random Function Y (x)
(the minor disadvantage of changing the letter here is compensated by it being
consistent with the usual literature on the subject, e.g. Chiles and Delfiner, 1999):
On the Structural Link Between Variables in Kriging With External Drift 801
Z 0CK = a Y0 + b + R0CK
E[R(x)Y (x + h)] = 0
As a way to make the auxiliary variable deterministic, let us see what happens
if we condition the model (6) by all values f i = f (xi ) taken by Yi = Y (xi ) (which
we will note in short Y = f). In the general case the problem is complex, because
the conditional distribution of R(x) may depend on Y = f, even if R is orthogonal
to Y (for instance, the conditional expectation of the residual can be a nonlinear
function of the values f ). To simplify and to stay in the linear framework, we
will assume here (multi-)normality (i.e. all linear combinations being normally
P1: ZBU
Mathematical Geology [mg] pp642-matg-453416 October 21, 2002 16:47 Style file version June 25th, 2002
802 Rivoirard
To prove (9), write the conditional residual as r (x) f + U (x), with E[U (x)] = 0.
Then the conditional covariance is E[U (x)U (x + h)], and is obtained from
Note in particular the expression of the expectation of Z (x) conditional to all values
Y = f (multiple regression),
which is different from its unconditional expectation and from the simple
regression
On the Structural Link Between Variables in Kriging With External Drift 803
f
E Z 0SCK | Y = f = E Z 0Y | Y = f = z 0
Because of the conditions on the weights, this is not the case of ordinary cokriging
nor of the estimation by ordinary kriging of the residual. However, this is also not
the case either for the estimation by simple kriging of the residual, or by kriging
with external drift. For these, the bias, conditional on Y = f, is respectively
f
E Z 0 Z 0RSK | Y = f = a f 0 + b + r0 a f 0 b
X f
X
rf = r0 rf
and
f
X
E Z 0 Z 0EDK | Y = f = z 0 z f
and these estimators, since they are computed with the unconditional covariance,
are not optimal.
Having seen what happens when R is spatially correlated with Y , we can
finally state that by using a residual with an expectation and a structure independent
of the auxiliary variable, the estimation by kriging the residual or by kriging with
external drift, implicitly assumes an absence of spatial correlation between R
and Y .
Moreover, if we assume that the expected residual is 0 for any realization of
Y (not just f ):
E[R(x) | Y] = 0
804 Rivoirard
DISCUSSION
Markov Models
Z LR (x) = a Y (x) + b
When the residual is spatially uncorrelated with Y (Eq. (7)), Z (x) can be split
into the two orthogonal terms a Y (x) + b and R(x):
orthogonal (10)
On the Structural Link Between Variables in Kriging With External Drift 805
CY Z (h) = a CY (h)
that is
Y Z (h) = a Y (h)
under the intrinsic hypothesis with stationary increments. So in this model, the
cross-structure between Z and Y is proportional to the structure of Y . This corre-
sponds to Markov model 2, in the terminology proposed by Journel (1999). We
also have
so that the structure of Z is the sum of a 2 times the structure of Y plus that of
the residual. In particular it cannot be more regular at the origin than the structure
of Y . Note also that in this model, the linear regression of Z (v) on Y (v) does not
depend on the support v used, since from (11), its slope cov(Z (v), Y (v))/var(Y (v))
is equal to a.
Similarly, there is in general no reason for the residual of the linear regression
of Y (x) on Z (x), say R 0 (x) = Y (x) a 0 Z (x) b0 , to have no spatial correlation
with Z (x). The exception is the model where Y (x) can be split into two orthogonal
terms a 0 Z (x) + b0 and R 0 (x):
orthogonal
806 Rivoirard
Journel (1999)). In this case, the regularity of Y cannot exceed that of Z , and the
linear regression of Y on Z is independent of the support.
In the particular case where all structuresthe structures of Z and Y and
their cross-structuresare proportional (the model being both MM1 and MM2),
R is orthogonal to Y and R 0 to Z . Then both regressions, as well as the correlation
coefficient between the variables, are independent of the support.
Models with orthogonal residual, characterized by a cross-structure pro-
portional to the structure of one variable, have been used in isotropic cases
(Bordessoule, Demange, and Rivoirard, 1989). They had been popularized in the
90s under the name of Markov models (e.g. Almeida and Journel, 1994; Xu and
others, 1992) essentially for the case of an auxiliary variable densely sampled, then
distinguishing the case where the cross-structure is proportional to the structure
of the target variable Z (MM1), or to that of the auxiliary variable (MM2) (Chiles
and Delfiner, 1999; Journel, 1999).
With an auxiliary variable densely sampled, MM1 was long thought to sup-
port strictly collocated cokriging (Almeida and Journel, 1994; Goovaerts, 1997;
Shmaryan and Journel, 1999; Xu and others, 1992), making use of the auxiliary
variable only at target point. In parallel, the method was extended to multicollo-
cated cokriging, where the auxiliary variable was used also at locations where the
target variable is known. It was shown however that cokriging was not collocated
using MM1, but that it was (multi-)collocated when all structures were proportional
(Chiles and Delfiner, 1999). Finally MM2 was shown to be the appropriate model
for collocated cokriging, that is, the model where the use of a (multi-)collocated
neighbourhood does not result in any loss of information for cokriging (Rivoirard,
2001).
The challenging question of a similarly optimal cokriging neighbourhood
for MM1 was then left unsolved. Of course, as mentioned above, cokriging is
collocated in MM1 when it is at the same time MM2, i.e. all structures being
proportional. However, true cokriging in MM1 is in general not collocated, and
can even be dislocated (making use of Y only at locations where Z is not known),
as is shown now.
Consider, in a general heterotopic configuration, MM1 with a pure nugget
residual (as used for instance to represent a variable Z (x) with measurements Y (x)
spoiled by a nugget error R 0 (x)). As it is pure nugget, knowing R 0 at a location x
provides no information to any other point when the means are known. In other
words, at a location x where Z (x) is known, knowing additionally Y (x) provides
no information to any other point. This is screened out in simple cokriging (this
can also be checked on the cokriging system), which then makes use of Y only at
locations where Z is not known.
P1: ZBU
Mathematical Geology [mg] pp642-matg-453416 October 21, 2002 16:47 Style file version June 25th, 2002
On the Structural Link Between Variables in Kriging With External Drift 807
CONCLUSION
Different linear methods are currently used to account for a densely sampled
auxiliary variable:
(simple or ordinary) kriging of the residual from the linear regression of
target variable on auxiliary variable at same point;
kriging with auxiliary variable as external drift;
cokriging.
The first two methods, kriging the residual and kriging with external drift,
only require explicitly the structure of the residual. However, as was shown above
and is also intuitively obvious, they are optimal when this residual is orthogonal to
the auxiliary variable. The target variable is then directly driven by the auxiliary
variable, which has a direct impact on it. This also corresponds to a cross-structure
between variables being proportional to that of the auxiliary variable (Markov
model 2).
On the other hand, cokriging, theoretically the best linear estimator, can ac-
count for many more various bivariate models of coregionalization. However, when
P1: ZBU
Mathematical Geology [mg] pp642-matg-453416 October 21, 2002 16:47 Style file version June 25th, 2002
808 Rivoirard
the auxiliary variable is densely sampled, full cokriging is generally not feasible
and has to be approximated through the choice of the neighbourhood. In particular,
only in the previous model with orthogonal residual does cokriging with collocated
neighbourhood coincide with full cokriging, but then is equivalent to kriging the
residual.
These points can be summarized in the following guidelines, depending on
the model, that is, the hypotheses, that are assumed:
assuming that the target variable is driven by the auxiliary variable (model
with orthogonal residual MM2), kriging of residual or kriging with external
drift should be preferred (simple kriging of residual when the regression is
known; ordinary kriging when only its amplitude is supposed known, not
its level; kriging with external drift when only the shape is known, not the
level nor the amplitude);
assuming another coregionalization model, cokriging should be preferred,
but with the problem of choosing a neighbourhood that approximates cor-
rectly full cokriging.
ACKNOWLEDGMENT
The author thanks the reviewers for their critical and helpful comments, no-
tably in designing guidelines.
REFERENCES
Almeida, A., and Journel, A., 1994, Joint simulation of multiple variables with a Markov-type core-
gionalization model: Math. Geol., v. 26, no. 5, p. 565588.
Bordessoule, J. L., Demange, C., and Rivoirard, J., 1989, Using an orthogonal residual between ore
and metal to estimate in-situ resources, in Armstrong, M., ed., Geostatistics: Kluwer, Dordrecht,
v. 2, p. 923934.
Chiles, J.-P., and Delfiner, P., 1999, Geostatistics: Modeling spatial uncertainty, Wiley, New York,
695 p.
Goovaerts, P., 1997, Geostatistics for natural resources evaluation, Oxford University Press, New York,
483 p.
Journel, A., 1999, Markov models for cross-covariances: Math. Geol., v. 31, no. 8, p. 955964.
Rivoirard, J., 2001, Which models for collocated cokriging?: Math. Geol., v. 33, no. 2, p. 117131.
Shmaryan, L. E., and Journel, A., 1999, Two Markov models and their application: Math. Geol., v. 31,
no. 8, p. 965988.
Wackernagel, H., 1995, Multivariate geostatistics : An introduction with applications, Springer, Berlin,
256 p.
Xu, W., Tran, T. T., Srivastava, R. M., and Journel, A. G., 1992, Integrating seismic data in reservoir
modeling: The collocated cokriging alternative. 67th Annual Technical Conference and exhibition,
SPE paper 24742, p. 833884.