Professional Documents
Culture Documents
a r t i c l e i n f o a b s t r a c t
Article history: NMR relaxation and diffusion data analysis commonly uses a wide range of methods from simple expo-
Received 4 January 2010 nential fitting to Laplace inversions. The pros and cons of these methods are often the subject of intense
Revised 12 February 2010 debate. We show that the ill-conditioned nature of such analysis gives rise to a range of solutions for
Available online 19 February 2010
every method resulting in uncertainty in the spectral solution. Such uncertainty is in fact characteristic
of the inversion method. We show a simple method of sparse spectral representation can be used to
Keywords: improve the statistics of multiple-exponential-based inversion schemes.
Monte Carlo
Ó 2010 Elsevier Inc. All rights reserved.
Laplace inversion
Uncertainty
Relaxation
Diffusion
1. Introduction both data misfit and model complexity, does provide an effective
metric for choosing the best stochastic spectral representation
Analysis of relaxation and diffusion data often uses many differ- for evaluating the uncertainty of particular spectral properties.
ent methods, such as single exponential fitting for samples of sin- In addition, we demonstrate a Monte Carlo algorithm for La-
gle molecular species, fitting with a few exponentials (e.g., two or place inversion that uses a small number of variable-width boxcar
three), stretched exponential fitting, and Laplace inversion [1–5]. functions instead of the small number of delta functions used in
Methods other than single exponential fitting are used to analyze multi-exponential fitting. Note that in the limit of a large number
the multi-exponential decays resulting from heterogeneity in the of delta functions assigned to fixed spectral locations, the multi-
underlying sample. Laplace inversion can be used to analyze arbi- exponential fit can be used to approximate the continuous spectral
trarily complex decay spectra. However, there is often a debate in range used in traditional Laplace inversion. We show the advan-
the literature on which criteria to use in choosing the appropriate tage of using a sparse boxcar representation, when applied to spec-
method for a particular application, and the benefit of using more tral uncertainty analysis, compared to other methods. Such an
complex spectral models. approach may also have advantages when performing uncertainty
The origin of this ambiguity is the ill-conditioned nature of mul- analysis on higher-dimensional NMR data because it requires few-
ti-exponential inversion [6]. Given a data set with finite noise, the er spectral parameters than either the multi-exponential or the La-
solution spectrum is not unique and many spectra fit the experi- place inversion approaches.
mental decay data within the statistics of the noise. As a result, Once an ensemble of solution spectra have been generated,
we should expect the data to be fitted by many different spectra, questions about the shape of the spectrum can be posed in a prob-
both with the same functional form for the spectra and with differ- abilistic form that allow the uncertainty of the answers to be quan-
ent ones. In fact, the true solution is the entire ensemble of the tified. In Section 4 we examine the uncertainty in functionals of the
solution spectra as demonstrated in our recent work [7]. In the spectrum that relate to the total porosity and the bound-fluid
present work, we apply a concept that is similar to these other porosity in a rock sample.
analysis methods, such as single- and multi-exponential fitting,
however an ensemble of solutions is obtained for each method. 2. Theory
For a given data set, many different fitting methods can yield solu-
tion spectra that are consistent with the data, meaning that data 2.1. General
misfit is not an adequate differentiator for choosing a method.
We show that the Akaike information criterion [8], which weighs NMR signals of T 2 decay in porous materials are well modeled
as a sum of decaying exponentials,
Z 1
* Corresponding author. mðtÞ ¼ f ðTÞ expðt=TÞd logðTÞ; ð1Þ
E-mail address: prange@slb.com (M. Prange). 0
1090-7807/$ - see front matter Ó 2010 Elsevier Inc. All rights reserved.
doi:10.1016/j.jmr.2010.02.010
M. Prange, Y.-Q. Song / Journal of Magnetic Resonance 204 (2010) 118–123 119
where mðtÞ is the signal as a function of time, f ðTÞ is the spectrum as tions the boxcar functions so that Eq. (7) can be analytically eval-
a function of relaxation time T satisfying f ðTÞ P 0. Using a more uated. Boxcar functions are constant within a given range of T
compact operator notation, and zero otherwise. Our boxcar functions are defined by
(
mðtÞ ¼ GðtÞf ; ð2Þ 1
; T Lo < T 6 T Hi
B T; T Lo ; T Hi ¼ log T Hi log T Lo ð8Þ
where GðtÞ is the kernel operator. Our goal is to describe the uncer- 0; otherwise;
tainty in f resulting from the inversion of a noisy measurement of m.
We start with noisy samples of m measured at discrete times where the normalization of Eq. (5) has been applied. Note that this
and put in the measurement vector m ¼ fmðti Þ; i ¼ 1; . . . ; Mg. Par- definition allow the basis functions to overlap, allowing, for exam-
ker and Song [9] have demonstrated that the noise in such a mea- ple, two overlapping basis functions to form a three-stepped func-
surement can often be modeled as an independent identically tion akin to a winners’ podium at a sports arena. In a limiting
distributed (IID) normal random process. In this case, with some case, this basis function can approach zero width, resulting in a
simplifying assumptions described below, the uncertainty of f is gi- properly-normalized delta function. Substituting this basis function
ven by the probability density function [7] into Eq. (7) yields
1 2 C0 t
C0 t
pðf Þ / exp km GðtÞf k ; f ðTÞ P 0; ð3Þ T Hi T Lo
2r2 /i t; T Lo Hi
i
i
;
i ; Ti ¼ ð9Þ
where r2 is the independently determined noise variance, t is the log T Hi
i log T Lo
i
m (arb nit)
prone to overfitting. The reason of this is that the low probability of
basis functions of both small width and small T 2 values makes it
f (T) (Arbitrary unit)
m (Arbitrary Unit)
0
0
c Delta Function N=4 d
f (T) (Arbitrary Unit)
m (Arbitrary Unit)
0
0
0 1 -2
10-3 10-2
10 -1
10 10 10-3 10 10 -1 100
T (s) T (s)
Fig. 2. The spectral uncertainty for the synthetic data shown in Fig. 1 is illustrated here for delta-function bases with (a) N ¼ 3 and (c) N ¼ 4. The spectral values are
superimposed as gray bars for 20 of the 104 sampled solutions. The true spectrum is overlain as a dashed curve. The statistics of the time-domain predictions associated with
the 104 spectral samples are shown in (b) for N ¼ 3 and (d) for N ¼ 4, with the median curve shown in bold, and the four dashed curves showing minimum values, the 10%
and 90% quantiles, and the maximum values. Only the minimum and maximum curves are readily apparent in (b), as the tightly-bounded quantile curves are hidden by the
thickness of the median curve.
M. Prange, Y.-Q. Song / Journal of Magnetic Resonance 204 (2010) 118–123 121
a b
m (Arbitrary Unit)
0
0
c d
f (T) (Arbitrary Unit)
m (Arbitrary Unit)
0
0
10-3 -2
10 10-1
10
0
10
1
10
-3
10
-2
10
-1 0
10
T (s) T (s)
Fig. 3. The spectral uncertainty for the synthetic data shown in Fig. 1 is illustrated here for boxcar-function bases with (a) N ¼ 2 and (c) N ¼ 3. The spectral values are
superimposed as gray curves for 20 of the 104 sampled solutions. The true spectrum is overlain as a dashed curve. The statistics of the time-domain predictions of these
spectral samples are shown in (b) for N ¼ 2 and (d) for N ¼ 3, with the median curve shown in bold, and the four dashed curves showing minimum values, the 10% and 90%
quantiles, and the maximum values. Only the minimum and maximum curves are readily apparent, as the tightly-bounded quantile curves are mostly hidden by the
thickness of the median curve.
ferred over the boxcar-basis. A more quantitative comparison can those made by an alternative algorithm presented in [7]. We
be made by translating DAIC into an odds ratio, which is the prob- choose to examine the uncertainty in T 2 spectra by examining
ability of one model over another [14]. This odds ratio is given by the impact of this uncertainty on functionals of the spectra, as
expðDAICÞ. Using an odds ratio, the best Delta function model was done in [7]. In this comparison we use the functionals for total
(N ¼ 3) is preferred to the best boxcar function model (N ¼ 2) by porosity (qT ) and bound porosity (qB ) defined by
about 2.3 to 1. This is a weak preference [14]. By contrast, the best Z 1
model is preferred over the next best models with k < 6 by more qT ¼ f ðTÞd logðTÞ and
than 108 to 1. This is decisive evidence that at least six parameters Z
0
ð12Þ
Tc
are needed to model the noisy time-domain measurements in this qB ¼ f ðTÞd logðTÞ;
example. As we will see later, the choice of delta-function versus 0
boxcar-function model depends also on how the spectrum is being
where we use T c ¼ 0:033 s, a typical value used for sandstone [15].
used, with smooth spectral models being preferred in some
Histograms of qT and qB were generated from Monte Carlo samples
applications.
of f ðTÞ.
One might suppose that spectral uncertainty would be better
4. Uncertainty comparisons using functionals examined by directly displaying the uncertainty in f, for example
via a plot of the mean spectrum along with 95% confidence limits.
Our objective here is to compute the uncertainty in one-dimen- Unfortunately such displays are misleading because they ignore
sional T 2 spectra and to compare these uncertainty estimates with the strong correlations that exist between parameters [9,7]. For
example, an increase in the spectrum for one value of T can be
compensated for by decreasing the spectrum at a neighboring va-
lue of T. This results in overly large confidence limits since such a
1000
plot only shows the marginal uncertainty in the spectrum for any
given value of T. We sidestep this complexity through the use of
functionals — functionals map the spectrum into a single random
100
ΔAIC+1
Corrected Fig. 6 for both delta- and boxcar-function bases. Since qB focuses
ρ
0.100 on the lower T 2 part of the spectrum, the effect of a positive bias
in that zone will have a proportionally larger impact than with
0.098 qT . The true value in this case is qB 0:014. As expected, for
k < 6 both bases provide poor estimates of the uncertainty. For
0.096 k P 6, the boxcar-function basis yields good estimates, while the
delta-function basis once again has a positive bias. Thus we have
2 5 10 20 50 100
k shown that in this example the delta-function basis provides a
poor estimator of qB even when the optimum k is chosen using
Fig. 5. Estimates of qT , along with their uncertainty, are plotted versus the number the AIC.
of parameters (k) as mean curves and the mean ± one standard deviation confidence A key difference between the delta-function and boxcar-basis
limits for delta-function bases (solid line and circles) and boxcar-function bases
estimators is highlighted in Fig. 7, where the two estimators are
(dashed lines and open circles). The true value of 0.1 is indicated by a dotted line.
The ‘‘uncorrected” and ‘‘corrected” values (open squares), taken from [7], are compared for k ¼ 6. The histogram for the boxcar-basis estimator
computed for the same synthetic data using 100 delta functions at fixed spectral is unimodal and is peaked near the true value. In contrast, the esti-
locations (k ¼ 100), where the correction is a post-processing step designed to mator using the delta-function basis is bimodal, with low probabil-
remove the bias.
ity at the true value. The reason for this bimodality is apparent in
Fig. 2a. When the leftmost of the three spikes is to the right of the
cutoff time, T c ¼ 0:033 s, the value of qB is zero, while it is non-zero
the trade-off between goodness of fit and variance is the founda- otherwise. As this spike dances around from sample to sample, an
tion of how statisticians frame the principle of parsimony. Parsi-
mony seeks to find the model with ‘‘the smallest number of
parameters for adequate representation of the data” [13, p. 31].
We use the AIC to manage this trade-off.
0.025
The AIC identifies k ¼ 6 as the optimum model complexity for
both the delta-function and boxcar-function bases. The k ¼ 6
0.020
uncertainty estimates for both basis types result in reasonable esti-
mates of qT , meaning that the true value is a probable sample from
0.015
ρB
artificial transition is created that is simply due to the infinitesimal boxcar-basis function with variable width does not exhibit such
width of the spike. This problem does not occur in the estimation deficiencies and possesses no positive bias. Moreover, the boxcar
of qT because there is no cutoff time in that case. It was this obser- basis provides a sparse representation of higher-dimensional spec-
vation led us to postulate that a more robust estimator might be tra, making it a good candidate for use in an efficient algorithm for
created by allowing the spikes to have width (the boxcar-basis), two- and higher-dimensional Laplace inversion.
thereby minimizing the effect of this transition.
Appendix A. Directed Gibbs’ sampler
5. Discussion
A standard Gibbs’ sampler, such as Algorithm A.39 in [11], eval-
The examples in Figs. 2 and 3 show clearly that models with uates the pðf Þ function (Eq. 10) along each parameter in k in turn,
dramatically different functional forms (and shapes of the relaxa- forming a probability distribution along that coordinate direction.
tion spectra) can all fit the data within the noise characteristics. The parameter value is then updated with a value sampled from
All solutions within each functional form comprise a portion of that distribution. Once all the parameters in k have been updated,
the entire ensemble of spectral solutions that satisfy the data. Since the resulting value of k is the new sample drawn from pðf Þ.
the spectral shapes are substantially different, we expect the solu- This standard approach becomes inefficient when the distribu-
tions of different functional forms occupy somewhat different re- tion is highly correlated, as it is in the spectral inversion problem.
gions of the solution space. Within each functional form, when Imaging a two-dimension problem in which the uncertainty ellipse
the number of degrees of freedom increases, there is more spectral is long, narrow and tilted at an angle to the coordinate axes. Since
variability and the size of the solution region increases. Thus it is the standard Gibbs’ sampler samples along the coordinate axes, the
natural to observe an increase in uncertainty with increasing k. narrow width of the uncertainty ellipse forces the distributions
When k is much less than the optimal value by the AIC criterion, along each coordinate axis to be commensurately narrow, thereby
the fit is generally poor and the significantly reduced uncertainty preventing long jumps along the long axis of the ellipse. This
does not reflect the uncertainty of the data. When k is larger than means that traversing the long axis of the ellipse will require many
optimal, the uncertainty can vary significantly for different bases steps of the sampler, i.e., the samples are highly correlated, making
and k. This is a direct consequence of the different solution regions an inefficient sampler.
obtained through different basis functions and methods. In fact, This problem is avoided by choosing the Gibbs’ sampling direc-
the choice of specific functional form can be a very strong regulator tions to align with the axes of the uncertainty ellipse. This makes
for reducing model variation and uncertainty provided that the large jumps possible. The inverse covariance matrix is periodically
assumption of the functional form is properly justified. Often, such estimated by computing the Hessian of log pðf Þ, and the Gibbs’
justification comes from understanding the physical, chemical and sampling directions are set to the eigenvectors of this matrix. Since
symmetry properties of the material and the experimental the number of k parameters is small and the Hessian need not be
techniques. frequently updated, the computation expense of this operation is
The boxcar-basis is an interesting example of the effect of basis insignificant.
function choice. Since there is no limit on the width of the boxes
and the Monte Carlo procedure is free to choose very narrow box References
widths (reducing to delta functions in the limit of zero box width),
the phase space for the boxcar-basis expansion contains and is [1] W.J. Wiscombe, J.W. Evans, Exponential-sum fitting of radiative transmission
functions, Journal of Computational Physics 24 (4) (1977) 416–444.
much larger than that for the delta-function basis. Thus it is rare [2] J.W. Evans, W.B. Gragg, R.J. LeVeque, On least squares exponential sum
to see a very narrow box in the spectral solution. This preference approximatin with positive coefficients, Mathematics of Computation 34 (149)
is likely the cause of the improved uncertainty estimates compared (1980) 203–211.
[3] K. Tittelbach-Helmrich, An integration method for the analysis of
to those using the delta-function basis. The concept of using more multiexponential transient signals, Measurement Science and Technology 4
restrictive basis functions is similar in spirit to the use of a regular- (1993) 1323–1329.
ization method (e.g., Ref. [5]). [4] Å. Ukkelberg, G.H. Sørlandb, E.W. Hansen, H.C. Widerøe, ANAHESS, a new
second order inverse Laplace transform algorithm, in: Presentation at the
Although we demonstrate our approach with one-dimensional Ninth International Bologna Conference on Magnetic Resonance in Porous
data, all aspects of the approach generalize straightforwardly to Media (MRPM9), Cambridge MA, USA, 2008.
higher dimensions. One of the motivations of this work was to [5] Y.-Q. Song, L. Venkataramanan, M.D. Hurlimann, M. Flaum, P. Frulla, C. Straley,
t 1 t2 correlation spectra obtained using a fast two-dimensional laplace
speed up the Monte Carlo method for two- and higher-dimensional
inversion, Journal of Magnetic Resonance 154 (2002) 261–268.
data by reducing the number of free parameters by using sparse- [6] C.L. Epstein, J. Schotland, The bad truth about Laplace’s transform, SIAM
basis representations. Although the boxcar-function is shown to Review 50 (3) (2008) 504–520.
[7] M. Prange, Y.-Q. Song, Quantifying uncertainty in NMR T2 spectra using Monte
be efficient, it is possible that other functions may also be used
Carlo inversion, Journal of Magnetic Resonance 196 (2009) 54–60.
to achieve similar results. [8] H. Akaike, A new look at the statistical model identification, IEEE Transactions
on Automatic Control 19 (6) (1974) 716–723.
[9] R.L. Parker, Y.-Q. Song, Assigning uncertainties in the inversion of NMR
6. Conclusions relaxation data, Journal of Magnetic Resonance 174 (2005) 314–324.
[10] D.S. Sivia, Data Analysis: A Bayesian Tutorial, second ed., Oxford University
This paper discusses the uncertainty of Laplace inversion results Press, Oxford, UK, 2007.
[11] C.P. Robert, G. Casella, Monte Carlo Statistical Methods, Springer, New York,
obtained from several different methods. Although all of these 2004.
methods fit the data well, their uncertainty behavior is markedly [12] C.L. Lawson, R.J. Hanson, Solving Least Squares Problems, Prentice-Hall,
different. The choice of different basis functions can be considered Englewood Cliffs, NJ, 1974.
[13] K.P. Burnham, D.R. Anderson, Model Selection and Multimodel Inference,
as applying a prior knowledge of the system, since they restrict the
second ed., Springer, New York, 2002.
range of spectral solutions as in a regularization approach. The [14] A.R. Liddle, Information criteria for astrophysical model selection, Monthly
commonly-used delta-function basis often possesses a positive Notices of the Royal Astronomical Society: Letters 377 (1) (2007) L74–L78.
bias and a large variance in the estimate of qT and qB . This positive [15] R.L. Kleinberg, Well logging, Encyclopedia of Nuclear Magnetic Resonance, vol.
8, Wiley, New York, 1996, pp. 4960–4969.
bias issue was also demonstrated in [7], where the spectrum is rep-
resented by a large number of spectral bins. On the other hand, the