Professional Documents
Culture Documents
ANDREAS KRAUSE Many persons might have attained to been proven very popular with financial insti-
is a lecturer in finance wisdom had they not assumed that they tutions and regulators alike—the Bank for
at the University of Bath
already possessed it. International Settlements and the SEC,
School of Management
in the UK. — Seneca among others.
mnsak@bath.ac.uk Originally VaR was intended to mea-
n the early 1990s we witnessed a number sure the risks in derivatives markets, but it
Ever since Markowitz [1952], it has become where ∆T denotes the time horizon of the possible losses.
common to measure the risk of an investment by the stan- In most cases it is more convenient to express the VaR in
dard deviation of outcomes. However, the usual percep- terms of returns. Define R* and µ such that
tion of risk is that of a possible loss relative to a benchmark.
Any risk measure therefore should concentrate on these
losses rather than the variability of outcomes which also V ∗ = (1 + R∗ )V0
includes profits, usually not regarded as risk. Using the E[V ] = (1 + µ)V0 (3)
standard deviation as a risk measure therefore does not
address the way risk is usually seen.
It would be more appropriate to concentrate on the We then can rewrite Equation (2) as
lower tail of the distribution of outcomes where the losses
are located. As being concerned with the size of these Zero
V aRc,∆T = −V0 R∗
possible losses, a useful risk measure would be one which
M ean
reflects the amount that can reasonably be lost. We there- V aRc,∆T = −V0 (R∗ − µ) (4)
fore use the expected worst loss that occurs with a cer-
tain probability as our risk measure. For example, we
could then report the risk by stating that with a proba- Exhibit 1 illustrates this definition of the VaR. In
bility of 99% (the confidence level) the losses do not order to apply VaR in risk management we now have to
exceed US$13m. Or, in other words, the losses exceed estimate the relevant parameters. While the determina-
US$13m with a probability of only 1%. Obviously we tion of the current value of the investment, V0, and the
have to specify the time horizon in which these losses expected return, µ, usually do not impose difficulties, it
may occur. At which confidence level the risk is reported, is much more complicated to estimate R* properly. There
depends on the risk aversion of the individuals involved, are generally two methods to estimate R*, the non-para-
besides regulatory requirements. metric method and the parametric method.
The above idea is the basis for the definition of The non-parametric method uses historical data and
value at risk (VaR). Let V be the value of an investment infers from them directly the (1 – c)-quantile of returns,
at the end of the time horizon. We then define a V * which gives R*. This estimate of R* is then used to cal-
such that culate the VaR according to Equation (4).
V∗ The alternative parametric method assumes that
P rob(V ≤ V ∗ ) = dF (V ) = 1 − c (1) returns follow a certain distribution. We then use past data
−∞
4 on returns to estimate the relevant parameters of the dis-
tribution. These parameter estimates are then used to cal-
where c denotes the confidence level and F(V ) the cumu- culate, in most cases analytically, the (1 – c)-quantile of this
lative distribution function of V. Hence the value of the distribution, which as before gives an estimate of R* that
investment is below V * with a probability of 1 – c. Based can be used to determine the VaR from Equation (4).
on this equation we can now define the VaR. We only The most widely used distribution for this purpose
have to define a benchmark such that any outcomes below is the normal distribution, although other distributions
this benchmark are regarded as losses. It is common to like the t-distribution, generalized error distribution, or
define losses either relative to the status quo, i.e., the cur- the Lévy-distribution have been used.
20 EXPLORING THE LIMITATIONS OF VALUE AT RISK: HOW GOOD IS IT IN PRACTICE? WINTER 2003
EXHIBIT 1 EXHIBIT 2
Definition of Value at Risk Determination of the VaR with
Normally Distributed Returns
V* E[V]
0.16
R* µ
Zero 0.16
VaR
ασ
0.12
VaR Mean 0.12
Density
Density
0.08
0.08
0.04
Prob (V < V*) = 1 – c 0.04 Prob (R < R*) = 1 - c
0.00
0.00
-3 -2.5 -2 -1.5 -1 -0.5 0 0.5 1 1.5 2 2.5 3
-3 -2.5 -2 -1.5 -1 -0.5 0 0.5 1 1.5 2 2.5 3
Profits US$m
Return in %
70
R∗ = µ − ασ (6)
60
50
20
Zero
V aRc,∆T = V0 (ασ − µ) 10
M ean 0
V aRc,∆T = V0 ασ (7) -3.79 R1* -2.44 R2*
-5.14 -1.09 0.26 1.60 2.95 More
Return in %
Exhibit 2 illustrates the determination of the VaR
under the assumption of normally distributed returns.
As we can see from Exhibit 3, the results of the two
methods do not have to be identical. This is due to the We see from Equation (7) that the standard devia-
fact that in reality the distribution of returns is not tion plays a crucial role in the determination of the VaR.
exactly normal, but sometimes substantial deviations By using historical data, the implicit assumption made is
are observed, especially in the tails that are evaluated that the distribution does not change over time. Empirical
for the VaR. The example compares the distribution of evidence clearly shows that this assumption cannot be
daily Nasdaq Composite Index returns from March 1999 upheld as the variance changes over time (heteroscedas-
to March 2001 with and without the assumption of a ticity). In many applications it is now common to model
normal distribution. the variance of returns as a stochastic process, usually of the
Density
days for applications in financial markets as required by
banking regulation). Hence we have to transform these
annual returns into returns for the required time horizon.
The simplest way to do this is to assume that returns are
identically and independently distributed in each time
period. With this assumption we can easily transform any
reference expected return, µ*, and standard deviation, σ*, -6 -5.5 -5 -4.5 -4 -3.5 -3 -2.5 -2 -1.5
into the required time horizon used to determine the VaR: Profits US$m
µ = µ∗ ∆T
√ distributions are shown in Exhibit 4. The first investment
σ = σ ∗ ∆T (8)
has a higher probability of losses around US$4m, while
the second investment has this higher probability close to
where ∆T is measured relative to the time length of the US$5m. Above the 5%-quantile the two distributions are
reference period; e.g., for µ* and σ* given annually, a Zero
identical, hence the 95% VaR, VaR 0.95,∆T , is the same in
time horizon of one month corresponds to ∆T = 1⁄12. both cases. As the losses beyond the VaR are larger in the
We will return to several problems associated with second case, we would reasonably conclude that this is the
estimating R* later, but will first explore whether VaR is riskier investment. The VaR, however, suggests that both
such a good risk measure as claimed above. investments have identical risk. Clearly VaR is not repre-
senting the risk of the two investments properly.
THE QUALITIES OF VAR AS A RISK MEASURE As a second example, suppose that you have 100
assets whose outcomes are independent of each other and
There is always a well-known solution to every the payoff profile at the end of the time horizon is iden-
human problem—neat, plausible, and wrong. tical for every i = 1, . . . , 100:
H.L. Mencken
Artzner et al. [1999] define desired properties of risk 5 with probability 0.99
Vi =
measures and call those fulfilling these properties coherent. −100 with probability 0.01 (9)
They ensure that investments which are more risky are
Now compare the following two portfolios, the first
assigned a higher value of the risk measure.
consisting of 100 units of asset k, 1 ≤ k ≤ 100, and the second
As long as returns of an investment have an ellipti-
portfolio consisting of one unit of each of the 100 assets.
cally shaped distribution, e.g., are normally distributed,
The diversification of risks in the second portfolio has clearly
it can be shown that VaR is always a coherent risk mea-
reduced the risk as a complete loss of the investment is highly
sure. However, in general VaR does not have to be Zero
unlikely. When calculating the 95% VaR, VaR0.95,∆T , we can
coherent as the following two examples show.
easily derive that for the first portfolio it is –500 and for the
The VaR is defined by a cutoff return, R*, beyond
second, diversified portfolio it is 200. Therefore, according
which the distribution of returns is not further evaluated.
to the VaR measure, the second portfolio should be riskier.
Hence any losses larger then R* are not weighted according
This clearly contradicts our previous argumentation. There-
to their size and are thus treated alike. Suppose you are
fore the risk measure is also in this case not coherent.
making one of the investments whose left tails of the return
22 EXPLORING THE LIMITATIONS OF VALUE AT RISK: HOW GOOD IS IT IN PRACTICE? WINTER 2003
In both of these examples the actual risk and the option prices for this purpose. The predicted and the real-
results from the risk measure are not consistent with each ized distribution of outcomes in many cases show large
other, and hence VaR is not a coherent risk measure. It differences, despite these forecasting efforts. Exhibit 5 shows
can be shown that the reason for this finding is that unless how much the distribution of outcomes, and hence the
specific conditions on the distribution of outcomes are VaR, may differ between subsequent years. We used the
met, VaR is not subadditive, i.e., VaR(V1 +V2) ≤ VaR(V1) daily returns of the Nasdaq Composite Index from March
+ VaR(V2) is generally not valid. 1999 to March 2000 and from March 2000 to March 2001,
In many cases the conditions for the validity of sub- respectively. Usually the distribution is estimated with only
additivity are (at least approximately) fulfilled and VaR is relatively few and recent data, e.g., from the past year, sup-
(nearly) coherent. But remarkably, the payoff distribu- posing that the change of the distribution is smaller the
tions of options and similar derivatives mostly violate these closer the data are to the present time. But as this example
conditions. And it was the use of such instruments that shows, even then large deviations may be observed.
caused many of the large losses in the early 1990s and led Even if the problem of forecasting the future distri-
to the development of VaR. Another example of payoffs bution of outcomes can be overcome with a reliable fore-
that violate the conditions for VaR being coherent are
catastrophic risks. Here the large losses that occur with a
very small probability, with the payoffs otherwise having EXHIBIT 5
the desired properties, cause this violation. Example of the Changes in the Distribution in
Subsequent Years for the Nasdaq Composite Index
We may summarize this section by stating that VaR
March 1999-March 2001
is not a coherent risk measure, and thus the VaR may in
certain situations not reflect the risk adequately. We show 5000.00
4000.00
ESTIMATING THE VAR
First get the facts, then you can distort ’em as much 3500.00
as you want.
— Mark Twain 3000.00
VaR US$m
icantly, increasing losses further by not allowing for the
2.5
diversification effect to work as anticipated. To overcome
this shortcoming, VaR should be complemented by stress 2
testing to enable risk managers to include such events into
their overall risk assessment. 1.5
1
Estimation Error 0 0.02 0.04 0.06 0.08 0.1
1–c
Unfortunately, even with this issue resolved, there
remains a problem with the estimation of the VaR itself. As
the true probability distribution is not known in general, EXHIBIT 7
it has to be estimated from the data. A good estimation of Standard Errors in Parametric and
the lower tail of the distribution is of essential importance Non-Parametric Estimation
for the VaR estimation. The complication arises from the 0.6
fact that by definition only few observations are made at the
Quantile estimation
tails, and hence the estimation errors are large. 0.5 Normal distribution
Kendall [1994] shows that the standard error of the
c-quantile q is given by
Standard Error
0.4
0.3
c(1 − c)
SE(q) = (10)
T f (q) 0.2
24 EXPLORING THE LIMITATIONS OF VALUE AT RISK: HOW GOOD IS IT IN PRACTICE? WINTER 2003
A further assessment of the error can be derived by EXHIBIT 8
comparing the estimated VaR with the maximal VaR that Estimated and Maximal VaR
is consistent with the data. This relationship can be derived
using the theorem of Bienaymé-Chebyshev, which states 25
that for each random variable x with a finite variance σ2 Estimated VaR
Maximum VaR
and expected value µ it is 20
VaR US$m
15
2
σ
P rob(|x − µ| ≥ ε) ≤ (12) 10
ε2
Assuming a symmetric distribution we get 5
0
1 σ2 0 0.02 0.04 0.06 0.08 0.1
P rob(x − µ ≤ ε) ≤ (13) 1–c
2 ε2
Companies are usually exposed to multiple risks, and Any statistical relationship will break down when
the correlations between those risks have a crucial impor- used for policy purposes.
tance for the VaR estimate as they determine the strength — Charles Goodhard
of the diversification effect which reduces the total risk. Exploiting the Estimation Bias
Like the VaR estimate, the estimate for the correlation
matrix is usually based on a relatively small number of past When using VaR to restrict the risk taking within a
data, typically one year of daily data or about 250 data company or as the basis for performance-related pay, the
easily do so by exploiting the bias in the VaR estimate as 0.8 Distribution after manipulation
explained in the previous section. They choose a portfolio
Cumulative Probability
0.7
whose VaR estimate is lower than the actual risk taken. As
0.6
the expected return is determined by the “true” risk, the
0.5
trader will, at least on average, generate a large profit with
a relatively low VaR, compensating him for the true risk. 0.4
26 EXPLORING THE LIMITATIONS OF VALUE AT RISK: HOW GOOD IS IT IN PRACTICE? WINTER 2003
market crash at the NYSE, where portfolio insurance activ- agement system can replace the sound judgment of man-
ities, designed to prevent large losses, actually aggravated agers, and those using it should be aware of its limits.
them. Hence a regulation of risk via VaR, and probably any The benefits of the simplicity of VaR cannot be
other strict mechanism, can under certain circumstances actu- underestimated. A much more precise and improved
ally increase the systemic risk that it was intended to reduce. method that is not understood by decision makers is of
much less value than an easily understood method, even
THE BENEFITS OF USING VAR if it gives only a rough estimate of the risks involved, pro-
vided these limits are understood. Results based on sys-
You don’t want the decimal point—you want the tems that are not understood are either ignored or used
order of magnitude. without the necessary precautions. In both cases deci-
— Lev Borodovsky sions are likely to be inferior.
28 EXPLORING THE LIMITATIONS OF VALUE AT RISK: HOW GOOD IS IT IN PRACTICE? WINTER 2003