Similarity Between Models

Similarity between Models
and the
Measurement of Model Risk
P. Hnaff
IAE Paris
Universit Paris 1 Panthon-Sorbonne
26 Nov 2012
IAE-Logo
Model Risk
Model Risk Defined
Model Risk
I
Risk related to the use of the wrong model (i.e. a model

that does not account for important risk factors)
Risk related to poor or unstable calibration of the model
Both issues translate into :

I
Inaccurate initial pricing
Inaccurate risk indicators, leading to ineffective hedging

strategy
All of this while the model correctly prices benchmark

instruments.
IAE-Logo
Model Risk
Model Risk Defined
Some References
An inventory of model risks : Derman (1996) Model Risk

Goldman Sachs
Dynamic hedging simulations and empirical assessment of

model risk : Rubinstein (1985), Bakshi, Cao and Chen
(1997), Madan & Eberlein (2007).
Overall, incorporating stochastic volatility and
jumps is important for pricing and internal
consistency. But for hedging, modeling stochastic volatility alone yields the best performance.
IAE-Logo
Model Risk
Model Risk Defined
Some References
N. El Karoui, et al. Robustness of the Black and Scholes

formula. Mathematical Finance (1998).
T.J. Lyons. Uncertain volatility and the risk-free synthesis

of derivatives. Applied Mathematical Finance (1995).
R. Cont. Model uncertainty and its impact on the pricing of

derivative instruments, Mathematical Finance (2006).
PH and C. Martini. Model validation : theory, practice and

perspectives, The Journal of Risk Model Validation (2011).
IAE-Logo
Model Risk
Model Risk Defined
From Observation to Measurement
Basel Committee directive of July 2009 on model risk :

I
Quantify model risk
Provision the risk with Tiers I capital
Needed : A measure of model risk

I
expressed in currency terms
standardized across asset classes and models, to allow for

meaningful comparisons.
IAE-Logo
Model Risk
Problem Statement
A definition of model risk (R. Cont [2006])

I Set of benchmark instruments,
HiI payoffs,
the mid-market prices, C [C bid , C ask ].
CiI
i
i
i
Q Set of n models, consistent with the market prices
of benchmark instruments :
Q Q E Q [Hi ] [Cibid , Ciask ], i I
(1)
Upper and lower price bounds over the family of models :

(X ) = sup E Qj [X ] (X ) =
j=1,...,n
inf
j=1,...,n
E Qj [X ]
Risk measure :
IAE-Logo
Q = (X ) (X )
Model Risk
Problem Statement
Focus of talk
The set of models Q, consistent with the market prices of

benchmark instruments :
(2)
How to normalize Q in order to make meaningful comparisons

between various models and asset classes ?
IAE-Logo
Model Risk
Problem Statement
Focus of talk
(3)
Define a notion of size for Q, in order to normalize the risk

measure.
n
o
Q = Q|E Q [Hi ] [Cibid , Ciask ], i I, | D(Q, P) <
(4)
Where P is the reference model, and D(Q, P) a measure of
similarity between Q and P.
IAE-Logo
Similarity between models

Observe n samples Xi , i = 1, . . . , n from a multivariate
distribution. ci : number of occurrences of Xi .
Definition (Multivariate Likelihood)

Probability of observing an empirical distribution Q : qi =
model P is true.
n! Y ci
L(Q|P) = Q
pi
ci !
ci
n
if
0 L(Q|P) 1
Average log-likelihood :

L(Q|P)
= log L(Q|P)1/n
IAE-Logo
Kullback-Leibler Divergence from Q to P
lim L(Q|P)
=

qi log
qi
pi
= DKL (Q|P)
eDKL (Q|P) : average likelihood of observing distribution Q when
P is the actual process.
IAE-Logo
Illustration : Bivariate Normal Density

Avg. likelihood of P
if Q is true
0 2 4
X
0.8
0.4
Avg. Likelihood
0.0
Bivariate normal densities
0 8 43 8 2
F IGURE: KL Divergence of 2 bivariate normal densities, as a function

IAE-Logo
of angle between PC axis of covariance matrix.
Illustration : Bivariate Normal Density

Avg likelihood of P
if Q is true
0 2 4
X
0.8
0.4
0.0
=0
= 0.8
Avg. Likelihood
Bivariate normal densities
1.0
0.0
1.0
F IGURE: KL Divergence of 2 bivariate normal densities, as a function

IAE-Logo
of correlation.

Similarity of model features
Model Features
Definition (Model Features)
The model features X are the set of observations on the risk
factors that are needed to determine the payoff of a financial
instrument.
Example :
I
European option on a futures at expiry (T ) :

X = {F (T , T )}
Option expiring at T1 , on a spread between two futures

contract expiring at T1 and T2 :
X = {F (T1 , T1 ), F (T1 , T2 )}
IAE-Logo

Why Focus on Model Features ?

For some payoffs, different models have similar features.
dF (t, T )
F (t, T )
= dWt
Var(ln(F (T , T ))) = 2 T
dF (t, T )
F (t, T )
= e(T t) dWt
Var(ln(F (T , T ))) =

2
1 e2T
2
(5)
(6)
(7)
(8)
IAE-Logo
Feature F (T , T ) is log-normal in both cases.

Why Focus on Model Features ?

... but the feature F (t, T ), t < T does not have the same
variance.
0.10
0.05
0.00
Integrated variance
0.15
BS
Onefactor
0.0
0.5
1.0
1.5
2.0
Time
F IGURE: Integrated variance of log(F (t, T ))
IAE-Logo

Computation of KL Divergence
Calculation of KL-Divergence
Kullback-Leibler divergence between two equivalent probability
distributions, with observed features in R d .
I
Discrete density
D(Q, P) =

qi log
i
I
qi
pi
Continuous density

Z
D(Q|P) =
Rd
fQ (x) log
fQ (x)
fP (x)

dx
IAE-Logo

Kernel-based density estimation

Let Xi , i = 1, . . . , n be a sample of random vectors in U R d ,
drawn from a distribution with density function f . The kernel
density estimate of f , noted f is defined by :
n
X
f (X ) = 1
KH (X Xi )
n
i=1
where :
X is a feature vector in R d
H is a symmetric, positive definite matrix of rank d
K is the kernel function : a symmetric multivariate
density :
1
KH (X ) = kHk 2 K (H 2 X )
IAE-Logo

Kernel function
Use a uniform density over a domain which is function of the
local density of sample points.
IAE-Logo
F IGURE: Uniform density over domains of varying size

Uniform kernel density
K (x) =
1kxxi kRx
Rxd cd
with :
Rx radius of uniform density
d dimension of features space
cd volume of unit sphere in R d
Define Rx as the distance to the k -th nearest neighbor of x.
IAE-Logo

KL Divergence
Using the expression for density in terms of k-nn statistics one
gets :

d X
k (U, X )
|U|
+
log
(9)
D(Q, P) = log
|V |
|V |
k (V , X )
X V
With :
d dimension of feature space
U, V sample sdrawn from Q and P
k (U, X ) distance to k -th nearest neighbor of X in sample
U.
See : Boltz, Debreuve and Barlaud. High-dimensional statistical
measure for region-of-interest tracking. IEEE Transactions on IAE-Logo
Image Processing (2009).

Determining the optimal k
Total Calculation time T is proportional to nk

T = C n (1 + k )
Accuracy, measured by the variance of the D(Q|P)

estimator, increases with k .
IAE-Logo

Variance of kNN density estimator

(x) be a kNN density
Assume a sample of size n, and let p
estimator with k neighbors. Let V (k , x) be the volume of the
sphere containing k neighors of x.
(x) =
p
k 1
nV (k , x)
Proposition

(x)) =
VAR (p

(k 1)(n 1)
(x)2
1 p
(k 2)n)
See : D.G. Luenberger and P. Woehrmann. On kNN Density

Estimation. Working Paper Series, FINRISK.
IAE-Logo

Variance of D(Q, P)
D(Q, P) = C +
d X
[log (k (U, X )) log (k (V , X ))] (10)
|V |
X V
We need to estimate (ignoring correlation between k (U, X )

and k (V , X )) :
VAR log (k (U, X ))
IAE-Logo

Variance of D(Q, P)
U (x) =
p
k 1
nk (U, x)cd
IAE-Logo

Variance of D(Q, P)
U (x) =
p
k 1
nk (U, x)cd
Use :
k (U, x) =
k 1
U (x)cd
np
and
Var (f (X )) f 0 (E(X ))2 Var(X )
IAE-Logo

Variance of D(Q, P)
U (x))
Use Luenbergers expression for Var(p
U (x) = 1/n)
The samples points are iid (p
V (x) = 1/n
Approximate also p
to get :

Var(k (U, x)) =

(k 1)(n 1)
1
(k 2)n
IAE-Logo

Variance of D(Q, P)
Finally :
Var(DL(Q, P)) =
2n
|U|

(k 1)(n 1)
1
(k 2)n
IAE-Logo

Optimization of k for a given computation time

Following Luenbergers method : Compute D(Q|P) by
repeating the calculation N times for fixed n and k , with n = k .
1X
D(Q, P)i
D(Q,
P) =
N
i
Total calculation time is :

T = Nn(1 + k )
and

Var D(Q,
P) =
1
N(k 2)
IAE-Logo

Optimization of k for a given computation time

min
k
1
N(k 2)
such that :
Nn(1 + k ) = T
= n
k
which has the solution :
s
k =2+
4+
IAE-Logo

Estimation of T = C(1 + k ). = 0.091
1.1
1.0
0.9
0.8
0.7
0.6
0.5
0.4
10
12
14
16
F IGURE: D(Q, P) computation time as a function of k
IAE-Logo

Empirical validation
0.017
0.016
0.015
0.014
0.013
0.012
0.011
0.010
0.0092
10
12
14
F IGURE: D(Q, P) computation time Variance as a function of k .

Empirical minimum is k = 8 vs. predicted k = 7
IAE-Logo

Summary : Construction of Q
1. Simulate a sample V of feature vectors X for reference
model P.
2. Define a list {Qj } of candidate models. This list may be
built by perturbation of the parameters of P, or by
postulating alternate stochastic processes. There are no
constraints on the method used.
3. Simulate samples Uj of feature vectors X for each model
Qj .
4. Retain models Qj such that :

E Qj (Hi ) C bidi , C aski
i I
IAE-Logo

A Toxicity Index
Toxicity Index at level
When using model P to price payoff X with features F . Define

model risk as :
Q = (X ) (X )
Over normalized set
Q :{Q|eD(Q|P) > , E Q [Hi ] [Cibid , Ciask ]}
0<1
IAE-Logo

A Toxicity Index
Calculation Steps for Toxicity Index
Calculation of model risk associated with the use of model P for

pricing derivative X .
1. Calibrate model P on a set of benchmark instruments.
2. Compute the minimum and maximum value of the exotic
derivative according to each model in set Q .
3. Model risk is finally computed by
Q = (X ) (X )
IAE-Logo
Numerical Illustration
data
Illustration
I
I
Benchmark :SPX options

Three families of models :
1. Double Exponential Jump Diffusion (Kou & Wang 2001)
2. Stochastic Volatility (Heston 1993)
3. Stochastic Volatility with Jumps (Bates 1996)
Model risk of exotic options on SPX
IAE-Logo
data
Data
IAE-Logo
F IGURE: SPX implied volatility by log moneyness and time to expiry
Model Universe
Double Exponential Jump Diffusion [Kou & Wang,

2001]
dSt
=
St

Nt
X
1 2
dt + Wt +
Yi
2
i=1
(11)
fY (y ) = p1 e1 y 1y 0 + (1 p)2 e2 y 1y <0
Nt is a Poisson process with rate .
1
16.15
2
1.85
.206
p
.225
.118
IAE-Logo
Model Universe
Calibration of Double Exponential Jump Diffusion

Model
F IGURE: Bid/Ask implied volatility and fitted volatility.
IAE-Logo
Model Universe

Model
F IGURE: Fit by quartile. red : worst, green : best, yellow : middle
IAE-Logo
Model Universe

Model
F IGURE: Best quartile. Blue dot : best fit
IAE-Logo
Model Universe
Calibration of Hestons Model
dSt
= dt + t dWts
St
dt = ( t )dt + t dWt
.1104
.5045
.3591
-.74
t
.0272
IAE-Logo
Model Universe
F IGURE: Bid/Ask implied volatility and fitted volatility.
IAE-Logo
Model Universe
F IGURE: Fit by quartile. red : worst, green : best, yellow : middle
IAE-Logo
Model Universe
F IGURE: Best quartile. Blue dot : best fit
IAE-Logo
Model Universe
Model Risk : Asian option in Hestons model

25
Asian option 2Yr ATM - Heston
Price range (P=103.40)
20
15
10
5
0
1.00
0.95 0.90 0.85 0.80 0.75 0.70 0.65 0.60 0.55
Average Likelyhood
F IGURE: Model risk as a function of average likelihood of observing

market data when the alternate model is true
IAE-Logo
Model Universe
Model Risk : Asian option in Bates model

Asian option 2Yr ATM - Bates
25
Price range (P=103.40)
20
15
10
5
00.9
0.8
0.7
0.6
0.5
Average Likelyhood
0.4
0.3
0.2

market data when the alternate model is true
IAE-Logo
Model Universe
Model Risk (Heston+Bates)
80
60
40
Model Risk
Asian
Lookback
Range
20
0.0
0.2
0.4
0.6
0.8
1.0
KL Divergence

market data when the alternate model is true (inverted x axis)
IAE-Logo
Conclusion
Conclusion
I
We have described a procedure, inspired from the field of

image processing, for computing the degree of similarity
between models, independently of the definition of these
processes.
We then use this measure to compute a normalized

measure of model risk.
The definition of the set of alternate models Q is crucial to

the measure.
We have presented an implementations that is

computationally intensive but does not put any restrictions
on the composition of Q.
IAE-Logo
Technical Details
Implementation details
model calibration and asset pricing : QuantLib/python

I
I
I
I
QuantLib : www.quantlib.org
Python wrapper : pyql www.github.com
size of Q : 2000 parameter perturbations per model.

calculation of D(Q|P) :
I
I
20,000 scenarios per Qj

KL-Divergence by k -nn algorithm : FNN R package
(www.r-project.org)
IAE-Logo

Similarity Between Models

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Similarity Between Models

Uploaded by

Copyright:

Available Formats

Similarity between Models

Risk related to the use of the wrong model (i.e. a model

Risk related to poor or unstable calibration of the model

Both issues translate into :

Inaccurate initial pricing

Inaccurate risk indicators, leading to ineffective hedging

All of this while the model correctly prices benchmark

An inventory of model risks : Derman (1996) Model Risk

Dynamic hedging simulations and empirical assessment of

N. El Karoui, et al. Robustness of the Black and Scholes

T.J. Lyons. Uncertain volatility and the risk-free synthesis

R. Cont. Model uncertainty and its impact on the pricing of

PH and C. Martini. Model validation : theory, practice and

From Observation to Measurement

Basel Committee directive of July 2009 on model risk :

Quantify model risk

Provision the risk with Tiers I capital

Needed : A measure of model risk

expressed in currency terms

standardized across asset classes and models, to allow for

A definition of model risk (R. Cont [2006])

Upper and lower price bounds over the family of models :

The set of models Q, consistent with the market prices of

How to normalize Q in order to make meaningful comparisons

Q Q E Q [Hi ] [Cibid , Ciask ], i I

Define a notion of size for Q, in order to normalize the risk

Similarity between models

Similarity between models

Definition (Multivariate Likelihood)

Similarity between models

Kullback-Leibler Divergence from Q to P

Similarity between models

Illustration : Bivariate Normal Density

Bivariate normal densities

F IGURE: KL Divergence of 2 bivariate normal densities, as a function

Similarity between models

Illustration : Bivariate Normal Density

Bivariate normal densities

F IGURE: KL Divergence of 2 bivariate normal densities, as a function

Similarity between models

European option on a futures at expiry (T ) :

Option expiring at T1 , on a spread between two futures

Similarity between models

Why Focus on Model Features ?

Feature F (T , T ) is log-normal in both cases.

Similarity between models

Why Focus on Model Features ?

F IGURE: Integrated variance of log(F (t, T ))

Similarity between models

Similarity between models

Kernel-based density estimation

Similarity between models

F IGURE: Uniform density over domains of varying size

Similarity between models

Uniform kernel density

Similarity between models

Similarity between models

Determining the optimal k

Total Calculation time T is proportional to nk

Accuracy, measured by the variance of the D(Q|P)

Similarity between models

Variance of kNN density estimator

See : D.G. Luenberger and P. Woehrmann. On kNN Density