Professional Documents
Culture Documents
A time series spot price forecast model for the Nord Pool market
Tarjei Kristiansen
9000 lborg, Denmark
a r t i c l e
i n f o
Article history:
Received 11 January 2013
Received in revised form 16 February 2014
Accepted 7 March 2014
Available online 2 April 2014
Keywords:
Nord pool
Spot price forecasting
Regression model
Electricity market
a b s t r a c t
We present three relatively simple spot price forecast models for the Nord Pool market based on historic
spot and futures prices including data for inow and reservoir levels. The models achieve a relatively
accurate forecast of the weekly spot prices. The composite regression model achieves a mean absolute
percentage error (MAPE) of around 7.5% and under-forecasts the actual spot price by some 1.4 NOK/
MW h in the sample period. Out of sample testing achieves a MAPE of around 7.4% including a match
of the actual spot price. A myopic model using the previous weeks spot price as a predictor for the next
weeks spot price achieves a MAPE of 7.5% and under-forecasts the actual spot price by some 0.9 EUR/
MW h. A futures model using the futures price for next week as a predictor for next weeks spot price
achieves a MAPE of 5.3% and over-forecast the actual spot price by some 4.3 EUR/MW h.
2014 Elsevier Ltd. All rights reserved.
Introduction
Short-term price forecasting presents a crucial activity for the
protability of generation companies, end-user companies and
trading companies. It enables them to optimize their trading strategies. The planning horizon for generation and consumer companies typically spans weeks and/or months. Thus there is a need
for models that can forecast the average weekly spot prices. Time
series models represent an important category of models [1]. These
models typically utilize some external variables related to supply
and/or demand to explain pricing behavior. Other approaches include multiple support vector machine [2], extreme learning machine [3], auto-regressive fractionally integrated moving average
[4] and Grey models [5].
The most commonly applied model for medium term spot price
forecasting in the Nord Pool market is the EMPS model (or Samkjringsmodellen). It is a fundamental stochastic dynamic programming model for hydro-thermal scheduling. The results from
the model are proprietary and differ for each market player making
it difcult to facilitate a comparison. Likewise the power market
analytics service provider Point Carbon [6] utilizes a stochastic
dual dynamic programming (SDDP) model which they claim yields
improved accuracy compared to the EMPS model. However they
provide no numerical results to back up their statements. In sum,
these models may appear as black box approaches to outsiders
which do not have access to the inputs. Therefore they are unable
to adjust these if necessary.
Tel.: +45 201144160.
E-mail address: tarjeikr@yahoo.com
http://dx.doi.org/10.1016/j.ijepes.2014.03.007
0142-0615/ 2014 Elsevier Ltd. All rights reserved.
In this paper we develop three relatively simple spot price models that utilize previous weeks spot prices, futures prices including
data for inow and reservoir levels. The models provide relative
high precision spot price forecasts for the Nord Pool market.
The structure of the paper is as follows. Section Literature
review provides a literature review. Section The Nordic power
market describes the Nordic power market and its fundamental
parameters. Section Spot forecasting models develops three spot
price forecasting models. Section Model performance measures
the model performances. Section Conclusion concludes.
Literature review
Weron [7] analyzes forecasting of day-ahead electricity prices
but nds that only some approaches are suitable. Time series models are among the models with strongest forecasting ability. Weron
[7] also identies that single hour model specications achieve
better forecasting accuracy than multi-hour model specications.
Time series models include:
Auto-regressive (AR) models.
Auto-regressive moving average (ARMA).
Auto-regressive integrated moving average (ARIMA) and seasonal
ARIMA models [8,9].
Auto regressions with heteroskedastic [10].
Heavy-tailed innovations [11].
AR models with exogenous (fundamental) variables.
Dynamic regression (or ARX) and transfer function (or ARMAX)
models [12].
Vector auto regressions with exogenous effects [13].
21
Torro [1] developed an ARIMAX model to forecast weekly futures prices in the Nord Pool market for the period 19972003.
The time-series model contains external variables such as temperature, precipitation, reservoir levels and the basis (futures price
minus the spot price) which, overall, reects the typical seasonal
patterns in the weekly spot price. Torro [1] achieved an average error of 0.78 NOK/MW h for one week ahead futures but provided
no model for spot price forecasting. Torro [1] also proposed two
alternative and simpler forecasting methods:
A myopic method that utilizes the present spot price as a forecast for next week. Torro [1] explains that a myopic method can
be considered as the minimum accuracy deemed from any forecasting method.
A futures method that utilizes the present futures price as a
proxy for next weeks spot price.
We utilize these models as benchmarks in this paper.
Fig. 1. Average, minimum and maximum weekly spot prices in the Nord Pool market from 1999 to 2010.
Fig. 2. Average, minimum and maximum reservoir levels in percentage in Norway for the period 19992010.
22
Fig. 3. Average, minimum and maximum weekly inow levels in GWh for Norway and Sweden for the period 19992010.
drier than the period 19962001 (i.e. lower reservoir levels). The
period 20072010 included high reservoir levels for 2007 and
2009 while 2010 had low reservoir levels.
The relationship between reservoir levels and electricity futures
prices at Nord Pool has been studied by Gjolberg and Johnsen [20]
and Botterud et al. [21]. The authors consistently found that the
reservoir level and its seasonality dependence largely explain the
price formation for spot and future prices. Storage theory states
that seasonality generates seasonal patterns in spot and futures
electricity prices [1]. Reservoir levels impact the sensitivity of demand and supply shocks. A shock occurring at a high reservoir level is uncritical. Conversely a shock at a low reservoir level is
harder to match and may cause rising prices. Full reservoirs may
cause spill and reduced prots. Thus a negative convenience yield
(i.e. high reservoir levels) may cause producers to sell at lower
prices rather than risking spill [1]. The result would be lower spot
prices than futures prices. Conversely a positive convenience yield
(i.e. low reservoir level), may cause higher spot prices than futures
prices. Likewise, precipitation expectations inuence the power
price formation.
The inow levels to reservoirs from snow melting peak between
weeks 18 and 27 as shown in Fig. 3. Conversely during the winter,
inows are at their lowest levels for the weeks 117.
Table 1
Correlation between the natural logarithm of the de-meaned weekly spot prices in week t and week t 1, the natural logarithm of inow levels in week t 1 in Norway and
Sweden in GWh and the natural logarithm of change in reservoir level in week t 1 minus week t 2 in Norway in percent of maximum reservoir level.
ln average spot
week t
1
0.97
0.32
1
0.32
0.21
0.22
0.91
23
Time series
5.4914
0.01
15
in the previous week and the preceding week in percent of maximum reservoir level, a, b, d are regression coefcients and et is assumed to be independent and identically distributed with zero
mean and nite variance.
To estimate the parameters of the suggested model we have
obtained historical data for weekly spot prices, inow levels in
Norway and Sweden and reservoir levels in Norway for the period
from week 44 in 1999 to week 52 in 2010. The results of the regression are shown Tables 2 and 3. The R square equals 0.97 and is
relatively high. In regression analysis, the t-stat, coupled with its
p-value, indicates the statistical signicance of the relationship between the independent and dependent variable. All p-values are
below 0.05 and indicates that the selected variables provide a suitable description of the spot price in the current week.
The characteristics for a stationary time series are a constant
mean and variance. Therefore a time series with trend is nonstationary. Table 4 shows the augmented dickey fuller (ADF) test
for stationary1 conditions for the time series from 1999 to 2010.
The critical values for the ADF test based on F-Statistic are 3.98
for p = 0.01 and 3.68 for p = 0.025 when the number of observations is equal to 500. The ADF test gives p = 0.01 and a Dickey Fuller
test statistic of 5.4914.
We calculated the Hurst exponent to 0,9775 which indicates
that the time series have persistence or long term positive autocorrelation.
In addition we utilize the models proposed by Torro [1] as
benchmarks:
A myopic method that utilizes the present spot price as a forecast for next week. Torro [1] explains that a myopic method can
be considered as the minimum accuracy required from any
forecasting method
A futures method that utilizes the closing price of futures for the
last workday in the preceding week as a proxy for next weeks
spot price.
Model performance
To measure the performance of the models we have used the
mean absolute percentage error (MAPE) dened as the average
absolute difference between the actual value and the forecast value
divided by the actual value:
Table 2
Regression statistics for the auto-regressive model.
Regression statistics
Multiple R
R Square
Adjusted R square
Standard error
Observations
0.9746
0.9498
0.9495
0.1056
581
Table 3
Regression coefcients and associated statistics for the auto-regressive model for the
19992010 data period.
Intercept
ln S t 1
ln inf t 1
ln delta res t 1
Table 4
Augmented Dickey Fuller (ADF) test for the period 19992010.
Coefcients
Standard error
t Start
p-Value
0.3285
0.9684
0.0412
0.5858
0.1122
0.0101
0.0142
0.2027
2.9279
96.3215
2.9038
2.8899
0.0035
0.0000
0.0038
0.0040
n
1X
At F t
MAPE
n t1 At
where A is actual value and F is forecast value.
The performance and accuracy of the price forecasting models
are summarized in Table 5. Likewise we have charted the forecasting models against the actual spot price in Fig. 4.
1
Stationarity, is dened as a quality of a process in which the statistical parameters
(mean and standard deviation) of the process do not change with time. The most
important property of a stationary process is that the auto-correlation function (ACF)
depends on lag alone and does not change with the time at which the function was
calculated. A weakly stationary process has a constant mean and ACF (and therefore
variance). A truly stationary (or strongly stationary) process has all higher-order
moments constant including the variance and mean.
24
Table 5
Weekly MAPE and absolute price deviation (NOK/MW h) for the three proposed
models.
Model
Regression
model
Myopic
model
Futures
model
Deviation forecast
and spot price
(NOK/MW h)
7.5%
1.4
19.6
7.5%
0.9
19.9
5.3%
4.3
14.2
MAPE
Table 6
Regression coefcients and associated statistics for the 19992006 data period which
is used for the out sample model testing.
Intercept
ln S t 1
ln inf t 1
ln delta res t 1
Coefcients
Standard error
t Stat
p-Value
0.3888
0.9589
0.0497
0.6289
0.1380
0.0128
0.0175
0.2483
2.8179
74.8375
2.8340
2.5332
0.0051
0.0000
0.0048
0.0117
Table 7
Weekly MAPE and absolute price deviation (NOK/MW h) for the out of sample period.
The regression model achieves a mean absolute percentage error of 7.5% and under-forecasts the spot price by 1.4 NOK/MW h.
This result is comparable with what Kristiansen [17] achieved with
a weekly mean absolute percentage error (WMAE) of around 67%.
However the price deviation was somewhat higher and in the
range 1.72.6 NOK/MW h.
The myopic model achieves a MAPE 7.5% and over-forecasts the
spot price by some 0.9 NOK/MW h. The futures model achieves a
MAPE 5.3% and over-forecasts the spot price by some 4.3 NOK/
MW h.
Model
MAPE
7.4%
0.0
Fig. 5. Autocorrelation function for the residual (condence interval in red). (For interpretation of the references to colours in this gure legend, the reader is referred to the
web version of this paper.)
25
Fig. 6. Partial autocorrelation function for the residual (condence interval in red). (For interpretation of the references to colours in this gure legend, the reader is referred
to the web version of this paper.)
Fig. 7. Autocorrelation function for the squared residual (condence interval in red). (For interpretation of the references to colours in this gure legend, the reader is referred
to the web version of this paper.)
Fig. 8. Partial autocorrelation function for the squared residual (condence interval in red). (For interpretation of the references to colours in this gure legend, the reader is
referred to the web version of this paper.)
spot and the regression models there were some larger deviations
for the end of 2002.
To further test the regression model we performed out of sample test by estimating the regression coefcients in Table 6 from
the period week 44 in 1999 to week 52 in 2006. The R square
was 0.97. We subsequently applied the regression parameters to
data for the period from 2007 to 2010.
26