You are on page 1of 5

Engineering, 2013, 5, 637-641

http://dx.doi.org/10.4236/eng.2013.58076 Published Online August 2013 (http://www.scirp.org/journal/eng)

Comparisons of Oil Production Predicting Models


Yishen Chen1,2, Xianfeng Ding3, Haohan Liu1,4*, Yongqin Yan5
1
School of Graduate of Southwest Petroleum University, Chengdu, China
2
China Petroleum Engineering Construction Corporation, Beijing, China
3
School of Science of Southwest Petroleum University, Chengdu, China
4
Sichuan College of Architectural Technology, Deyang, China
5
School of Petroleum Engineering Management of Southwest Petroleum University, Chengdu, China
Email: *tsinghua616@163.com

Received June 11, 2013; revised July 11, 2013; accepted July 18, 2013

Copyright 2013 Yishen Chen et al. This is an open access article distributed under the Creative Commons Attribution License,
which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

ABSTRACT
Feasibility of oil production predicting results influences the annual planning and long-term field development plan of
oil field, so the selection of predicting models plays a core role. In this paper, three different predicting models are in-
troduced, they are neural network model, support vector machine model and GM (1, 1) model. By using these three dif-
ferent models to predict the oil production in XINJIANG oilfield in China, advantages and disadvantages of these mod-
els have been discussed. The predicting results show: the fitting accuracy by the neural network model or by the support
vector machine model is higher than GM (1, 1) model, the prediction error is smaller than 10%, so neural network
model and support vector machine model can be used to short-term forecast of oil production; predicting accuracy by
GM (1, 1) model is not good, but the curve trend with GM (1, 1) model is consistent with the downward trend in oil
production, so GM (1, 1) predicting model can be used to long-term prediction of oil production.

Keywords: Oil Field; Oil Production; Model; Predicting Accuracy

1. Introduction diction. In this paper, neural network model and support


vector machine model have been used to predict the oil
Oil production prediction is very important in the oilfield production of XINJIANG oil field, and good predicting
development; study on oil production predicting method results have been achieved. Meanwhile, GM (1, 1) model
is a key topic of petroleum science. At present, there are has also been used to predict oil production. Predicting
four kinds of predicting methods [1-6] with physical result shows that this model is adaptable to the case
meanings: Empirical formula such as Arps method, Hub- which lacks data and hard to establishes model with pro-
bert model, water-flood decline curve method; Hydro- babilistic method.
dynamic model based on fluid mechanics model; Mate-
rial balance equation model; Numerical reservoir simula- 2. Predicting Model
tion model. Besides the above mentioned four methods,
there is a typical type of prediction model related to mo- 2.1. Neural Network Predicting Model
dern optimization, this model type is composed of GM (1, At present, in the application of artificial neural networks
1) model, neural network model, support vector machine (ANN) [7,8], most of them are back propagation (BP)
model, etc. Oil field development system is a complex ANN and their variations. It has been proved that BP
multi variables non-linear dynamical systems, different neural network can approximate any multivariate contin-
predicting model has different characteristics like pre- uous function. Kolmogorov rule guaranteed that any con-
dicting accuracy. Neural network model and support vec- tinuous function or mapping can be achieved by a 3-layer
tor machine model are two effective methods to solve ANN.
multi nonlinear mapping problem. At present they are 3-layer BP ANN is used to establish the ANN model
used in many disciplines, even in the oil production pre- with prediction function. The first layer of BP-ANN is
input layer, the second layer of it is middle layer, and the
*
Corresponding author. third layer of it is the output layer, see Figure 1.

Copyright 2013 SciRes. ENG


638 Y. S. CHEN ET AL.

Output layer:
p

input : ml v jl h j l
k k
l 1, 2, , n
j 1 (2)
output:x k f m k
l l
After training of N samples, the network error is:
1 N n k

xl xlk
2
Etotal .
2 k 1 l 1
Error of output layer unit:

dlk xlk xlk xlk 1 xlk (3)

Error of middle layer unit:


n
Figure 1. BP neural network structure.
l 1

ekj dlk v jl h j k 1 h kj (4)

Network Simulation The connection weights V jl and the threshold val-


Assume N-samples (input-output data samples) are used ue t can be modified by the output layer error
during the training process:
The input variables are

dlk and output value of middle layer unit h kj :
N
U t u1 t , u2 t , , um t , the output variables are
T
V jl q 1 V jl q dlk h kj (5)
k 1

X t x1 t , x2 t , , xn t . F o r t h e k - s a m p l e
T
N
l q 1 l q dlk (6)
k 1, 2, , N ,
l e t. U k u1k , u2k , , usk b e in p u t k 1

The connection weights Wij and the threshold val-


be the expectation
mode vector; X k x1k , x2k , , xnk
ue j can be modified by the middle layer error
output vector; G g , g , , g be the middle layer
k k k k

ekj and input value of input layer

1 2 p
U k u1k , u2k , , umk :
input vector; H h , h , , h be the output unit
k k
1
k
2
k
p N
Wij q 1 Wij q ekj uik
vector of middle layer; M m , m , , m be the
k k k k (7)
1 2 n k 1

input vector of output layer; X x , x , , x be k k k k N


1 2 n
j q 1 j q ekj (8)
the output vector of output layer; k 1

Wij , i 1, 2, , m; j 1, 2,, p be the connection Repeat the above-mentioned learning mode, until the
weights from input layer to middle layer; network converges to a given error range.
V jt , j 1, 2,, p; l 1, 2,, n be the connection
weights from middle layer to output layer; 2.2. Support Vector Regression (SVR) Predicting
j , j 1, 2, , p be the threshold value of middle Model
layer; l , l 1, 2, , n be the threshold value of out-
SVR [9-11] is an effective method to solve the regression
put layer; 0 1 is the learning rate.
problem. This regression problem can be described in
Let the response function of ANN f x be Sigmoid- mathematics:
Given training set T x1 , y1 , , xl , yl x, y ,
l


1
type function: f x 1 e x
where xi x R n , yi y R , i 1, , l . The training
The input and output values of each neural unit satisfy set is composed of independent and identically distrib-
the following relationship: uted sample points following certain probability distribu-
Middle layer: tion p x, y . After giving loss function c x, y, f , the
regression function f x wT x b will be found
s
to make the expected risk R f c x, y, f dp x, y
intput:g j wij ui j i 1, 2, , m; j 1, 2, , p
k k

i 1 reach its minimum value, where x is the nonlinear

j
output:h k f g k k 1, 2, , N
j
mapping, it maps the data x into a high dimensional fea-
ture space; w and b are weight vector and bias value,
(1) separately.

Copyright 2013 SciRes. ENG


Y. S. CHEN ET AL. 639

If we denote the regression problem by minimum risk


problem with the loss function
1

x x 1 , x 2 , , x n ,
1 1 1


c x, y, f x max 0, y y x 0 , then the where x k x
1
k
0
i , k 1, 2, , n, denote the
basic rule of SVR method is established: by introducing
kernel function k xi , x j xi x j into the non-
i 1

linear mapping, the nonlinear regression problem in low- grey derivative of x1 by:
d k x
0
dimensional input space transformed into a linear regres- k x1 k x1 k 1
sion problem in high dimensional feature space and the
z x 2 , x 3 , , x n be the
1 1 1 1
key is to solve the parameter w and b in the regres- Let gener-
ated data series of x , then
1
sion function. Hence, the basic rule of SVR in solving
regression problem is to solve an optimal problem with z k x k 1 x k 1
1 1 1

the following
Hence, denote the gray differential equation model of

i* i *j j K xi , x j
l
1
min GM (1, 1) [12,13] by:
* R 2 l 2 i , j 1

d k az k b
1


l l
i yi i ,
*
i
*
i
i 1 i 1 It is,
form: (9)
i* i 0,
l
x
0
s.t. k az 1 k b (11)
i 1
In Equation (11) x k is the grey derivative, a is
0
C
0 i , i* , i 1, 2, , l the developing coefficient, z k is the albino back-
1
l
ground value, b is the grey functional variable.
where i , i* are Lagrange multipliers; C is a con- Introducing k 2,3, , n into Equation (13) gives
stant, called penalty factor; is a given positive value,
x 0 2 az 1 2 b
it is a maximum error of regression. 0
By solving Equation (9) gives the optimal Lagrange x 3 az 1 3 b


T
multipliers a a1 , a1* , , al , al* , and then gives re-
0 1
gression function when the expected risk R f gets its x n az n b
minimum:
Let

l
f x i* i K xi , x b (10) x 0 2 z 1 2 1
i 1 0 1
x 3 a z 3 1
where the sample satisfied i* i 0 is the support Y ; u b ; B

l
vector; w i i K xi , x .
*
i 1 x0 n z 1 n 1
C
Select j or k* in interval 0, ;
l Then GM (1, 1) model can be expressed as Y Bu ,
If j is selected, then now the question comes down to find the value of a, b .
Introducing least square method to solve the estimat-

b y j i* i K xi , x j
l

ing value:
i 1
a

1
If k* is selected, then u B T B B TY .

b

l
b yk i* i K xi , xk
In Equation (13), if x k is continuous variables
0
i 1
when k 2,3, , n , then x is a function of time t ,
1

when solving regression problem, the proper kernel func- it is x t , so the derivative of x k become
1 0

tion can be selected to SVR training. Introducing sample


dx t
1
factor vector xi and predictor vector x into Equation and the derivative of albino background value
(10) after training gives the prediction result. dt
z k becomes x t . Hence, the grey differential
1 1

2.3. GM (1,1) Predicting Model equation becomes:


0

The Let x x 1 , x 2 , , x
0 0 0
n be initial dx t
1

ax t b
1
(12)
data series, its 1-AGO data series is dt

Copyright 2013 SciRes. ENG


640 Y. S. CHEN ET AL.

7
Equation (12) is the albino type of GM (1, 1) model.
Given initial value x t 1 x 1 , the solution of
1 0

6
Equation (12) becomes:
0 b a t 1 b
x t x 1 e .
1
5
a a
4
3. Applications and Discussions
3
Given the initial oil production data (from 1958 to 2012)
of certain oilfield block in China, then the above-men-
2
tioned three method can be used to predict the oil pro-
duction of different oilfield block (A1, A2, A3). After
using the above-mentioned three different predicting 1
0 10 20 30 40 50 60
model gives Figures 2-4.
Figure 3. comparison of oil production prediction value
In Figures 2-4, the black circle means the real oil pro- with different method in block-A3.
duction, the green curve is the predicting curve with
ANN model, the red curve is the predicting curve with the GM (1, 1) model, the maximum relative error is less
SVR model, the blue curve is the predicting curve with than 10%, ANN model and SVR model can be used to
GM (1, 1) model. Figures 2-4 show the predicting accu- short term prediction. However, the GM (1, 1) model
racy with ANN model and SVR model is higher than predicts the overall trend in oil production decline; it can
be used to middle-long term oil production prediction.
15

4. Conclusions
Prediction with ANN model and SVR model can comply
10 with the actual oilfield production dynamics, the predic-
tion errors of them are less than 10%. However, they are
learning type of model; much data is needed to complete
the prediction, so they are only suitable for the short term
5
prediction;
Although the prediction accuracy with GM (1, 1)
model is low, still the prediction result fits with the over-
all downward trend of oil production, so it can be used as
a reference for long-term prediction.
0
0 10 20 30 40 50 60

Figure 2. comparison of oil production prediction value REFERENCES


with different method in block-A1.
[1] B. Y. Ji, Overview of Oilfield Development Indicators
2.5 Forecasting Methods, Petroleum Geology and Oilfield
Development in Daqing, Vol. 18, No. 2, 1999, pp. 19-22.
[2] J. G. Hu, Y. Q. Chen and S. Z. Zhang, A New Model
2 Using for Prediction of Oil-Gas Field Production, Acta
Petrolei Sinica, Vol. 16, No. 1, 1995, pp. 79-86.
1.5
[3] G. Chen and X. L. Yang, An Examination into Predic-
tion Methods Based on Reservoir Performance Analy-
ses, China Offshore Oil and Gas (Geology), Vol. 16, No.
1 4, 2002, pp. 254-259.
[4] Y. Q. Chen and Q. F. Zhao, A Combined Method of
Weibull Model and Water Drive Curve of B Type, Xin-
0.5 jiang Petroleum Geology, Vol. 21, No. 5, 2000, pp. 405-
407.
0
[5] Y. Q. Chen, J. G. Hu and D. J. Zhang, Deduction of
0 5 10 15 20 25 30 35 40 45 50 Logistic Model and Auto Regression Methods, Xinjiang
Figure 3. comparison of oil production prediction value Petroleum Geology, Vol. 17, No. 2, 1996, pp. 150-155.
with different method in block-A2. [6] P. Yue, X. F. Chen, L. G. Cui, et al., Suggestion Solu-

Copyright 2013 SciRes. ENG


Y. S. CHEN ET AL. 641

tion of Wengs Model and Logistic Model in Production Mechanical Engineering and Automation, Vol. 3, 2007,
Research, Journal of Oil and Gas Technology, Vol. 31, pp. 17-18.
No. 4, 2009, pp. 277-279. [11] K. Ito and R. Nakano, Optimizing Support Vector Re-
[7] B. Yang, Applied Neural Network in Oil Well Logging, gression Hyperparameters Based on Cross-Validation,
Petroleum Industry Press, Beijing, 2005. Proceedings of the International Joint Conference on
[8] X. G. Wu and J. L. Ge, Oil Production Forecast with Neural Networks, Vol. 3, 2003, pp. 2077-2082.
ANN Method, Petroleum Exploration and Development, [12] J. M. Yao, B. S. Yu, et al., Application of the Improved
Vol. 21, No. 3, 1994, pp. 31-42. GM Model to Oil Production Forecasting of Tarim Ba-
[9] C. H. Zhang, Optimization Problems in Support Vector sin, Petroleum Geology and Oilfield Development in
Machines, China Agricultural University Doctoral Dis- Daqing, Vol. 26, No. 1, 2007, pp. 92-96.
sertation, Vol. 5, 2004, pp. 36-39. [13] M. F. Chen and Z. X. Lang, Oil Production Forecasting
[10] Z. H. Feng and J. M. Yang, Practical Selection of Sup- with Modified Functional Simulation Model, Xinjiang
port Vector Machine Parameters for SVM Regression, Petroleum Geology, Vol. 24, No. 3, 2003, pp. 246-248.

Copyright 2013 SciRes. ENG

You might also like