Professional Documents
Culture Documents
a r t i c l e i n f o a b s t r a c t
Article history: Predicting pressure-volume-temperature (PVT) properties of black oil is one of the key processes required
Received 10 December 2016 in a successful oil exploration. As crude oils from different regions have different properties, some re-
Revised 9 May 2017
searchers have used API gravity, which is used to classify crude oils, to develop different empirical cor-
Accepted 9 June 2017
relations for different classes of black oils. However, this manual grouping may not necessarily result in
Available online 12 June 2017
correlations that appropriately capture the uncertainties in the black oils. This paper proposes intelligent
Keywords: clustering to group black oils before passing the clusters as inputs to the functional networks for predic-
PVT tion. This hybrid process gives better performance than the empirical correlations, standalone functional
API gravity networks and neural network predictions.
Clustering © 2017 Elsevier Ltd. All rights reserved.
Functional networks
Neural network
Black oil
1. Introduction the industrial demands for less complex, quicker, cheaper and ac-
ceptable solutions.
The API gravity (γ API ) is one of the important pressure-volume- Empirical correlations for predicting PVT properties are gener-
temperature (PVT) properties of crude oils. Crude oil is classified ally developed by performing linear or non-linear regression analy-
based on this property to determine its heaviness which conse- sis using easily acquired crude oil properties as the inputs. In order
quently determines its marketability. Table 1 shows a typical oil to improve the performance of the correlations, some researchers
classification based on γ API (Dandekar, 2013; De Ghetto, Paone, & have used γ API to group crude oils into two or more groups and
Villa, 1995). Knowledge of the γ API and other PVT properties such determine correlations for each group (Kartoatmodjo & Schmidt,
as bubblepoint pressure (Pb ), oil formation volume factor (Bo ) and 1994).
oil viscosity are important for determining future production or oil For more reliable and improved prediction performance of
reserves from petroleum wells. these PVT properties, other researchers have implemented ma-
Some of the PVT properties, e.g.γ API , can easily be measured chine learning [ML] algorithms to predict different PVT proper-
or determined onsite while others, e.g. Pb and Bo , are ideally de- ties (El-Sebakhy et al., 2007; Gharbi, Elsharkawy, & Karkoub, 1999;
termined through laboratory experimentation. However, this labo- Khoukhi, Oloso, Elshafei, Abdulraheem, & Al-Majed, 2011; Osman
ratory analysis, which requires special expertise, is expensive and & Al-Marhoun, 2005). Also, to address the problem in modelling
time consuming. Hence, there is a need for a less rigorous, cheaper ANN, a recursive least squared algorithm has been used for learn-
and quicker solution. ing feedfoward ANN to model crude oil blending process (de Jesús
For a long period of time, the petroleum industry has used Rubio, 2016). However, none of the previous works found in the lit-
equations of states (EOS) for determining these PVT properties. erature which have applied ML techniques in this field, have taken
However, the EOS are considered computationally complex and re- diversity of API or other input properties into consideration.
quire extensive detailed compositions of reservoir fluids. Conse- For instance, if the data that are used to train an ML algorithm
quently, many empirical correlations have been developed to meet have more light crude oils than heavy oils, then the model is con-
fined to the constraints of the light oils. This type of data is called
“imbalanced data set” which is well known in the classification
∗
problem (Ramyachitra & Manikandan, 2014).
Corresponding author.
This paper proposes a hybrid solution of K-Means clustering
E-mail addresses: up748008@myport.ac.uk, munirudeen.oloso@port.ac.uk (M.A.
Oloso), mohamed.hassan@port.ac.uk (M.G. Hassan), mohamed.bader@port.ac.uk and functional networks (FN) for predicting crude oil PVT prop-
(M.B. Bader-El-Den), james.buick@port.ac.uk (J.M. Buick). erties. K-means clustering is used to generate clusters of the input
http://dx.doi.org/10.1016/j.eswa.2017.06.014
0957-4174/© 2017 Elsevier Ltd. All rights reserved.
364 M.A. Oloso et al. / Expert Systems With Applications 87 (2017) 363–369
Aside the EOS, there are two main categories of methods for 3.1. K-means clustering
predicting oil reservoir PVT properties in the literature: empirical
correlations and ML techniques. K-means is a widely used data mining technique. It puts a num-
ber of input observations into a number of clusters which must
2.1. Empirical correlations for predicting oil PVT properties have been defined a priori. There is no defined rule for choosing
an optimal number of clusters for a given dataset. The goal of clus-
Many correlations have been developed for the estimation of Pb tering is to allow natural grouping of data (Jain, 2010). The imple-
and Bob . Usually, the procedure involves linear and/or non-linear mentation, pros and cons of the K-Means algorithm have been dis-
regression analyses on the available datasets. cussed in the literature (Jain, 2010).
Standing developed graphical correlations for both Pb and Bob
based on 105 sets of experimental data (Standing, 1947). These 3.2. Implementation of functional networks
graphical correlations were later expressed by equations (Standing,
1977). The input variables that Standing used for the Pb correlation FN were introduced as a powerful alternative to neural net-
are: gas solubility (Rs ), gas specific gravity (γ g ), γ API and reser- works (Castillo, 1998; Castillo, Cobo, Gutiérrez, & Pruneda, 20 0 0).
voir temperature (T). For the Bob correlation, the correlating pa- Unlike ANN, FN have the advantage that they use domain knowl-
rameters were: Rs , γ g , oil specific gravity (γ o ) and T. On this ba- edge in addition to data knowledge. The network initial topology
sis, many other correlations have been developed for both Pb and can be derived based on the modeling of the properties of the real
Bob (Al-Marhoun, 1988; Almehaideb, 1997; Al-Shammasi, 2001; De world. Once this topology is available, functional equations allow
Ghetto et al., 1995; Jarrahian, Moghadasi, & Heidaryan, 2015; Pet- one to obtain a much simpler equivalent topology.
rosky & Farshad, 1998; Vazquez & Beggs, 1980). It is noted that Several other improvements on the ANN algorithm for differ-
some of these correlations have replaced γ API with γ o and vice- ent applications can be found in the literature (Aljarah, Faris, Mir-
versa. These two variables are related by Eq. (1). jalili, & Al-Madi, 2016; Chairez, 2016; de Jesús Rubio, Elias, Cruz, &
141.5 Pacheco, 2017; Liu et al., 2016).
γAPI = − 131.5 (1) Simplified general topologies for ANN and FN are shown in
γo
Figs. 1 and 2. In these figures, X1 , X2 and X3 are the inputs into
2.2. Machine learning techniques for predicting oil PVT properties the network. X4 and X5 are the outputs of the hidden layer. Wmn
(m = 4, 5, 6; n = 1, 2, 3, 4, 5) are the weights while Y is the output
The quest for better PVT prediction models to improve the cur- in both cases.
rent performances of the empirical correlations has driven applica- There are some quite significant differences between ANN and
tions of different ML techniques in PVT characterisation. The most FN. Notably, the functions in FN are truly learned during the struc-
M.A. Oloso et al. / Expert Systems With Applications 87 (2017) 363–369 365
Table 2
Performances of different prediction methods for bubblepoint pressure.
Table 3
Performances of different prediction methods for oil formation volume factor at bubblepoint pressure.
Fig. 5. Pb Boxplot for K-Means+FN and Ea /Emax . Fig. 7. Bob Boxplot for K-Means+FN and Ea /Emax .
5. Conclusion
Acknowledgment
A hybrid system using K-Means clustering and functional net-
works has been developed to predict crude oil PVT based on Munirudeen A. Oloso acknowledges the financial support from
worldwide data from a wide range of different crude oils with di- the Petroleum Technology Development Fund (PTDF), Nigeria,
verse thermodynamic properties. The clustering part of the hybrid under the award number PTDF/E/OSS/PHD/OMA/648/14. Special
is inspired by the common API grouping of crude oils. thanks also go to the Shell Petroleum Development Company of
Functional networks which resemble neural networks in archi- Nigeria, GeoMark Research and Professor T.A Blasingame of Texas
tecture have been used for the actual prediction. The neurons in A& M University for supplying part of the data used in this work.
368 M.A. Oloso et al. / Expert Systems With Applications 87 (2017) 363–369
The authors also wish to thank the anonymous referees for their Table B2
Second cluster (375 data points).
helpful comments.
Variable Mean Min Max SD
Appendix Rs 256.273 10.0 0 0 840.0 0 0 150.634
γg 0.951 0.556 1.367 0.179
Statistical measures for the performance analysis YAPI 32.341 11.600 55.0 0 0 6.926
T 163.0 0 0 74.0 0 0 275.0 0 0 48.248
Pb 940.504 70.0 0 0 1593.0 0 0 430.858
A.1. Average percent relative error
Bob 1.183 1.030 1.593 0.102
Table B3
1 n
Er = Ei (A1) Third cluster (379 data points).
n 1
Variable Mean Min Max SD
Where,
Rs 872.562 60.0 1870.0 0 0 334.794
Xexp − X pred γg 0.744 0.570 1.101 0.104
Ei = × 100 (A2) YAPI 36.457 17.40 50.400 6.894
Xexp
i T 189.219 100.0 327.0 0 0 43.938
Pb 3240.123 2559.0 3987.0 345.399
i = 1, 2, . . . ., n Bob 1.487 1.075 2.088 0.222