Professional Documents
Culture Documents
Technical Note
a r t i c l e i n f o a b s t r a c t
Article history: Automation in cardiac arrhythmia classification helps medical professionals to make accurate decisions
Received 9 December 2015 upon the patient’s health. The aim of this work is to evaluate the performance of five different linear and
Received in revised form nonlinear unsupervised dimensionality reduction (DR) techniques namely principal component analysis
16 September 2016
(PCA), fast independent component analysis (fastICA) with tangential, kurtosis and Gaussian contrast
Accepted 20 December 2016
functions, kernel PCA (KPCA) with polynomial kernel, hierarchical nonlinear PCA (hNLPCA) and principal
polynomial analysis (PPA) on classification of cardiac arrhythmias using probabilistic neural network
Keywords:
classifier (PNN). The design phase of the classification model comprises of the following stages: pre-
Biomedical signal processing
Decision support systems
processing of the cardiac signal by eliminating detail coefficients that contain noise, feature extraction
Feature extraction through Daubechies wavelet transform, dimensionality reduction through unsupervised DR techniques,
Supervised learning and arrhythmia classification using PNN. PCA is a widely used DR technique for mapping high dimen-
sional data to its low dimensional representation. But real world data like electrocardiogram (ECG) signals
are complex and nonlinear in nature. This work concentrates on performance analysis of four nonlinear
DR techniques and conventional linear PCA technique on classification of cardiac arrhythmias. Entire
MIT-BIH arrhythmia database is used for experimentation. The experimental results demonstrate that
the combination of PNN classifier (at spread parameter, = 0.4) and fastICA DR technique with tangential
contrast function exhibit highest F score of 99.83% with a minimum of 10 dimensions. hNLPCA and KPCA
requires more computation time for low dimensional mapping. PPA performs about 10% better than PCA
and serves intermediate between linear and nonlinear techniques.
© 2016 Elsevier Ltd. All rights reserved.
http://dx.doi.org/10.1016/j.bspc.2016.12.017
1746-8094/© 2016 Elsevier Ltd. All rights reserved.
2 R. Rajagopal, V. Ranganathan / Biomedical Signal Processing and Control 34 (2017) 1–8
Table 1
Number of heartbeats in each class. Connuous ECG recordings from MIT_BIH
Heartbeat type N S V F Q arrhythmia database
Full database 87643 2646 6792 794 15
3. Materials used
The MIT BIH arrhythmia database [25] is used in this work. It 4.1. Data preprocessing
contains 48 half–hour excerpts of two channel ambulatory ECG
recordings, obtained from 47 subjects studied by the BIH arrhyth- The records contain continuous ECG recording for duration
mia laboratory. The recordings were digitized at 360 samples per of 30 min. The raw ECG signals include baseline wander, motion
second per channel with 11 bit resolution over 10 mV range. The artifact and power line interference noise. The discrete wavelet
reference annotations for each beat were included in the database. transform (DWT) is used for denoising the ECG signal and also for
Four records containing paced beats (102, 104, 107 and 217) were extracting the important features from original ECG signal [26,27].
removed from analysis as specified by AAMI. The total number of DWT captures both temporal and frequency information. The DWT
heart beats in each class is given in Table 1. of the original ECG signal is computed by successive high pass and
R. Rajagopal, V. Ranganathan / Biomedical Signal Processing and Control 34 (2017) 1–8 3
low pass filtering of that signal. This can be mathematically repre- 4.2. Feature extraction
sented in the following Eqs. (1) and (2),
The entire database (97,890 heartbeats) is divided into ten sets
∞ each containing 9789 heartbeats. Nine sets are used for train-
yhigh [k] = x [n] g [2k − n] (1) ing (88,101 heartbeats) and one set for testing (9789 heartbeats).
n=−∞
From each heartbeat, wavelet based features are extracted by
∞ using Daubechies wavelet (‘db4 ). Daubechies wavelet with level
ylow [k] = x [n] h [2k − n] (2) 4 decomposition is selected in this work after making perfor-
n=−∞
mance comparisons with discrete Meyer wavelet and other levels
of Daubechies wavelet including ‘db2 , ‘db6 . The power spectral
where, x[n] is the original ECG signal samples, g and h are the density of ECG beats showed that most of the ECG signal variations
impulse response of high pass and low pass filters respectively, occur in the fourth sub band frequency range of (0–11.25 Hz) and
and yhigh [k],ylow [k] are the outputs of high pass and low pass fil- (11.25–22.5 Hz). A total of 107 features are produced by the 4th
ters after sub sampling by 2. This procedure is repeated until the level approximation sub band coefficients and another 107 fea-
required decomposition level is reached. The low frequency com- tures by the 4th level detail sub band coefficients. Fig. 4 shows
ponent is called approximation and high frequency component is the decomposition of heart beat signal (sampled at 360 Hz) into
called detail. approximation and detail sub bands up to level 2. This procedure
In this work, the raw ECG signals are decomposed into approxi- can be repeated for further decomposition.
mation and detail sub bands up to level 9 using Daubechies (‘db8 )
wavelet basis function [28]. The detail coefficients contain most of
the noise information. Hence, soft thresholding is applied to the
detail coefficient at each level. Denoised ECG signal is computed
based on the original approximation coefficient at 9th level and 4.3. Dimensionality reduction
modified detail coefficient of levels from 1 to 9. After denoising,
the continuous ECG waveform is segmented into individual heart- The coefficients of approximation and detail sub bands at level
beats. This segmentation is done by identifying the R peaks using 4 decomposition of Daubechies wavelet transform are used as
Pan Tompkins algorithm [29] and by considering 99 samples before features to represent each heart beat. Hence each heartbeat is rep-
R peak and 100 samples after R Peak. This choice of 200 samples, resented by 214 features (107 coefficients from approximation sub
including R peak for segmentation is taken because it constitutes band summed with 107 coefficients from detail sub band pro-
one cardiac activity with P, QRS and T waves. Figs. 2 and 3 show a vide 214 features). In order to extract useful information from the
segment of the recorded ECG waveform of patient identifier: 209 features and to remove redundancy the following dimensionality
before and after preprocessing respectively. reduction techniques are applied.
4 R. Rajagopal, V. Ranganathan / Biomedical Signal Processing and Control 34 (2017) 1–8
1 Data centering
2 2
This step produces a data set whose mean (x̄) is zero and is
Level 1 detail coefficients shown in (6).
h[n] g[n] (90 to 180 Hz)
XC = X − E{X} (6)
2 Whitening
Level 2 approximation Level 2 detail coefficients
coefficients (45 to 90 Hz)
Eigenvector V and Eigen value E are computed from the covari-
(0 to 45 Hz)
ance matrix of centered data, and whitening is performed as in
Fig. 4. Two level wavelet filter bank. (7),
Z = E−1/2 V ∗ XC (7)
4.3.1. Linear technique
Linear mapping of high dimensional data to low dimensional 3 Fixed point iteration for one unit
data is done with this technique. The important advantages of linear
technique are its simple geometric interpretations and attractive It estimates one row of the demixing matrix w as a vector wT .
computational properties [11]. Estimation of w proceeds iteratively as in (8) until convergence is
achieved.
4.3.1.1. Principal component analysis (PCA). PCA that maximizes the
amount of variance in the data is used in this work. The steps fol- • Selection of initial random vector w
lowed for dimensionality reduction of data matrix X consisting of
n ECG samples xi (i &1028; {1, 2, .,n}) with D features (The value of wp = E{x g(wT x)} − E{xg (wT x)}w (8)
D is 214 in the proposed work due to 214 features extracted from
each heartbeat using wavelet transform) are given below, where, wp ∗ = wp /wp and wp is the norm of w and g is the contrast
function.
1 Subtraction of the mean
4 Evaluation of next independent components
This step produces a data set whose mean x̄ is zero.
To estimate the other independent components, step 3 is
2 Calculation of covariance matrix C is shown in (3). repeated for getting weight vectors wi , i = 2, 3,.n. To prevent dif-
ferent vectors from converging to the same optimum, the weight
C = (x − x̄)(x − x̄)T (3) vectors are decorrelated using Gram-Schmidt orthogonalization
p T w )w and renormalize
and are given by wp+1 = wp+1 − j=1 (wp+1
j j
3 Calculation of eigenvectors V and eigenvalues E is shown in (4) wp+1 as wp+1 = wp+1 / T w
wp+1 p+1 .
V−1 CV = E (4)
4.3.2.2. Kernel principal component analysis (Kernel PCA). Using
4 Sorting of eigenvectors Kernel PCA [31], the linear operations of PCA are done with non-
linear mapping. Kernel PCA computes the principal eigenvectors
of the kernel matrix instead of computation from the covariance
Eigen vector with highest eigenvalue is the principal compo-
matrix. The steps involved are,
nent of the data. Eigen vectors are ordered by eigenvalue, highest
to lowest.
1 Computation of kernel matrix K
5 Projection of data is shown in (5)
For the data samples {xi }, kernel matrix is computed. The entries
T T
in the kernel matrix are given by ki,j = k(xi , xj ), where k is a kernel
Projected data = V (x − x̄) T (5)
function such as Gaussian or polynomial.
Arrhythmia classification is done by varying the selection of
components from 1 to 10 in the projected data and the results are 2 Double centering of K
analyzed.
It means subtracting the mean of the data in the feature space
4.3.2. Nonlinear techniques defined by the kernel function and is shown in (9).
1 1
Nonlinear techniques have the ability to deal with complex
1 1
nonlinear real world data. The following nonlinear techniques are ki,j = − (ki,j − kil − l kjl + 2 klm ) (9)
2 n l n n lm
experimented to analyze its performance in comparison with tra-
ditional PCA. Where, n is the number of data samples.
R. Rajagopal, V. Ranganathan / Biomedical Signal Processing and Control 34 (2017) 1–8 5
F score in %
96
85 KPCA-POLY
94 PPA
fastICA-pow3 75 PPA
92
90 fastICA-tanh 65 fastICA-pow3
1 2 3 4 5 6 7 8 9 10 fastICA-tanh
fastICA-gauss 55
Number of dimensions
hNLPCA 1 2 3 4 5 6 7 8 9 10 fastICA-gauss
Number of dimensions hNLPCA
Fig. 5. Results of average accuracy of PNN classifier for different DR techniques and
dimensions. Fig. 9. Results of average F score of PNN classifier for different DR techniques and
dimensions.
100
95 PCA Table 3
Sensivity in %
90
85 KPCA-POLY Variation of average F scores with respect to different spread values during PNN
80 classification using fastICA and Kernel PCA DR techniques.
75 PPA
70 Spread parameter Average F Score
65 fastICA-pow3
60 FastICA–tanh KernelPCA–poly
fastICA-tanh
55
fastICA-gauss 0.1 78.11 92.92
1 2 3 4 5 6 7 8 9 10
0.2 99.62 93.05
Number of dimensions hNLPCA 0.3 99.75 93.05
0.4 99.83 92.97
Fig. 6. Results of average sensitivity of PNN classifier for different DR techniques 0.5 99.83 92.97
and dimensions. 0.6 95.87 92.97
0.7 95.87 92.9
0.8 95.87 92.74
100 PCA 0.9 95.87 92.52
Specificity in %
98 1 95.72 91.66
KPCA-POLY
96 PPA
94 fastICA-pow3 Table 4
Variation of average F scores with respect to different spread values during PNN
92 fastICA-tanh classification using PCA, hNLPCA and PPA DR techniques.
1 2 3 4 5 6 7 8 9 10
fastICA-gauss
Number of dimensions Spread parameter Average F Score
hNLPCA
PCA hNLPCA PPA
Fig. 7. Results of average specificity of PNN classifier for different DR techniques 0.01 61.77 79.13 61.21
and dimensions. 0.02 76.74 92.53 89.19
0.03 78.19 90.51 91.2
0.04 78.6 85.91 91.49
7 0.05 78.74 80.98 91.49
False posive rate in %
[20] Hari Mohan Rai, Anurag Trivedi, Shailja Shukla, ECG signal processing for [28] B.N. Singh, A.K. Tiwari, Optimal selection of wavelet basis function applied to
abnormalities detection using multi-resolution wavelet transform and ECG signal denoising, Digit. Signal Process. 16 (2006) 275–287, http://dx.doi.
artificial neural network classifier, Measurement 46 (2013) 3238–3246, org/10.1016/j.dsp.2005.12.003.
http://dx.doi.org/10.1016/j.measurement.2013.05.021. [29] J. Pan, J.W. Tompkins, A real time QRS detection algorithm, IEEE Trans.
[21] Behnaz Ghoraani, Sridhar Krishnan, Discriminant non-stationary signal Biomed. Eng. (1985) 32, http://dx.doi.org/10.1109/tbme.1985.325532.
features clustering using hard and fuzzy cluster labeling, EURASIP J. Adv. [30] Aapo Hyvarinen, Fast and robust fixed point algorithms for independent
Signal Process. 2012 (1) (2012) 1–20, http://dx.doi.org/10.1186/1687-6180- component analysis, IEEE Trans. Neural Netw. (1999) 626–634, http://dx.doi.
2012-250. org/10.1109/72.761722.
[22] Fahim Sufi, Ibrahim Khalil, Abdun Naser Mahmood, A clustering based system [31] J. Shawe Taylor, N. Christianini, Kernel Methods for Pattern Analysis,
for instant detection of cardiac abnormalities from compressed ECG, Expert Cambridge University Press, Cambridge, UK, 2004, http://dx.doi.org/10.1017/
Syst. Appl. 38 (2011) 4705–4713, http://dx.doi.org/10.1016/j.eswa.2010.08. CBO9780511809682.
149. [32] M. Scholz, R. Vigario, P.C.A. Nonlinear, A new hierarchical approach, Proc.
[23] Sung Nien Yu, Combining independent component analysis and back ESANN (2002) 439–444.
propagation neural network for ECG beat classification, Proc. Conf. IEEE Eng. [33] V. Laparra, S. Jimenez, D. Tuia, G. Camps-Valls, J. Malo, Principal polynomial
Med. Biol. Soc. (2006) 3090–3093. analysis, Int. J. Neural Syst. (2014) 26.
[24] Can Ye, B.V.K. Vijayakumar, M.T. Coimbra, Heartbeat classification using [34] M. Llamedo, Martinez Juan Pablo, Heartbeat classification using feature
morphological and dynamic features of ECG signals, IEEE Trans. Biomed. Eng. selection driven by database generalization criteria, IEEE Trans. Biomed. Eng.
(2012) 59. 58 (2011) 616–625, http://dx.doi.org/10.1109/TBME.2010.2068048.
[25] G.B. Moody, R.G. Mark, The impact of the MIT BIH arrhythmia database, IEEE [35] M. Llamedo, J.P. Martinez, An automatic patient adapted ECG heartbeat
Eng. Med. Biol. 20 (2001) 45–50, http://dx.doi.org/10.1109/51.932724. classifier allowing expert assistance, IEEE Trans. Biomed. Eng. 59 (2012)
[26] E.B. Mazomenos, D. Biswas, A. Acharyya, Taihai Chen A low-complexity ECG 2312–2320, http://dx.doi.org/10.1109/TBME.2012.2202662.
feature extraction algorithm for mobile healthcare applications, IEEE J. [36] Z. Zhang, J. Dong, X. Luo, K. Choi, X. Wu, Heartbeat classification using disease
Biomed. Health Inform. 17 (2013) 459–569, http://dx.doi.org/10.1109/TITB. specific feature selection, Comput. Biol. Med. 46 (2014) 79–89, http://dx.doi.
2012.2231312. org/10.1016/j.compbiomed.2013.11.019.
[27] Li Deqiang, Pedrycz Witold, J. Nicolino Pizzi, Fuzzy wavelet packet based [37] C.L. Herry, M. Frasch, H. Wu, Characterization of ECG patterns with the
feature extraction method and its application to biomedical signal synchrosqueezing transform. (2015). arXiv:1510.02541.
classification, IEEE Trans. Biomed. Eng. 52 (2005) 1132–1139, http://dx.doi.
org/10.1109/TBME.2005.848377.