Professional Documents
Culture Documents
Abstract—This paper proposes a method of feature selection and assumption that every feature is independence, the ideal
classifcaition based on ant colony algorithm for hyperspectral decreasing dimension can not be acquired, the abundant
remote sensing image. After all features are randomly projected information contained by hyperspectral imaging can not be
on a plane, each ant stochastically selects a feature on the plane sufficiently mined. Accordingly, studying a method of efficient
firstly, and then decides which route to be selected in terms of the feature selection which feature can represent many
criterion function among features. Whereafter the feature multispectral features is necessary[4,5].
combination is formed. At last, using combination feature, the
classification of AVIRIS image is carried out by maximum There are several feature searching methods for
likelihood classifier. In order to verify the effectiveness of this hyperspectral image. But we can find the limitation of optimal
algorithm, the approach is compared with the classical method and suboptimal method. So we propose a approach
suboptimal search technique, using AVIRIS images as a data set. based on ant colony algorithm, in this approach, firstly all
Experimental results prove the processing that based on ant features are randomly projected on a plane, each ant
colony algorithm is more effective and is fit for the band selection stochastically selects a feature on the plan, and then decides
of hyperspectral image. which route to be selected in terms of the criterion function
between features. Two criterion functions are selected. Using
Keywords-component; feature selection; classification; ant hyperspectral remote sensing images (acquired by the airborne
colony algorithm; hyperspectral remote sensing images
visible/infrared imaging spectrometer [AVIRIS] sensor) as a
data set, and the maximum likelihood classifier to classify the
I. INTRODUCTION selected features, experiments are performed.
Hyperspectral remote sensing provides very high spectral The paper is organized as follows. Section 2 summarizes
resolution image data and the potential for discrimination of ant colony algorithm. Section 3 gives a detail of the feature
subtle differences in ground covers, but the vast amount of data selection based on ant colony algorithm for hyperspectral
bring a lot of difficulty to the subsequent interpretation and remote sensing image. Section 4 validates the effectiveness of
analysis. Previous research has demonstrated that high- the proposed method. Finally, section 5 concludes this paper.
dimensional data spaces are mostly empty, indicating that the
data structure involved exists primarily in a subspace. As a
result, there is a need that can reduce the dimensions of the data II. ANT COLONY ALGORITHM
to the right number without losing the original information that Ant colony algorithm (ACA) is a cooperative search
allows for the separation of class [1-3]. technique that mimics the foraging behavior of real life ant
colonies. The ants rapidly establish the shortest route from food
Either feature selection or feature extraction can reduce data
source to their nest. Inspired by the food searching behaviors of
dimensions. But feature extraction strategies are often complex.
ants, Marco Dorigo et al. proposed ant colony algorithm[6]. At
Mostly, aiming at the classification of hyperspectral imaging,
the beginning, ACA builds solution to the traveling salesman
losing of radiation or reverberation information coming from
problem (TSP). In 1991, Deneubourg et al. proposed a model
objects on the ground and contained by original bands may be
that explains the phenomenon of ants clustering according to
produce. The strategy of feature selection can reserve all kinds
the behavior of ant colonies in clustering their corpses and
of object information on the ground, but the plenty of
sorting their larvae, then, the model is mainly applied to robotic
hyperspectral imaging bands and the strong correlation
implementation[7]. Lumer and Faieta in 1994 developed
between bands can not make some of the methods about
Deneubourg et al.’s model[8]. In their model a population of
feature selection perform. Because of the computation or the
Δτ ij (t , t + 1) = ∑ Δτ ijk (t , t + 1) (2) −1
k =1 1 ⎡ Σi + Σ j ⎤ 1 (Σi + Σ j ) / 2
Jij (B) = (μ j − μi )T ⎢ ⎥ (μ j − μi ) + ln 1/ 2 (7)
where, ρ represents the evaporation degree of pheromone 8 ⎣ 2 ⎦ 2 Σ ⋅ Σ 1/ 2
i j
concentration on path (i,j) after one cycle, and 1 − ρ represents
the factor of residual pheromone, in order to prevent the Where n is the number of classes ( n = 9 , for our image
infinite accumulation of pheromone, usually, the value of ρ is set);
taken in [0,1); Δτ ijk (t , t + 1) is the increase of pheromone J ij (B) is the Bhattacharyya distance between the ith and
concentration laid on path (i,j) by kth ant between time t and jth classes;
t+1, and at the initial time, Δτ ij (0) = 0 . μi is the mean vector of the ith class;
Σ i is the covariance matrix of the ith class.
b. Jeffries-Matusita (JM) distance Experiments using AVIRIS data was carried out to validate
our feature selection algorithms. Overall, 2572 pixels were
n −1 n
selected to form a training set. Each pixel was characterized by
∑∑
− J ij ( B )
JM = 2 2(1 − e ) (8) the 200 features related to the channels of the sensor. On the
i =1 j = i +1` condition of selecting different number of spectral bands,
Where, n is the number of the classes ( n = 9 , for our image separately using BH distance, JM distance as the criterion
function, simulating experiments on these data are performed.
set); J ij (B) is the Bhattacharyya distance between the ith and
The error matrix and the classification accuracies acquired by
jth classes. the ACA algorithm using JM distance based on 15 bands
We assume that there are Gaussian class distributions for selected are showed in table 1.
the computation of the BH distance and the JM distance. As
BH and JM are distance measures, the larger the obtained TABLE I. ERROR MATRIX ACQUIRED BY THE ACA ALGORITHM USING
distance, the larger the expectation of ant colony, the better the JM DISTANCE BASED ON 15 BANDS SELECTED
solution. Ground Classification
C1 C2 C3 C4 C5 C6
truth accruacy
C. Classification of hyperspectral image C1 1347 81 6 0 0 0 93.93%
C2 109 855 4 0 0 0 88.33%
The features selected from the data set are combined to C3 0 7 736 4 0 0 98.53%
form composite image. In terms of the ground truth distribution C4 0 0 7 1285 0 2 99.30%
and the number of classes, after selecting training sample areas C5 0 0 1 0 488 0 99.80%
in the image, the statistical parameters of ground objects are C6 1 0 0 0 0 211 99.53%
acquired through the trained samples. The maximum likelihood On the condition of selecting different number of spectral
classification is carried out to the combination image. bands, separately using BH distance, JM distance as the
criterion function, we made comparison the effectivenesses of
IV. EXPERIMENT RESULTS proposed method and other suboptimal SFFS techniques(See
Fig.3). It is showed in the Fig.3 that the classification accuracy
To testify the effectiveness of the proposed method for using JM distance is higher than that using BH distance based
feature selection, experiments are conducted on an AVIRIS on the SFFS method, especially when the number of the bands
image of mixed agriculture and forestry in Northwestern is up 10 and down 20. And these experiments show that the
Indian, USA recorded in June 1992. The image set was method proposed by this paper is effective in band selection of
composed of 220 spectral bands acquired in the 0.4—2.5μm. hyperspectral image when the number of the selected bands is
The spacial resolution of every image is 20m. Water absorption down 20.
bands were removed, leaving 200 of original 220 bands. A
scene 145×145 pixels in size was selected for our experiments
Classification accuracy
Classification accuracy
Figure 1. Band 29th (wavelength range between about 0.68 and 0.69) of the
hyperspectral image utilized in the experiments
C1 Corn-notill
C2 Soybeans-notill
C3 Woods
C4 Wheat
(a) Classified result using (b) Classified result using
C5 Grass/Tree proposed method(BH) proposed method(JM)
C6 Hay-windrowed