You are on page 1of 5

South-Eastern European Journal

of Earth Observaon and Geomacs


Issue
Vo3, 2S, 2014

93
Aristotle University of Thessaloniki, Greece
Published online May 2014

Comparison of the accuracy of classicaons generated by
interimage and by interimage integrated with data mining

Denilson Passo
a
, Edilson Bias
a,*
, Ricardo Brites
a
, Gilson Costa
b
a
Instute of Geosciences , University of Braslia , Brazil
b
Department of Electrical Engineering , Poncal Catholic University of Rio de Janeiro, Brazil
*Corresponding author: edbias@unb.br, +556131076977, +556181317202

Abstract: The technological development in remote sensing has been providing a large
increase in the quality of satellite imagery. A notable example is the WorldView II satellite
that has a spaal resoluon of 46 cm. Despite this breakthrough, the classicaon methods
are sll very aected by the spectral mixture due to the multude of features observed in
high resoluon images, especially in urban environments. In the search for a soluon,
soware packages that enable imagery classicaon based on objects, facilitang the
representaon and processing of human knowledge about the specic characteriscs of
targets of interest, have appeared. These systems are based on rules and decision trees,
where they seek to represent the mode of reasoning and knowledge used by experts in
solving problems within its specialty. Among the classicaon systems based on objects, the
InterIMAGE is highlighted as an open source system developed by the Computer Vision
Laboratory in the Department of Electrical Engineering of the Poncal Catholic University of
Rio de Janeiro and the divisions of Image Processing and Remote Sensing in the Naonal
Instute for Space Research. The soware has a specic graphical user interface to support
the denion of decision rules. These rules dene the structured and explicit knowledge of
the user / analyst and are used by the system in the process of interpretaon, where the
values to be inserted in the structure of the rule can be obtained by visual analysis of
aributes histograms calculated for image segments. Once appropriately constructed, its
use is fast and computaonally ecient in separang the classes. In order to avoid errors in
interpretaon of visual aributes and to acquire greater eciency and speed in construcng
rules, authors have used data mining techniques, which help to check which aributes, from
those available in the data set, are more relevant to describe classes automacally and to
provide the thresholds to separate them . Finally, in order to evaluate the classicaons, the
accuracy assessment was conducted making use of three indices: Global, Kappa and Tau.
Keywords: WorldViewII, InterIMAGE,data mining, accuracy assessment.


1. Introducon
Over the past few years there was a signicant increase in the use of satellite remote sensing
imagery, which has contributed to new research, mainly targeng improvements in
classiers methodologies.
Among these alternaves, the methodology for spaal imagery analysis based on objects
GEOBIA (Geographic Object - Based Image Analysis) (Hay and Caslla, 2008) has gained
importance in recent years. It emerges as a signicant advance in the process of automac
classicaon of satellite imagery, as it enables the modeling of human knowledge related to
specic features of the targets, such as brightness, texture, shape, context, size,
neighborhood relaons, among others, with the focus on the classicaon of image
segments instead of isolated pixels.
South-Eastern European Journal
of Earth Observaon and Geomacs
Issue
Vo3, 2S, 2014

94
Aristotle University of Thessaloniki, Greece
Published online May 2014

With respect to object-based soware, currently one can nd proprietary and open source
systems, as is the case of InterIMAGE developed by the Computer Vision Laboratory,
Department of Electrical Engineering, Poncal Catholic University of Rio de Janeiro and the
divisions of Image Processing and Remote Sensing of the Naonal Instute for Space
Research (Costa et al., 2008).
This study was conducted at the satellite city of Sobradinho - DF, distant about 30 miles from
downtown Brasilia - DF, Figure 1, and aimed to compare the potenal and accuracy of a
classicaon generated by InterIMAGE with decision rules constructed through visual
analysis of histograms and classicaon using decision rules built automacally by data
mining and evaluated ulizing accuracy assessment techniques.


Figure 1. Locaon of test-area.

2. Methodological procedures
Figure 2 depicts the methodological procedures for the analysis.


Figure 2. Methodological procedures.

3. Results and discussion
The use of Analysis Manager resorces, available in the InterIMAGE soware, allowed the
renement of the results by the analysis of histograms. The separaon of classes in
InterIMAGE, with decision rules generated automacally by data mining, was possible based
South-Eastern European Journal
of Earth Observaon and Geomacs
Issue
Vo3, 2S, 2014

95
Aristotle University of Thessaloniki, Greece
Published online May 2014

on the informaon of the decision tree created by the WEKA soware (Tab. 1) using the J48
algorithm that redeploys, in the soware, the C4.5 algorithm, proposed by Quinlan (1993).
Figure 3 shows an example of arboreal vegetaon class generated by each of the methods:
rule created in InterIMAGEby visual interpretaon of histograms (Fig. 3a) and rule created in
InterIMAGEusing data mining (Fig. 3b).
Besides assisng in the construcon of rules, simplifying their construcon, data mining also
assisted in verifying which aributes among those available in the data set are the most
relevant to describe classes automacally, they were: image brightness; standard deviaon
of bands 1, 2, 6 and 7, rao using the bands 3 and 1; entropy of bands 2 and 5; rao of the
bands 1, 3 and 8; maximum value of the bands 1, 2, 3 and 4; minimum value of bands 5 and
6 and, nally, two results of the Normalized Dierence Vegetaon Index - NDVI using the
bands 7 and 8 as the infrared, since WorldView II has two bands in the infrared. These
parameters were calculated using, always, the average value of the aribute within the
object (segment).

Table 1. Decision tree created by datamining.
NDVI_b_7<= 0.32
| Rao3 <= 0.12
| | Standard D.7 <= 19.48: Soil (18.0)
| | Standard D.7 > 19.48
| | | BandMeanDiv_3_1 <= 1.39
| | | | MinPixelValue5 <= 134: Shadow (3.0)
| | | | MinPixelValue5 > 134
| | | | | Standard D.1 <= 12.4
| | | | | | MaxPixelValue2 <= 276: Clay le
(3.0)
| | | | | | MaxPixelValue2 > 276: Soil (4.0)
| | | | | Standard D.1 > 12.44: Clay le
(159.0/1.0)
| | | BandMeanDiv_3_1 > 1.39
| | | | Rao3 <= 0.10: Clay le (37.0/1.0)
| | | | Rao3 > 0.10
| | | | | MaxPixelValue1 <= 432
| | | | | | Standard D.6 <= 43.4: Soil (34.0)
| | | | | | Standard D.6 > 43.4
| | | | | | | Brightness <= 500.04: Soil (2.0)
| | | | | | | Brightness > 500.04: Clay le
(2.0)
| | | | | MaxPixelValue1 > 432: Clay le (4.0)
| Rao3 > 0.12
| | MaxPixelValue4 <= 796
| | | Rao1 <= 0.14
| | | | NDVI_b_8<= 0.12
| | | | | MinPixelValue6 <= 190
| | | | | | Standard D.1 <= 28.11: Dark C. T.
(2.0)
| | | | | | Standard D.1 > 28.11: Shadow
(2.0)
| | | | | MinPixelValue6 > 190
connua...
| | | | | | Rao8 <= 0.095
| | | | | | | Standard D.7 <= 29.33: Metal le (2.0)
| | | | | | | Standard D.7 > 29.33: Middle C. T.
(2.0)
| | | | | | Rao8 > 0.095
| | | | | | | Rao8 <= 0.11: Asphalt (142.0/1.0)
| | | | | | | Rao8 > 0.11
| | | | | | | | Entropy5 <= 4.9: Middle C. T.
(6.0/1.0)
| | | | | | | | Entropy5 > 4.9: Asphalt (6.0)
| | | | NDVI_8> 0.12: Dark C. T. (164.0)
| | | Rao1 > 0.14: Shadow (68.0)
| | MaxPixelValue4 > 796
| | | MaxPixelValue3 <= 1147
| | | | Rao3 <= 0.15
| | | | | Entropy2 <= 5.92
| | | | | | Standard D.2 <= 37.17: Middle C. T.
(56.0)
| | | | | | Standard D.2 > 37.17
| | | | | | | Standard D.7 <= 64.18: Metal le (4.0)
| | | | | | | Standard D.7 > 64.18
| | | | | | | Standard D.1 <= 55.02: Middle C. T.
(7.0)
| | | | | | | | Standard D.1 > 55.02: Metal le
(2.0)
| | | | | Entropy2 > 5.92: Metal le (7.0)
| | | | Rao3 > 0.15: Metal le (36.0)
| | | MaxPixelValue3 > 1147: Clear C. T. (53.0)
NDVI_b_7> 0.32
| MinPixelValue5 <= 61
| | BandMeanDiv_3_1 <= 1.04: Woody veg.
(85.0/1.0)
| | BandMeanDiv_3_1 > 1.04: Underwood (2.0)
| MinPixelValue5 > 61: Underwood (100.0)

South-Eastern European Journal
of Earth Observaon and Geomacs
Issue
Vo3, 2S, 2014

96
Aristotle University of Thessaloniki, Greece
Published online May 2014


Figure 2. Samples of decision rules created by visual interpretaon of histogram (a) and by data mining (b).

Visually, one can observe a lile dierence between the two classicaons (Fig. 4).
However, when looking at their confusion matrix (Tab. 2 and 3), it is percepble that in the
classicaon that used the rules created by visual interpretaon of histograms, the class that
got beer separaon was the Metal Tile, explained by its lile mixture with the others,
despite the reduced number of reference samples. The coverage classes Clear Cement Tile
and Dark Ciment Tile also had good separaon, contribung signicantly to percentage
increase of accuracy indices.
By analyzing the confusion matrix of the classicaon that used decision rules created by
data mining (Tab. 3), it is noced that, in general, there was lile dispersion of values , i.e.,
the values were concentrated in the main diagonal, meaning lile confusion between
classes. However, there were sll mixing between classes Caly Tile and Bare Soil, and the
laer, in turn, with the Undergrowth vegetaon class, which may have occurred due to
some areas experiencing sparse distribuon or dry leaves, due to the season, exposing soil
and hindering the spectral separaon of the classes.
In both classicaons the Asphalt class, got good accuracy indices due to the use of a
shapele of roads which served as a mask.


Figure 3. Themac map resulng from the classicaon created by visual interpretaon of histograms (a) and
datamining (b).




South-Eastern European Journal
of Earth Observaon and Geomacs
Issue
Vo3, 2S, 2014

97
Aristotle University of Thessaloniki, Greece
Published online May 2014


Table 2. Confusion matrix of the classicaon created by
InterIMAGE using interpretaon of histograms.
Table 3. Confusion matrix of the classicaon
created by InterIMAGEusing data mining.

4. Conclusions
The results showed that the classicaon using data mining to build the decision rules was
more ecient to map the coverage of urban areas, than the classicaon performed solely
for the visual analysis of histograms generated in InterIMAGE itself. The percentages of
overall accuracy (global) for the classicaon generated by InterIMAGEwith rules generated
by visual interpretaon of histograms achieved overall index, or gobal, (Hellden et al., 1980),
Kappa index (Cohen, 1960) and Tau Index (Kleka, 1980) of 90.6%, 89.4% and 89.5%,
respecvely, while the result of classicaon rules generated by data mining reached values
of 92.6%, 91.6% and 91.7% respecvely.

References
Cohen J., 1960, Coecient of Agreement for Nominal Scales. Educaonal and Measurment.
Volume XX, Issue 1: 37-46.
Costa G., Pinho C., Feitosa R., Almeida C., Kux H., Fonseca L., Oliveira D., 2008, INTERIMAGE:
Uma Plataforma Cogniva Open Source para a Interpretao Automca de
Imagens Digitais. Brazilian Journal of Cartography, Volume 60: 331-337.
Hay G., Caslla G., 2008, Geographic Object-based Image Analysis (GEOBIA):a new name for
a new discipline. In Object-based image analysis-spaal concepts for knowledge-
driven remote sensing applicaons, Blaschke T., Lang S., Hay G. (editors), Springer
Verlag, Berlin.
Hellden U., Stern M., 1980, Evaluaon of Landsat Imagery and Digital Data for Monitoring
Desercaon Indicators in Tunisia. Proceedings 14
th
Int. Symp. on Rem. Sens. of
Environ., pp. 1601-1611.
Klecka W., 1980, Discriminant Analysis. SAGE Publicaons, Inc., Beverly Hills, California, pp.
71.
Quinlan R., 1993, C4.5: Programs for Machine Learning Morgan Kaufmann Publishers. Inc.,
San Francisco, CA, USA.

You might also like