You are on page 1of 6

Proceedings of the 2nd International conference on Electronics, Communication and Aerospace Technology (ICECA 2018)

IEEE Conference Record # 42487; IEEE Xplore ISBN:978-1-5386-0965-1

Geographical Area Mapping and Classification


Utilizing Multispectral Satellite Imagery Processing
Based On Machine Learning Algorithms
Classifying Land based on its use for different purposes

Apurva Saksena, Anushka Ringshia, Arnnava Sharma, Aparna Halbe


saksena.apurva@gmail.com, anushkaringshia@gmail.com, arnnava31596@gmail.com, aparna_halbe@spit.ac.in
Sardar Patel Institute of Technology, Andheri, Mumbai

Abstract—Geographically, a city is characterized as a land cover. Therefore, quantifying these land resources and
patchwork of intensive land-uses. Land-use is the rational and mapping them to measure current situations and how they are
judicious approach of allocating available land resources for changing is critical.[5]
different activities (such as settlements, arable fields, pastures,
and managed woods) within a city. It is a way of utilizing the There are several types of land uses, namely:
land, including the allocation, planning, and management of its ● Residential- includes housing area
resources. The use of a particular patch of land and its physical
● Commercial - for businesses and factories
character are linked. However, research that establishes this link
● Defence
is lacking despite the proliferation of geospatial data. Linking a
city's physical form with its function is the goal of this paper. ● Recreational - comprising of fun and non-essentials
like gardens and parks, tourism
Keywords—Land-cover; land-use; sustainability; development; ● Transit- roads and highways, railways, airports and
remote sensing; mapping; even seaways
● Agricultural - arable farmlands and pastures
I. INTRODUCTION ● Managed woods
Any global city cannot be understood without reference to ● Mining refineries- for coal, petroleum, electricity
its spatial forms such as commercial, residential, industrial, generation and more
marshes/lakes, defence and more.[1] Land use shows how people use the landscape – whether
for development, conservation, or mixed uses. The different
Hence, our goal is to use remote sensing to analyze types of land cover can be managed or used quite
environment variables like vegetation, impervious surfaces differently.[6]
and soil; encoding them into numerical categories and
classifying, to finally link a city’s physical form with its
functions. The aim is to ensure the highest and best use of the
land resources by promoting more efficient utilization,
acquisition, and disposition of land.[2]
Land cover: Land cover is the physical material at the
surface of the earth. It comprises of vegetation and resources
like grass, asphalt, trees, bare ground, water, etc. Land cover
data basically documents how much of a region is covered by
forests, wetlands, impervious surfaces, agriculture, and other
land and water types (including wetlands or open water) [3].
By analyzing satellite and aerial imagery, the land cover can
be determined. Identification, delineation and mapping of this
land cover establish the baseline from which global
monitoring activities like change detection, further studies,
resource management, and planning activities can take place.
The land cover also provides the ground cover information for
baseline thematic maps.[4]
Land use: Land use is a set of functions that can be
applied to the land available to them. Its practices have a Fig.1. Overview of Land cover to Land Use (Images show Landsat
significant impact on the natural resources such as soil, water imagery, Overlay point grid, Image interpretation and Land use to Land cover
and vegetation. Deforestation of the temperate regions, urban maps)
sprawl, soil erosion or degradation, salinization, and
desertification are some of the major effects of land use on

978-1-5386-0965-1/18/$31.00 ©2018 IEEE 1065


Proceedings of the 2nd International conference on Electronics, Communication and Aerospace Technology (ICECA 2018)
IEEE Conference Record # 42487; IEEE Xplore ISBN:978-1-5386-0965-1

II. METHODOLOGY A. Pre-processed Satellite Images


The uniqueness of land-use to particular locations can be A Landsat image is a satellite image. The Landsat program
exploited by combining human expertise with the advantages was started for the primary reason of obtaining a global
offered by machine learning algorithms. The assumption archive of satellite images[9] .WorldView-2 is a commercial
behind this method is that land-use can be modelled in terms Earth observation satellite. WorldView-2 provides
of environmental variables: vegetation, impervious surfaces commercially available panchromatic imagery of 0.46 m
and soil (VIS) . Urban ecosystems are a composite of these (18in) resolution and eight-band multispectral imagery with
three variables and therefore can be observed, quantified and 1.84 m (72 in) resolution.[4] These images are then pre-
measured from satellite images.[7] In this project, the processed to give an intensity image. Intensity images are
morphology of impervious surfaces will be measured and required to prevent unwanted distortions.The WorldView-2
characterized per land-use category. Impervious and pervious image has a better resolution and hence it is used for land-
(vegetation and soil) surfaces can be encoded into numerical cover classification as well [10]. The preprocessed image
categories and classified using machine learning algorithms. undergoes classification using Support-Vector-Machine
In this project, the VIS can be modelled by taking advantage Algorithm in the Spatial Analyst toolbox of the ArcGIS
of the linear correlation of impervious and pervious surfaces in software.[11] The land-cover map is hence obtained which
very high resolution (0.5mx0.5m pixels) and medium will eventually lead to the land-use map.
resolution (30mx30m pixels) satellite images. Impervious
B. Computing the NDVI index
surfaces can then be further characterized according to their
morphology within arbitrarily defined land-use boundaries and The normalized difference vegetation index (NDVI) is a
classified into land-use categories. [8] simple graphical indicator that can be used to analyze remote
sensing measurements, and assess whether the target being
observed contains live green vegetation or not.[12]
Live green plants absorb solar radiation in the photo
synthetically active radiation (PAR) spectral region, which
they use as a source of energy in the process of
photosynthesis. Leaf cells have also evolved to re-emit solar
radiation in the near-infrared spectral region because the
photon energy at wavelengths longer than about 700
nanometers is not large enough to synthesize organic
molecules. The pigment in plant leaves, chlorophyll, strongly
absorbs visible light (from 0.4 to 0.7 µm) for use in
photosynthesis. The cell structure of the leaves, on the other
hand, strongly reflects near-infrared light (from 0.7 to 1.1 µm).
The more leaves a plant has, the more these wavelengths of
light are affected, respectively. It was thus possible to exploit
the strong differences in plant reflectance to determine their
spatial distribution in these satellite images [13] .Red and NIR
stand for the spectral reflectance measurements acquired in the
red (visible) and near-infrared regions, respectively. Using this
index, a certain area on the map can be concluded to be
suitable for agriculture.[14]

Negative values of NDVI (values approaching -1)


Fig.2. General Flowchart
correspond to water. Values close to zero (-0.1 to 0.1)
generally correspond to barren areas of rock, sand, or snow.
Lastly, low, positive values represent shrub and grassland
(approximately 0.2 to 0.4), while high values indicate
temperate and tropical rainforests (values approaching 1).[15]

978-1-5386-0965-1/18/$31.00 ©2018 IEEE 1066


Proceedings of the 2nd International conference on Electronics, Communication and Aerospace Technology (ICECA 2018)
IEEE Conference Record # 42487; IEEE Xplore ISBN:978-1-5386-0965-1

Fig.3. The green regions indicate spaces having NDVI index between 0
to 1 whereas the blue regions have NDVI index between -1 to 0

In this way, the NDVI index for both Landsat and


WorldView-2 images is computed. We hence obtain an
intensity map containing the pixelized version of the map
showing areas with different NDVI index.The difference
between the Landsat and WorldView-2 is that of resolution.
The WorldView-2 has a better and deeper resolution of 0.5m
while Landsat images provide a resolution of 30m.
C. Image Masking
ArcGIS Software is used for image masking.The Image
Masking tool takes an input image, masks it, and produces a
new image which is a copy of the input image, except that the
new image will have its pixel intensity value set to zero (or
some other chosen background intensity value) according to
the mask and the masking operations performed [16]. Hence,
it can be used for sharpening, blurring, embossing or detection
of an edge. Here the input is the satellite image and after
masking, we should get a clear intensity diagram.[17]. Fig.4. Vegetation impervious surface-soil model

D. Encoding into numerical categories E. Classifying using Support-Vector-Machine Algorithm


Impervious and pervious (vegetation and soil) surfaces can The Support-Vector-Machine algorithm is used to classify
be encoded into numerical categories and classified using in the Spatial Analyst Toolbox of ArcGIS software version
machine learning algorithms. Urban ecosystems are an 10.3.1. [11] The working of any machine learning algorithm
agglomeration of these three fundamental variables (V-I-S) depends on the training data set. The initial input on the basis
and therefore can be observed, quantified and measured from of which a machine can learn is crucial for the success of any
satellite images.The basic idea is to encode into numerical machine learning algorithm. We classify land into a cloud,
categories, upon which machine learning algorithms can then water, white roofs, blue roofs, green roofs, dark green roofs,
be applied to classify.[18] light red roofs, brown roofs, light grey surface, dark grey
surface, bare soil, grassland, tree canopy, shadow and
farmland.[19] Whenever any satellite image is input, it will be
classified into these categories as shown in the fig.6 [20].

978-1-5386-0965-1/18/$31.00 ©2018 IEEE 1067


Proceedings of the 2nd International conference on Electronics, Communication and Aerospace Technology (ICECA 2018)
IEEE Conference Record # 42487; IEEE Xplore ISBN:978-1-5386-0965-1

Fig.8. Land Use category

III. OBSERVATION AND RESULTS


Here we have used Pre-Processed Landsat And
WorldView-2 images. After following the above-given
Fig.5. Preliminary 2015 Land-Cover Map of a city methodology we can observe them from a raw satellite image
we can obtain a Land-Cover map using Support-Machine-
Vector Algorithm [10]. We have classified land based the
cover of the land like blue, yellow, green covers but our main
aim of this paper is to obtain a Land-Use [9]. For this, we have
used image masking on the Land-Cover map into categories
like agricultural, commercial, industrial, mixed-use and finally
residential represented by colours green, red, violet, ochre and
yellow respectively. So we can observe how a raw satellite
image has been converted to a Land-Use map which can help
us classify land according to its use.[12]
Fig.6. Land Cover Classes

F. Land Use map A. Land-Use Classification Accuracy


The final output displaying the land-use is obtained by TABLE 1.
masking the land-cover map into categories like agricultural,
commercial, industrial, mixed-use and finally residential. The
respective colour codes are green, red, violet, ochre and
yellow [21]. By displaying the land-use map, the utility of
various areas is revealed and a rough estimate of the soil
condition is obtained. This method is found to be accurate and
reliable. By learning from the training data set, all input
satellite images can be effectively converted to land-use
maps[22].

Fig.9. Accuracy report 1

After performing the Land-Use classification we need to


check for its accuracy to see if it is viable. The sum total of the
values in the rows gives the total number of retrieved
instances. The value in the column gives the number of
relevant instances. So for example, the total number of
retrieved instances for commercial are 4 and the relevant
instances are just 2. However, the number of correctly
retrieved relevant instances is just 1 as there might be 2 times
when commercial has been detected but it was corrected
detected just once. [14]

Fig.7. Output of the same city is the Land Use map

978-1-5386-0965-1/18/$31.00 ©2018 IEEE 1068


Proceedings of the 2nd International conference on Electronics, Communication and Aerospace Technology (ICECA 2018)
IEEE Conference Record # 42487; IEEE Xplore ISBN:978-1-5386-0965-1

TABLE 2. unfair distribution of economic assets, and the loss of


community consciousness. Increasing the sustainability of
communities will require a shift from poorly managed sprawl
to land use planning that can build and keep up efficient
Infrastructure, encourage close-knit neighborhoods and
community consciousness, and preserve the environment.

V. FUTURE WORK
The accuracy of the Land-Use Map Classification should be
around 90 percent ideally to get near perfect results. Therefore,
more efficient and accurate classification algorithms need to be
developed in order to increase the accuracy of the Land-Use
Map Classification. Also, better image processing software
could be developed and used in order to correctly capture and
Fig.10. Accuracy report 2 process a geospatial or a hyperspectral image.

Precision is the fraction of correct relevant instances among


the total number of retrieved instances. A recall is the fraction VI. ACKNOWLEDGMENT
of correct relevant instances that have been retrieved by the
total number of relevant instances. Support is the total number The authors gratefully acknowledge the contributions of the
of relevant instances. entire I.T. department and Sardar Patel Institute of Technology
along with its staff for their work.
The F1 score is a measure of a test's accuracy. It considers
both the precision and the recall of the test to compute the
score.[20]
REFERENCES
[1] O. Arino, P. Bicheron, F. Achard, J. Latham, R. Witt, J.-L. Weber,
"GLOBCOVER—The most detailed portrait of Earth", European Space
Agency ESA Bulletin, pp. 24-31, 2008
[2] M. C. Hansen, T. R. Loveland, "A review of large area monitoring of
land cover change using Landsat data", Remote Sens. Environ., 2012.
[3] Yang, Yi, and Shawn Newsam. "Geographic image retrieval using local
invariant features." IEEE Transactions on Geoscience and Remote
Sensing51.2 (2013): 818-832.
So for example, Precision for Commercial is ¼=0.25 as a
number of correct relevant instances are 1 and total instances [4] Comber, A., Fisher, P., & Wadsworth, R. (2005). What is land cover?
retrieved are 4. The Recall for Commercial is ½=0.50 as a Environment and Planning B: Planning and Design, 32(2), 199–209.
number of correct relevant instances are 1 and number of https://doi.org/10.1068/b31135
relevant instances are 2. The using the above formula we can [5] Cihlar, J., & Jansen, L. (2001). From Land Cover to Land Use: A
find the F1 score. Support is 2 as the number of relevant Methodology for Efficient Land Use Mapping over Large Areas. The
instances is 2. Professional Geographer, 53(2), 275–289. https://doi.org/10.1111/0033-
0124.00285
Similarly, we do the same for the rest and find the average
[6] D. Hoiem, A. A. Efros, M. Hebert, "Recovering surface layout from an
precision, recall, F1 score and support which is 0.71, 0.60,
image", Int. J. Comput. Vis., vol. 75, no. 1, pp. 151-172, Oct. 2007.
0.62 and 20 respectively. This helps us in knowing that the
Land-Use Map that we have obtained by masking the Land- [7] J. Feranec, G. Hazeu, S. Christensen, G. Jaffrain, "Corine land cover
Cover Map gives us fairly accurate results. This will help us change detection in Europe (case studies of the Netherlands and
correctly classify land based on its use. [23] Slovakia)", Land Use Policy, vol. 24, pp. 234-247, 2007.
[8] RIDD, M. K. (1995). Exploring a V-I-S (vegetation-impervious surface-
soil) model for urban ecosystem analysis through remote sensing:
comparative anatomy for cities†. International Journal of Remote
IV. CONCLUSION
Sensing, 16(12), 2165–2185.
The ways in which municipalities, states, and nations plan https://doi.org/10.1080/01431169508954549
the physical arrangement or land use of our communities is [9] J. Knorn, A. Rabe, V. C. Radeloff, T. Kuemmerle, J. Kozak, P. Hostert,
critical to sustainability. The land use patterns, which are "Land cover mapping of large areas using chain classification of
shared by cities across the world have given rise to complex neighboring Landsat satellite images", Remote Sens. Environ., vol. 113,
problems created by urban sprawl faced by all—growing pp. 957-964, 2009.
traffic congestion and lengthening commute times, air
[10] W. B. Cohen, S. N. Goward, "Landsat's role in ecological applications of
pollution, wasteful energy consumption and greater reliance
remote sensing", Bioscience, vol. 54, pp. 535-545, 2004.
on petroleum, elimination of open space and wildlife habitat,

978-1-5386-0965-1/18/$31.00 ©2018 IEEE 1069


Proceedings of the 2nd International conference on Electronics, Communication and Aerospace Technology (ICECA 2018)
IEEE Conference Record # 42487; IEEE Xplore ISBN:978-1-5386-0965-1

[11] Griffiths, Patrick, et al. "A pixel-based Landsat compositing algorithm


for large area land cover mapping." IEEE Journal of Selected Topics in
Applied Earth Observations and Remote Sensing 6.5 (2013): 2088-2101.
[12] M. Baumann, M. Ozdogan, T. Kuemmerle, K. J. Wendland, E. Esipova,
V. C. Radeloff, "Using the Landsat record to detect forest-cover changes
during and after the collapse of the Soviet Union in the temperate zone
of European Russia", Remote Sens. Environ., vol. 124, pp. 174-184,
2012..
[13] J. C.-W. Chan, D. Paelinckx, "Evaluation of random forest and adaboost
tree-based ensemble classification and spectral band selection for
ecotope mapping using airborne hyperspectral imagery", Remote Sens.
Environ., vol. 112, no. 6, pp. 2999-3011, Jun. 2008.
[14] T. Hastie, R. Tibshirani, J. Friedman, The Elements of Statistical
Learning, New York:Springer-Verlag, 2009.
[15] C. E. Woodcock, S. A. Macomber, M. Pax-Lenney, W. B. Cohen,
"Monitoring large areas for forest change using Landsat: Generalization
across space time and Landsat sensors", Remote Sens. Environ., vol. 78,
pp. 194-203, 2001.
[16] Senthilnath, J., et al. "Hierarchical clustering algorithm for land cover
mapping using satellite images." IEEE journal of selected topics in
applied earth observations and remote sensing 5.3 (2012): 762-768.
[17] S. Z. Li, Markov Random Field Modeling in Image Analysis, New
York:Springer-Verlag, 2009.
[18] C. Tomasi, R. Manduchi, "Bilateral filtering for gray and color images",
Proc. 6th Int. Conf. Computer Vis., pp. 839-846, 1998.
[19] T. M. Lillesand, R. W. Kiefer, J. W. Chipman, Remote Sensing and
Image Interpretation, NJ, Hoboken:Wiley, 2003.
[20] Schindler, Konrad. "An overview and comparison of smooth labeling
methods for land-cover classification." IEEE transactions on geoscience
and remote sensing 50.11 (2012): 4534-4545.
[21] P. Coppin, I. Jonckheere, K. Nackaerts, B. Muys, E. Lambin, "Digital
change detection methods in ecosystem monitoring: A review", Int. J.
Remote Sens., vol. 25, pp. 1565-1596, 2004.
[22] B. Sirmaek, C. Unsalan, "Urban-area and building detection using sift
keypoints and graph theory", IEEE Trans. Geosci. Remote Sens., vol.
47, no. 4, pp. 1156-1167, Apr. 2009.
[23] Storper, M., & Scott, A. J. (2016). Current debates in urban theory: A
critical assessment. Urban Studies, 53(6), 1114–1136.
https://doi.org/10.1177/0042098016634002

978-1-5386-0965-1/18/$31.00 ©2018 IEEE 1070

You might also like