An extended ID3 decision tree algorithm for spatial data

Utilizing data mining tasks such as classification on spatial data is more complex than those on non-spatial data. It is because spatial data mining algorithms have to consider not only objects of interest itself but also neighbours of the objects in order to extract useful and interesting patterns....

Full description

Saved in:
Bibliographic Details
Main Authors: Sitanggang, Imas Sukaesih, Yaakob, Razali, Mustapha, Norwati, Nuruddin, Ahmad Ainuddin
Format: Conference or Workshop Item
Language:English
Published: IEEE 2011
Online Access:http://psasir.upm.edu.my/id/eprint/47782/1/An%20extended%20ID3%20decision%20tree%20algorithm%20for%20spatial%20data.pdf
http://psasir.upm.edu.my/id/eprint/47782/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.upm.eprints.47782
record_format eprints
spelling my.upm.eprints.477822016-07-15T05:24:29Z http://psasir.upm.edu.my/id/eprint/47782/ An extended ID3 decision tree algorithm for spatial data Sitanggang, Imas Sukaesih Yaakob, Razali Mustapha, Norwati Nuruddin, Ahmad Ainuddin Utilizing data mining tasks such as classification on spatial data is more complex than those on non-spatial data. It is because spatial data mining algorithms have to consider not only objects of interest itself but also neighbours of the objects in order to extract useful and interesting patterns. One of classification algorithms namely the ID3 algorithm which originally designed for a non-spatial dataset has been improved by other researchers in the previous work to construct a spatial decision tree from a spatial dataset containing polygon features only. The objective of this paper is to propose a new spatial decision tree algorithm based on the ID3 algorithm for discrete features represented in points, lines and polygons. As in the ID3 algorithm that use information gain in the attribute selection, the proposed algorithm uses the spatial information gain to choose the best splitting layer from a set of explanatory layers. The new formula for spatial information gain is proposed using spatial measures for point, line and polygon features. Empirical result demonstrates that the proposed algorithm can be used to join two spatial objects in constructing spatial decision trees on small spatial dataset. The proposed algorithm has been applied to the real spatial dataset consisting of point and polygon features. The result is a spatial decision tree with 138 leaves and the accuracy is 74.72%. IEEE 2011 Conference or Workshop Item PeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/47782/1/An%20extended%20ID3%20decision%20tree%20algorithm%20for%20spatial%20data.pdf Sitanggang, Imas Sukaesih and Yaakob, Razali and Mustapha, Norwati and Nuruddin, Ahmad Ainuddin (2011) An extended ID3 decision tree algorithm for spatial data. In: 2011 IEEE International Conference on Spatial Data Mining and Geographical Knowledge Services (ICSDM 2011), 29 June-1 July 2011, Fuzhou, China. (pp. 48-53). 10.1109/ICSDM.2011.5969003
institution Universiti Putra Malaysia
building UPM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Putra Malaysia
content_source UPM Institutional Repository
url_provider http://psasir.upm.edu.my/
language English
description Utilizing data mining tasks such as classification on spatial data is more complex than those on non-spatial data. It is because spatial data mining algorithms have to consider not only objects of interest itself but also neighbours of the objects in order to extract useful and interesting patterns. One of classification algorithms namely the ID3 algorithm which originally designed for a non-spatial dataset has been improved by other researchers in the previous work to construct a spatial decision tree from a spatial dataset containing polygon features only. The objective of this paper is to propose a new spatial decision tree algorithm based on the ID3 algorithm for discrete features represented in points, lines and polygons. As in the ID3 algorithm that use information gain in the attribute selection, the proposed algorithm uses the spatial information gain to choose the best splitting layer from a set of explanatory layers. The new formula for spatial information gain is proposed using spatial measures for point, line and polygon features. Empirical result demonstrates that the proposed algorithm can be used to join two spatial objects in constructing spatial decision trees on small spatial dataset. The proposed algorithm has been applied to the real spatial dataset consisting of point and polygon features. The result is a spatial decision tree with 138 leaves and the accuracy is 74.72%.
format Conference or Workshop Item
author Sitanggang, Imas Sukaesih
Yaakob, Razali
Mustapha, Norwati
Nuruddin, Ahmad Ainuddin
spellingShingle Sitanggang, Imas Sukaesih
Yaakob, Razali
Mustapha, Norwati
Nuruddin, Ahmad Ainuddin
An extended ID3 decision tree algorithm for spatial data
author_facet Sitanggang, Imas Sukaesih
Yaakob, Razali
Mustapha, Norwati
Nuruddin, Ahmad Ainuddin
author_sort Sitanggang, Imas Sukaesih
title An extended ID3 decision tree algorithm for spatial data
title_short An extended ID3 decision tree algorithm for spatial data
title_full An extended ID3 decision tree algorithm for spatial data
title_fullStr An extended ID3 decision tree algorithm for spatial data
title_full_unstemmed An extended ID3 decision tree algorithm for spatial data
title_sort extended id3 decision tree algorithm for spatial data
publisher IEEE
publishDate 2011
url http://psasir.upm.edu.my/id/eprint/47782/1/An%20extended%20ID3%20decision%20tree%20algorithm%20for%20spatial%20data.pdf
http://psasir.upm.edu.my/id/eprint/47782/
_version_ 1643833980998385664
score 13.211869