A comparative study for outlier detection techniques in data mining
Existing studies in data mining mostly focus on finding patterns in large datasets and further using it for organizational decision making. However, finding such exceptions and outliers has not yet received as much attention in the data mining field as some other topics have, such as association rul...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2006
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/7808/1/Mat_Deris_Mustafa_2006_Comparative_Study_Outlier_Detection_Techniques.pdf http://eprints.utm.my/id/eprint/7808/ http://dx.doi.org/10.1109/ICCIS.2006.252287 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.utm.7808 |
---|---|
record_format |
eprints |
spelling |
my.utm.78082017-08-30T01:32:48Z http://eprints.utm.my/id/eprint/7808/ A comparative study for outlier detection techniques in data mining Bakar, Zuriana Abu Mohemad, R. Ahmad, A. Mat Deris, Mustafa Mat QA75 Electronic computers. Computer science Existing studies in data mining mostly focus on finding patterns in large datasets and further using it for organizational decision making. However, finding such exceptions and outliers has not yet received as much attention in the data mining field as some other topics have, such as association rules, classification and clustering. Thus, this paper describes the performance of control chart, linear regression, and Manhattan distance techniques for outlier detection in data mining. Experimental studies show that outlier detection technique using control chart is better than the technique modeled from linear regression because the number of outlier data detected by control chart is smaller than linear regression. Further, experimental studies shows that Manhattan distance technique outperformed compared with the other techniques when the threshold values increased. 2006 Conference or Workshop Item PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/7808/1/Mat_Deris_Mustafa_2006_Comparative_Study_Outlier_Detection_Techniques.pdf Bakar, Zuriana Abu and Mohemad, R. and Ahmad, A. and Mat Deris, Mustafa Mat (2006) A comparative study for outlier detection techniques in data mining. In: 2006 IEEE Conference on Cybernetics and Intelligent Systems, 7-9 June 2006. http://dx.doi.org/10.1109/ICCIS.2006.252287 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
language |
English |
topic |
QA75 Electronic computers. Computer science |
spellingShingle |
QA75 Electronic computers. Computer science Bakar, Zuriana Abu Mohemad, R. Ahmad, A. Mat Deris, Mustafa Mat A comparative study for outlier detection techniques in data mining |
description |
Existing studies in data mining mostly focus on finding patterns in large datasets and further using it for organizational decision making. However, finding such exceptions and outliers has not yet received as much attention in the data mining field as some other topics have, such as association rules, classification and clustering. Thus, this paper describes the performance of control chart, linear regression, and Manhattan distance techniques for outlier detection in data mining. Experimental studies show that outlier detection technique using control chart is better than the technique modeled from linear regression because the number of outlier data detected by control chart is smaller than linear regression. Further, experimental studies shows that Manhattan distance technique outperformed compared with the other techniques when the threshold values increased. |
format |
Conference or Workshop Item |
author |
Bakar, Zuriana Abu Mohemad, R. Ahmad, A. Mat Deris, Mustafa Mat |
author_facet |
Bakar, Zuriana Abu Mohemad, R. Ahmad, A. Mat Deris, Mustafa Mat |
author_sort |
Bakar, Zuriana Abu |
title |
A comparative study for outlier detection techniques in data mining
|
title_short |
A comparative study for outlier detection techniques in data mining
|
title_full |
A comparative study for outlier detection techniques in data mining
|
title_fullStr |
A comparative study for outlier detection techniques in data mining
|
title_full_unstemmed |
A comparative study for outlier detection techniques in data mining
|
title_sort |
comparative study for outlier detection techniques in data mining |
publishDate |
2006 |
url |
http://eprints.utm.my/id/eprint/7808/1/Mat_Deris_Mustafa_2006_Comparative_Study_Outlier_Detection_Techniques.pdf http://eprints.utm.my/id/eprint/7808/ http://dx.doi.org/10.1109/ICCIS.2006.252287 |
_version_ |
1643644856781766656 |
score |
13.211869 |