Document clustering: comparison of ward's clustering and Kohonen network performance / Noor Suriana Abu Bakar and Nurul Nisa Mohd Nasir

Document clustering has been investigated for use in a number of different areas of information retrieval. The aimed of research in the field is to improve efficiency and effectiveness of retrieval. Since the clusters perform best quality, Hierarchical clustering is most commonly used in document cl...

Full description

Saved in:
Bibliographic Details
Main Authors: Abu Bakar, Noor Suriana, Mohd Nasir, Nurul Nisa
Format: Research Reports
Language:English
Published: 2010
Subjects:
Online Access:http://ir.uitm.edu.my/id/eprint/42623/1/42623.pdf
http://ir.uitm.edu.my/id/eprint/42623/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Document clustering has been investigated for use in a number of different areas of information retrieval. The aimed of research in the field is to improve efficiency and effectiveness of retrieval. Since the clusters perform best quality, Hierarchical clustering is most commonly used in document clustering. Recently, there exist researches that apply NN in IR. However, research in NN based document clustering still less frequent. Therefore, this study will apply hierarchical based document clustering and NN based document clustering in terms of suggestion supervisor and examiner for thesis. The results from these two techniques will then compare with manual system to find out whether hierarchical based or NN based performed better. The collection of theses will be used and employed the pre-processing including stop word removal and stemming further measure the document similarity before apply the clustering techniques. The result will give some insight whether NN is better for suggestion supervisor and examiner.