Evaluation of the effectiveness of clustering algorithm in retrieving Malay documents / Aminah Mahmood

In recent years, we have witnessed a tremendous growth in the volume of text documents available on the Internet, digital libraries, new sources and company-wide intranets. This has led to an increased interest in developing methods that can help users to effectively navigate, summarize and organ...

Full description

Saved in:
Bibliographic Details
Main Author: Mahmood, Aminah
Format: Thesis
Language:English
Published: 2004
Online Access:http://ir.uitm.edu.my/id/eprint/1370/1/TB_AMINAH%20MAHMOOD%20AP%2004_5%20P01.pdf
http://ir.uitm.edu.my/id/eprint/1370/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.uitm.ir.1370
record_format eprints
spelling my.uitm.ir.13702019-03-22T06:58:59Z http://ir.uitm.edu.my/id/eprint/1370/ Evaluation of the effectiveness of clustering algorithm in retrieving Malay documents / Aminah Mahmood Mahmood, Aminah In recent years, we have witnessed a tremendous growth in the volume of text documents available on the Internet, digital libraries, new sources and company-wide intranets. This has led to an increased interest in developing methods that can help users to effectively navigate, summarize and organize this information with the ultimate goal of helping them to find what they are looking for. The main issue in this information age is the efficiency and effectiveness of the retrieval system that can be used by the information provider. A good retrieval system should provide tools to perform searching accurately based on user requirements. Cluster analysis is a technique for multivariate analysis that assigns items to automatically created group based on a calculation of the degrees of association between items and groups. In the information retrieval (IR) field, cluster analysis has been used to create groups of documents with the goal of improving the efficiency and effectiveness of retrieval, or to determine the structure of the literature of a field. The IR community has explored docimient clustering as an alternative method of organizing retrieval results, but clustering has yet to be deployed on the major search engines. This study has evaluated and identified the effectiveness of clustering algorithm in Malay document retrieval system using Hadith test collections, which consists of Hadith documents, relevant judgments and one set of queries. Three types of experiments are conducted. First experiment use exact match, which is no method, is applied. Second experiment use stenmiing method. Finally, the last experiment uses combination of stemming and clustering methods. 2004 Thesis NonPeerReviewed text en http://ir.uitm.edu.my/id/eprint/1370/1/TB_AMINAH%20MAHMOOD%20AP%2004_5%20P01.pdf Mahmood, Aminah (2004) Evaluation of the effectiveness of clustering algorithm in retrieving Malay documents / Aminah Mahmood. Degree thesis, Universiti Teknologi MARA.
institution Universiti Teknologi Mara
building Tun Abdul Razak Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Mara
content_source UiTM Institutional Repository
url_provider http://ir.uitm.edu.my/
language English
description In recent years, we have witnessed a tremendous growth in the volume of text documents available on the Internet, digital libraries, new sources and company-wide intranets. This has led to an increased interest in developing methods that can help users to effectively navigate, summarize and organize this information with the ultimate goal of helping them to find what they are looking for. The main issue in this information age is the efficiency and effectiveness of the retrieval system that can be used by the information provider. A good retrieval system should provide tools to perform searching accurately based on user requirements. Cluster analysis is a technique for multivariate analysis that assigns items to automatically created group based on a calculation of the degrees of association between items and groups. In the information retrieval (IR) field, cluster analysis has been used to create groups of documents with the goal of improving the efficiency and effectiveness of retrieval, or to determine the structure of the literature of a field. The IR community has explored docimient clustering as an alternative method of organizing retrieval results, but clustering has yet to be deployed on the major search engines. This study has evaluated and identified the effectiveness of clustering algorithm in Malay document retrieval system using Hadith test collections, which consists of Hadith documents, relevant judgments and one set of queries. Three types of experiments are conducted. First experiment use exact match, which is no method, is applied. Second experiment use stenmiing method. Finally, the last experiment uses combination of stemming and clustering methods.
format Thesis
author Mahmood, Aminah
spellingShingle Mahmood, Aminah
Evaluation of the effectiveness of clustering algorithm in retrieving Malay documents / Aminah Mahmood
author_facet Mahmood, Aminah
author_sort Mahmood, Aminah
title Evaluation of the effectiveness of clustering algorithm in retrieving Malay documents / Aminah Mahmood
title_short Evaluation of the effectiveness of clustering algorithm in retrieving Malay documents / Aminah Mahmood
title_full Evaluation of the effectiveness of clustering algorithm in retrieving Malay documents / Aminah Mahmood
title_fullStr Evaluation of the effectiveness of clustering algorithm in retrieving Malay documents / Aminah Mahmood
title_full_unstemmed Evaluation of the effectiveness of clustering algorithm in retrieving Malay documents / Aminah Mahmood
title_sort evaluation of the effectiveness of clustering algorithm in retrieving malay documents / aminah mahmood
publishDate 2004
url http://ir.uitm.edu.my/id/eprint/1370/1/TB_AMINAH%20MAHMOOD%20AP%2004_5%20P01.pdf
http://ir.uitm.edu.my/id/eprint/1370/
_version_ 1685648138359013376
score 13.211869