Issues in evaluating the retrieval performance of multiscript translation of Al-Quran
The main aim of this paper is to present the issues in evaluating the retrieval performance of the multi-script indexing of translated texts of al-Quran. Translations of al-Quran have played a major role in the recitation of al-Quran in its original texts and understanding through the translated w...
Main Authors: | Othman, Roslina; Abdul Wahid, Fauziah |
---|---|
Format: | Conference or Workshop Item |
Language: | English |
Published: | 2011 |
Subjects: | Z665 Library Science. Information Science |
Online Access: | http://irep.iium.edu.my/7481/1/Issues_of_ret_perf.pdf http://irep.iium.edu.my/7481/ http://kict.iium.edu.my/wcomlis2011/index.php?option=com_content&view=article&id=24&Itemid=471 |
id |
my.iium.irep.7481 |
---|---|
record_format |
dspace |
spelling |
my.iium.irep.7481 2011-11-21T13:06:38Z 2011-11-16 Conference or Workshop Item REM application/pdf en Othman, Roslina and Abdul Wahid, Fauziah (2011) Issues in evaluating the retrieval performance of multiscript translation of Al-Quran. In: 6th World Congress of Muslim Librarians and Information Scientists 2011 (WCOMLIS 2011), 16 - 17 November 2011, IIUM. (Unpublished) |
institution |
Universiti Islam Antarabangsa Malaysia |
building |
IIUM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
International Islamic University Malaysia |
content_source |
IIUM Repository (IREP) |
url_provider |
http://irep.iium.edu.my/ |
language |
English |
topic |
Z665 Library Science. Information Science |
description |
The main aim of this paper is to present the issues in evaluating the retrieval performance of the multi-script indexing of translated texts of al-Quran. Translations of al-Quran have played a major role among the public, both in the recitation of al-Quran in its original text and in understanding it through the translated words. Even in querying, non-Arabic speakers find the texts through the translated words in addition to topical search. In Cross-Language Information Retrieval research, transliteration is normally needed where terminology is absent; in this research, however, the transliterated version was meant for those able to read the older script in its own original translation. The Malay Roman script has its own version of the translation. The objectives are to examine the reported retrieval performance of these texts and to evaluate the retrieval performance of the translations available in two different scripts of one language, Malay Rumi and Malay Jawi, built upon the Pimpinan ar-Rahman version, Indri and Jawi software. The measures are recall, precision and overlap. Recall describes the performance in retrieving all relevant items, while precision describes the performance in rejecting non-relevant items. Overlap captures the retrieval of items common to both sub-collections. Queries are constructed from questions posed by newspaper readers, resulting in keywords with semantics in both scripts, while relevance judgments are made by a panel of experts based on answers to the questions. Findings based on recall, precision and overlap revealed major issues with standardized texts, translation and transliteration, text alignment, query construction, and question-answering relevance versus topical relevance. Indri's performance is not a major issue, while the Jawi software requires minor improvement. This paper contributes to the issues of handling test collections involving parallel corpora in the area of Cross-Language IR facing the Muslim world. |
format |
Conference or Workshop Item |
author |
Othman, Roslina; Abdul Wahid, Fauziah |
title |
Issues in evaluating the retrieval performance of multiscript translation of Al-Quran |
publishDate |
2011 |
url |
http://irep.iium.edu.my/7481/1/Issues_of_ret_perf.pdf http://irep.iium.edu.my/7481/ http://kict.iium.edu.my/wcomlis2011/index.php?option=com_content&view=article&id=24&Itemid=471 |
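
The three measures named in the description (recall, precision and overlap) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the record gives no formula for overlap, so the Jaccard-style ratio below (items common to both result sets divided by items in either) is an assumption, and the verse identifiers and result lists are hypothetical.

```python
def recall(retrieved, relevant):
    """Share of the relevant items that were retrieved."""
    relevant = set(relevant)
    if not relevant:
        return 0.0
    return len(set(retrieved) & relevant) / len(relevant)


def precision(retrieved, relevant):
    """Share of the retrieved items that are relevant."""
    retrieved = set(retrieved)
    if not retrieved:
        return 0.0
    return len(retrieved & set(relevant)) / len(retrieved)


def overlap(retrieved_a, retrieved_b):
    """Assumed definition: common items divided by items in either result set."""
    a, b = set(retrieved_a), set(retrieved_b)
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


# Hypothetical data: verse identifiers judged relevant for one query,
# and the results returned from the Rumi and Jawi sub-collections.
relevant = {"2:183", "2:184", "2:185", "2:187"}
rumi_hits = ["2:183", "2:184", "2:196", "2:185"]
jawi_hits = ["2:183", "2:185", "2:187"]

print("Rumi recall:", recall(rumi_hits, relevant))
print("Rumi precision:", precision(rumi_hits, relevant))
print("Jawi recall:", recall(jawi_hits, relevant))
print("Jawi precision:", precision(jawi_hits, relevant))
print("Rumi/Jawi overlap:", overlap(rumi_hits, jawi_hits))
```

Using sets means a verse retrieved twice is counted once. Other definitions of overlap, for example one normalized by the smaller result set, would fit the description equally well.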