Query translation for multilingual content with semantic technique

Cross-lingual information retrieval (CLIR) allows user query in a different language from the language of target resources. Thus, translation is the key element in the query processing. There are three translation approaches: query, document, or hybrid query-document. However, query translation is v...

Full description

Saved in:
Bibliographic Details
Main Authors: Norita Md Norwawi, Sundresan a/l Perumal, Emran Huda, Waka Jeng
Format: Article
Language:en
Published: Penerbit Universiti Kebangsaan Malaysia 2020
Online Access:http://journalarticle.ukm.my/15907/1/9.pdf
http://journalarticle.ukm.my/15907/
http://www.ukm.my/jsm/malay_journals/jilid49bil9_2020/KandunganJilid49Bil9_2020.html
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1834432774960840704
author Norita Md Norwawi,
Sundresan a/l Perumal,
Emran Huda,
Waka Jeng,
author_facet Norita Md Norwawi,
Sundresan a/l Perumal,
Emran Huda,
Waka Jeng,
author_sort Norita Md Norwawi,
building Tun Sri Lanang Library
collection Institutional Repository
content_provider Universiti Kebangsaan Malaysia
content_source UKM Journal Article Repository
continent Asia
country Malaysia
description Cross-lingual information retrieval (CLIR) allows user query in a different language from the language of target resources. Thus, translation is the key element in the query processing. There are three translation approaches: query, document, or hybrid query-document. However, query translation is very challenging due to the polysemy problem. Different linguistic nature of the languages will lead to ambiguity of meaning subsequently user’s true intention could be misinterpreted. This paper presents a semantic technique on query translation for a multilingual knowledge repository to improve the query processing. Offline translated documents or parallel corpora in English, Arabic, and Malay language including Jawi text was used as the data. Set of keywords were constructed preidentified by expert related to prophetic food. These keywords were annotated with the relevant Quranic verses, Hadith texts, Manuscript text images and scientific article determined by expert. The synonym and context-based translation was annotated together with the specific keyword. A query will do a three-way pattern match based on the keyword indexing list that link to the relevant documents. A one-stop knowledge repository on prophetic food was developed as a proof of concept using sources are from al-Quran, Hadith, classical manuscript, and scientific articles verified by experts to ensure the content authenticity and integrity.
format Article
id my-ukm.journal.15907
institution Universiti Kebangsaan Malaysia
language en
publishDate 2020
publisher Penerbit Universiti Kebangsaan Malaysia
record_format eprints
spelling my-ukm.journal.159072020-12-06T09:12:51Z http://journalarticle.ukm.my/15907/ Query translation for multilingual content with semantic technique Norita Md Norwawi, Sundresan a/l Perumal, Emran Huda, Waka Jeng, Cross-lingual information retrieval (CLIR) allows user query in a different language from the language of target resources. Thus, translation is the key element in the query processing. There are three translation approaches: query, document, or hybrid query-document. However, query translation is very challenging due to the polysemy problem. Different linguistic nature of the languages will lead to ambiguity of meaning subsequently user’s true intention could be misinterpreted. This paper presents a semantic technique on query translation for a multilingual knowledge repository to improve the query processing. Offline translated documents or parallel corpora in English, Arabic, and Malay language including Jawi text was used as the data. Set of keywords were constructed preidentified by expert related to prophetic food. These keywords were annotated with the relevant Quranic verses, Hadith texts, Manuscript text images and scientific article determined by expert. The synonym and context-based translation was annotated together with the specific keyword. A query will do a three-way pattern match based on the keyword indexing list that link to the relevant documents. A one-stop knowledge repository on prophetic food was developed as a proof of concept using sources are from al-Quran, Hadith, classical manuscript, and scientific articles verified by experts to ensure the content authenticity and integrity. Penerbit Universiti Kebangsaan Malaysia 2020-09 Article PeerReviewed application/pdf en http://journalarticle.ukm.my/15907/1/9.pdf Norita Md Norwawi, and Sundresan a/l Perumal, and Emran Huda, and Waka Jeng, (2020) Query translation for multilingual content with semantic technique. Sains Malaysiana, 49 (9). pp. 2113-2118. ISSN 0126-6039 http://www.ukm.my/jsm/malay_journals/jilid49bil9_2020/KandunganJilid49Bil9_2020.html
spellingShingle Norita Md Norwawi,
Sundresan a/l Perumal,
Emran Huda,
Waka Jeng,
Query translation for multilingual content with semantic technique
title Query translation for multilingual content with semantic technique
title_full Query translation for multilingual content with semantic technique
title_fullStr Query translation for multilingual content with semantic technique
title_full_unstemmed Query translation for multilingual content with semantic technique
title_short Query translation for multilingual content with semantic technique
title_sort query translation for multilingual content with semantic technique
url http://journalarticle.ukm.my/15907/1/9.pdf
http://journalarticle.ukm.my/15907/
http://www.ukm.my/jsm/malay_journals/jilid49bil9_2020/KandunganJilid49Bil9_2020.html
url_provider http://journalarticle.ukm.my/