Corpus Analysis: A Case Study on Kadazandusun Newspaper Archive
This paper presents the analysis of text data acquired from News Sabah Times, which is the only Sabah's newspaper that has a section for Kadazandusun news. Currently, there is no text translation tool available to translate a sentence or large text specifically from Kadazandusun to other langua...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
2020
|
Subjects: | |
Online Access: | https://eprints.ums.edu.my/id/eprint/25717/1/Corpus%20Analysis%20A%20Case%20Study%20on%20Kadazandusun%20Newspaper%20Archive.pdf https://eprints.ums.edu.my/id/eprint/25717/ https://doi.org/10.1109/ICETAS48360.2019.9117514 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.ums.eprints.25717 |
---|---|
record_format |
eprints |
spelling |
my.ums.eprints.257172020-07-28T01:39:22Z https://eprints.ums.edu.my/id/eprint/25717/ Corpus Analysis: A Case Study on Kadazandusun Newspaper Archive Mohd Shamrie Sainin Asni Tahir Suraya Alias DS Asia PL Languages and literatures of Eastern Asia, Africa, Oceania This paper presents the analysis of text data acquired from News Sabah Times, which is the only Sabah's newspaper that has a section for Kadazandusun news. Currently, there is no text translation tool available to translate a sentence or large text specifically from Kadazandusun to other languages such as Bahasa Melayu. Thus, the first step is to develop such a system is to analyze the available corpus from the newspaper archive. The objective is to perform text analysis and then providing the possible ways of utilizing the knowledge. In this work, the purpose is to report the methodology and utilization of the fundamental corpus analysis related to text mining and not covering on the linguistics aspects and grammatical context. In addition, this paper also reports on the findings from the newspaper corpus analysis. 2020 Article PeerReviewed text en https://eprints.ums.edu.my/id/eprint/25717/1/Corpus%20Analysis%20A%20Case%20Study%20on%20Kadazandusun%20Newspaper%20Archive.pdf Mohd Shamrie Sainin and Asni Tahir and Suraya Alias (2020) Corpus Analysis: A Case Study on Kadazandusun Newspaper Archive. IEEE. https://doi.org/10.1109/ICETAS48360.2019.9117514 |
institution |
Universiti Malaysia Sabah |
building |
UMS Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaysia Sabah |
content_source |
UMS Institutional Repository |
url_provider |
http://eprints.ums.edu.my/ |
language |
English |
topic |
DS Asia PL Languages and literatures of Eastern Asia, Africa, Oceania |
spellingShingle |
DS Asia PL Languages and literatures of Eastern Asia, Africa, Oceania Mohd Shamrie Sainin Asni Tahir Suraya Alias Corpus Analysis: A Case Study on Kadazandusun Newspaper Archive |
description |
This paper presents the analysis of text data acquired from News Sabah Times, which is the only Sabah's newspaper that has a section for Kadazandusun news. Currently, there is no text translation tool available to translate a sentence or large text specifically from Kadazandusun to other languages such as Bahasa Melayu. Thus, the first step is to develop such a system is to analyze the available corpus from the newspaper archive. The objective is to perform text analysis and then providing the possible ways of utilizing the knowledge. In this work, the purpose is to report the methodology and utilization of the fundamental corpus analysis related to text mining and not covering on the linguistics aspects and grammatical context. In addition, this paper also reports on the findings from the newspaper corpus analysis. |
format |
Article |
author |
Mohd Shamrie Sainin Asni Tahir Suraya Alias |
author_facet |
Mohd Shamrie Sainin Asni Tahir Suraya Alias |
author_sort |
Mohd Shamrie Sainin |
title |
Corpus Analysis: A Case Study on Kadazandusun Newspaper Archive |
title_short |
Corpus Analysis: A Case Study on Kadazandusun Newspaper Archive |
title_full |
Corpus Analysis: A Case Study on Kadazandusun Newspaper Archive |
title_fullStr |
Corpus Analysis: A Case Study on Kadazandusun Newspaper Archive |
title_full_unstemmed |
Corpus Analysis: A Case Study on Kadazandusun Newspaper Archive |
title_sort |
corpus analysis: a case study on kadazandusun newspaper archive |
publishDate |
2020 |
url |
https://eprints.ums.edu.my/id/eprint/25717/1/Corpus%20Analysis%20A%20Case%20Study%20on%20Kadazandusun%20Newspaper%20Archive.pdf https://eprints.ums.edu.my/id/eprint/25717/ https://doi.org/10.1109/ICETAS48360.2019.9117514 |
_version_ |
1760230404557635584 |
score |
13.211869 |