Corpus Analysis: A Case Study on Kadazandusun Newspaper Archive

This paper presents the analysis of text data acquired from News Sabah Times, which is the only Sabah's newspaper that has a section for Kadazandusun news. Currently, there is no text translation tool available to translate a sentence or large text specifically from Kadazandusun to other langua...

Full description

Saved in:
Bibliographic Details
Main Authors: Mohd Shamrie Sainin, Asni Tahir, Suraya Alias
Format: Article
Language:English
Published: 2020
Subjects:
Online Access:https://eprints.ums.edu.my/id/eprint/25717/1/Corpus%20Analysis%20A%20Case%20Study%20on%20Kadazandusun%20Newspaper%20Archive.pdf
https://eprints.ums.edu.my/id/eprint/25717/
https://doi.org/10.1109/ICETAS48360.2019.9117514
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.ums.eprints.25717
record_format eprints
spelling my.ums.eprints.257172020-07-28T01:39:22Z https://eprints.ums.edu.my/id/eprint/25717/ Corpus Analysis: A Case Study on Kadazandusun Newspaper Archive Mohd Shamrie Sainin Asni Tahir Suraya Alias DS Asia PL Languages and literatures of Eastern Asia, Africa, Oceania This paper presents the analysis of text data acquired from News Sabah Times, which is the only Sabah's newspaper that has a section for Kadazandusun news. Currently, there is no text translation tool available to translate a sentence or large text specifically from Kadazandusun to other languages such as Bahasa Melayu. Thus, the first step is to develop such a system is to analyze the available corpus from the newspaper archive. The objective is to perform text analysis and then providing the possible ways of utilizing the knowledge. In this work, the purpose is to report the methodology and utilization of the fundamental corpus analysis related to text mining and not covering on the linguistics aspects and grammatical context. In addition, this paper also reports on the findings from the newspaper corpus analysis. 2020 Article PeerReviewed text en https://eprints.ums.edu.my/id/eprint/25717/1/Corpus%20Analysis%20A%20Case%20Study%20on%20Kadazandusun%20Newspaper%20Archive.pdf Mohd Shamrie Sainin and Asni Tahir and Suraya Alias (2020) Corpus Analysis: A Case Study on Kadazandusun Newspaper Archive. IEEE. https://doi.org/10.1109/ICETAS48360.2019.9117514
institution Universiti Malaysia Sabah
building UMS Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Sabah
content_source UMS Institutional Repository
url_provider http://eprints.ums.edu.my/
language English
topic DS Asia
PL Languages and literatures of Eastern Asia, Africa, Oceania
spellingShingle DS Asia
PL Languages and literatures of Eastern Asia, Africa, Oceania
Mohd Shamrie Sainin
Asni Tahir
Suraya Alias
Corpus Analysis: A Case Study on Kadazandusun Newspaper Archive
description This paper presents the analysis of text data acquired from News Sabah Times, which is the only Sabah's newspaper that has a section for Kadazandusun news. Currently, there is no text translation tool available to translate a sentence or large text specifically from Kadazandusun to other languages such as Bahasa Melayu. Thus, the first step is to develop such a system is to analyze the available corpus from the newspaper archive. The objective is to perform text analysis and then providing the possible ways of utilizing the knowledge. In this work, the purpose is to report the methodology and utilization of the fundamental corpus analysis related to text mining and not covering on the linguistics aspects and grammatical context. In addition, this paper also reports on the findings from the newspaper corpus analysis.
format Article
author Mohd Shamrie Sainin
Asni Tahir
Suraya Alias
author_facet Mohd Shamrie Sainin
Asni Tahir
Suraya Alias
author_sort Mohd Shamrie Sainin
title Corpus Analysis: A Case Study on Kadazandusun Newspaper Archive
title_short Corpus Analysis: A Case Study on Kadazandusun Newspaper Archive
title_full Corpus Analysis: A Case Study on Kadazandusun Newspaper Archive
title_fullStr Corpus Analysis: A Case Study on Kadazandusun Newspaper Archive
title_full_unstemmed Corpus Analysis: A Case Study on Kadazandusun Newspaper Archive
title_sort corpus analysis: a case study on kadazandusun newspaper archive
publishDate 2020
url https://eprints.ums.edu.my/id/eprint/25717/1/Corpus%20Analysis%20A%20Case%20Study%20on%20Kadazandusun%20Newspaper%20Archive.pdf
https://eprints.ums.edu.my/id/eprint/25717/
https://doi.org/10.1109/ICETAS48360.2019.9117514
_version_ 1760230404557635584
score 13.211869