Part-of-speech for old Malay manuscript corpus: A review
Research in Malay Part-of-Speech (POS) has increased considerably in the past few years.From the literature, POS are known as the first stage in automated text analysis and the development of language technologies can scarcely begun without this initial phase.Malay language can be written in Roman o...
Saved in:
Main Authors: | , , , |
---|---|
Other Authors: | |
Format: | Book Section |
Published: |
Springer Berlin Heidelberg
2013
|
Subjects: | |
Online Access: | http://repo.uum.edu.my/19095/ http://doi.org/10.1007/978-3-642-40567-9_5 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Research in Malay Part-of-Speech (POS) has increased considerably in the past few years.From the literature, POS are known as the first stage in automated text analysis and the development of language technologies can scarcely begun without this initial phase.Malay language can be written in Roman or Jawi.Three different spelling between Roman and Jawi make this study essential.In this paper, we highlighted the problem and issues related to Malay language, POS general framework, POS approaches and techniques.POS at basis was introduced to get information from Old Malay Manuscripts that contain important information in various spheres of knowledge.Promising result for the auto-tagging of Malay written in Jawi is expected. |
---|