Adaptive learning for lemmatization in morphology analysis

Morphological analysis is used to study the internal structure words by reducing the number of vocabularies used while retaining the semantic meaning of the knowledge in NLP system. Most of the existing algorithms are focusing on stemmatization instead of lemmatization process. Even with technology...

Full description

Saved in:
Bibliographic Details
Main Authors: Ting, Mary, Abdul Kadir, Rabiah, Tengku Sembok, Tengku Mohd, Ahmad, Fatimah, Azman, Azreen
Format: Conference or Workshop Item
Published: Springer International Publishing 2014
Online Access:http://psasir.upm.edu.my/id/eprint/40308/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Morphological analysis is used to study the internal structure words by reducing the number of vocabularies used while retaining the semantic meaning of the knowledge in NLP system. Most of the existing algorithms are focusing on stemmatization instead of lemmatization process. Even with technology advancement, yet none of the available lemmatization algorithms able to produce 100 % accurate result. The base words produced by the current algorithm might be unusable as it alters the overall meaning it tried to represent, which will directly affect the outcome of NLP systems. This paper proposed a new method to handle lemmatization process during the morphological analysis. The method consists three layers of lemmatization process, which incorporate the used of Stanford parser API, WordNet database and adaptive learning technique. The lemmatized words yields from the proposed method are more accurate, thus it will improve the semantic knowledge represented and stored in the knowledge base.