Representing semantics of text by acquiring its canonical form

Canonical form is a notion stating that related idea should have the same meaning representation. It is a notion that greatly simplifies task by dealing with a single meaning representation for a wide range of expression. The issue in text representation is to generate a formal approach of capturing...

Full description

Saved in:
Bibliographic Details
Main Authors: Taiye, Mohammed Ahmed, Kamaruddin, Siti Sakira, Ahmad, Farzana Kabir
Format: Article
Published: INSIGHT - Indonesian Society for Knowledge and Human Development 2017
Subjects:
Online Access:http://repo.uum.edu.my/25656/
http://doi.org/10.18517/ijaseit.7.3.2395
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.uum.repo.25656
record_format eprints
spelling my.uum.repo.256562019-02-24T07:57:57Z http://repo.uum.edu.my/25656/ Representing semantics of text by acquiring its canonical form Taiye, Mohammed Ahmed Kamaruddin, Siti Sakira Ahmad, Farzana Kabir QA75 Electronic computers. Computer science Canonical form is a notion stating that related idea should have the same meaning representation. It is a notion that greatly simplifies task by dealing with a single meaning representation for a wide range of expression. The issue in text representation is to generate a formal approach of capturing meaning or semantics in sentences. These issues include heterogeneity and inconsistency in text. Polysemous, synonymous, morphemes and homonymous word poses serious drawbacks when trying to capture senses in sentences. This calls for a need to capture and represent senses in order to resolve vagueness and improve understanding of senses in documents for knowledge creation purposes. We introduce a simple and straightforward method to capture canonical form of sentences. The proposed method first identifies the canonical forms using the Word Sense Disambiguation (WSD) technique and later applies the First Order Predicate Logic (FOPL) scheme to represent the identified canonical forms. We adopted two algorithms in WSD, which are Lesk and Selectional Preference Restriction. These algorithms concentrate mainly on disambiguating senses in words, phrases and sentences. Also we adopted the First order Predicate Logic scheme to analyse argument predicate in sentences, employing the consequence logic theorem to test for satisfiability, validity and completeness of information in sentences. INSIGHT - Indonesian Society for Knowledge and Human Development 2017 Article PeerReviewed Taiye, Mohammed Ahmed and Kamaruddin, Siti Sakira and Ahmad, Farzana Kabir (2017) Representing semantics of text by acquiring its canonical form. International Journal on Advanced Science, Engineering and Information Technology, 7 (3). pp. 808-814. ISSN 2088-5334 http://doi.org/10.18517/ijaseit.7.3.2395 doi:10.18517/ijaseit.7.3.2395
institution Universiti Utara Malaysia
building UUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Utara Malaysia
content_source UUM Institutionali Repository
url_provider http://repo.uum.edu.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Taiye, Mohammed Ahmed
Kamaruddin, Siti Sakira
Ahmad, Farzana Kabir
Representing semantics of text by acquiring its canonical form
description Canonical form is a notion stating that related idea should have the same meaning representation. It is a notion that greatly simplifies task by dealing with a single meaning representation for a wide range of expression. The issue in text representation is to generate a formal approach of capturing meaning or semantics in sentences. These issues include heterogeneity and inconsistency in text. Polysemous, synonymous, morphemes and homonymous word poses serious drawbacks when trying to capture senses in sentences. This calls for a need to capture and represent senses in order to resolve vagueness and improve understanding of senses in documents for knowledge creation purposes. We introduce a simple and straightforward method to capture canonical form of sentences. The proposed method first identifies the canonical forms using the Word Sense Disambiguation (WSD) technique and later applies the First Order Predicate Logic (FOPL) scheme to represent the identified canonical forms. We adopted two algorithms in WSD, which are Lesk and Selectional Preference Restriction. These algorithms concentrate mainly on disambiguating senses in words, phrases and sentences. Also we adopted the First order Predicate Logic scheme to analyse argument predicate in sentences, employing the consequence logic theorem to test for satisfiability, validity and completeness of information in sentences.
format Article
author Taiye, Mohammed Ahmed
Kamaruddin, Siti Sakira
Ahmad, Farzana Kabir
author_facet Taiye, Mohammed Ahmed
Kamaruddin, Siti Sakira
Ahmad, Farzana Kabir
author_sort Taiye, Mohammed Ahmed
title Representing semantics of text by acquiring its canonical form
title_short Representing semantics of text by acquiring its canonical form
title_full Representing semantics of text by acquiring its canonical form
title_fullStr Representing semantics of text by acquiring its canonical form
title_full_unstemmed Representing semantics of text by acquiring its canonical form
title_sort representing semantics of text by acquiring its canonical form
publisher INSIGHT - Indonesian Society for Knowledge and Human Development
publishDate 2017
url http://repo.uum.edu.my/25656/
http://doi.org/10.18517/ijaseit.7.3.2395
_version_ 1644284388205133824
score 13.211869