Effective keyword query structuring using NER for XML retrieval

Bibliographic Details
Main Authors: Roko, Abubakar, Doraisamy, Shyamala, Jantan, Azrul Hazri, Azman, Azreen
Format: Article
Language: English
Published: Emerald Group Publishing 2015
Online Access: http://psasir.upm.edu.my/id/eprint/37324/1/Effective%20keyword%20query%20structuring%20using%20NER%20for%20XML%20retrieval.pdf
http://psasir.upm.edu.my/id/eprint/37324/
id my.upm.eprints.37324
record_format eprints
spelling my.upm.eprints.37324 2015-10-29T06:04:59Z http://psasir.upm.edu.my/id/eprint/37324/ Roko, Abubakar and Doraisamy, Shyamala and Jantan, Azrul Hazri and Azman, Azreen (2015) Effective keyword query structuring using NER for XML retrieval. International Journal of Web Information Systems, 11 (1). pp. 33-53. ISSN 1744-0084; ESSN: 1744-0092. DOI: 10.1108/IJWIS-06-2014-0022. Emerald Group Publishing 2015. Article, PeerReviewed, application/pdf, en. http://psasir.upm.edu.my/id/eprint/37324/1/Effective%20keyword%20query%20structuring%20using%20NER%20for%20XML%20retrieval.pdf
institution Universiti Putra Malaysia
building UPM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Putra Malaysia
content_source UPM Institutional Repository
url_provider http://psasir.upm.edu.my/
language English
description Purpose: Structured queries are a more effective way to search an XML database. However, expressing queries in a query language proves difficult for most users, since it requires learning the language and knowing the underlying data schema. On the other hand, the success of web search engines has made many users familiar with keyword search, and they therefore prefer a keyword search interface for searching XML data. The purpose of this paper is to propose and evaluate XKQSS, a query structuring method that shifts the task of generating structured queries from the user to the search engine while retaining the simple keyword search interface.
Design/methodology/approach: Existing query structuring approaches require users to provide structural hints in their input keyword queries even though their interface is keyword-based. Other problems with existing systems include their inability to take keyword query ambiguities into consideration during query structuring and the difficulty of selecting the generated structured query that best represents a given keyword query. To address these problems, this study allows users to submit a schema-independent keyword query, uses named entity recognition (NER) to categorize query keywords in order to resolve query ambiguities, and computes semantic information for a node from its data content. Algorithms are proposed that find user search intentions and convert them into a set of ranked structured queries.
Findings: Experiments with the Sigmod and IMDB datasets were conducted to evaluate the effectiveness of the method. The experimental results show that XKQSS is about 20% more effective in return node identification than XReal, a state-of-the-art system for XML retrieval.
Originality/value: Existing systems do not take keyword query ambiguities into account. XKQSS includes two NER-based guidelines that help resolve these ambiguities before the submitted query is converted. It also includes a ranking function that computes a score for each generated query using both semantic information and data statistics, as opposed to the data-statistics-only approach used by existing approaches.
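The general idea described in the abstract (categorize the keywords, generate candidate structured queries, rank them) can be illustrated with a minimal sketch. This is not the XKQSS algorithm from the paper: the gazetteer that stands in for NER, the `movie` schema paths, the numeric scores, the weight `alpha`, and the function names `categorize` and `structure` are all hypothetical and chosen only for illustration.

```python
from itertools import product

# Toy gazetteer standing in for a real NER component (assumption).
GAZETTEER = {"tom hanks": "person", "forrest gump": "title", "1994": "year"}

# Hypothetical schema knowledge: for each category, candidate element paths
# with a made-up semantic score and a made-up data statistic, both in [0, 1].
CANDIDATES = {
    "person": [("actor/name", 0.9, 0.7), ("director/name", 0.6, 0.2)],
    "title": [("title", 0.95, 0.9)],
    "year": [("year", 0.9, 0.8)],
}


def categorize(query):
    """Tag query keywords by greedy longest match against the gazetteer."""
    tokens = query.lower().split()
    tagged, i = [], 0
    while i < len(tokens):
        for j in range(len(tokens), i, -1):
            phrase = " ".join(tokens[i:j])
            if phrase in GAZETTEER:
                tagged.append((phrase, GAZETTEER[phrase]))
                i = j
                break
        else:
            # No phrase starting at token i is a known entity.
            tagged.append((tokens[i], "unknown"))
            i += 1
    return tagged


def structure(query, root="movie", alpha=0.5):
    """Return (score, xpath) pairs for the query, best first."""
    known = [(p, c) for p, c in categorize(query) if c in CANDIDATES]
    if not known:
        return []
    ranked = []
    # One candidate query per combination of element paths.
    for combo in product(*(CANDIDATES[c] for _, c in known)):
        score, preds = 1.0, []
        for (phrase, _), (path, sem, stat) in zip(known, combo):
            # Blend semantic information with the data statistic.
            score *= alpha * sem + (1 - alpha) * stat
            # Phrases are lowercased; XPath contains() is case-sensitive.
            preds.append(f'[{path}[contains(., "{phrase}")]]')
        ranked.append((score, f"//{root}" + "".join(preds)))
    return sorted(ranked, reverse=True)
```

For the keyword query `tom hanks 1994`, this sketch produces two candidate XPath expressions, one treating the person as an actor and one as a director, and ranks the actor interpretation first because both of its made-up scores are higher. Setting `alpha=0` reduces the ranking to a statistics-only score, which is the kind of baseline the abstract contrasts against.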
format Article
author Roko, Abubakar
Doraisamy, Shyamala
Jantan, Azrul Hazri
Azman, Azreen
title Effective keyword query structuring using NER for XML retrieval
publisher Emerald Group Publishing
publishDate 2015