Indexing strategy for big data processing: A case study of PingER

With the huge amount of data continuously accumulated and shared by individuals and organizations, it has become necessary to meet the emerging processing and retrieval requirements associated with these large volumes of complex data. This could be achieved by indexing the data sets and reducing he...

Full description

Saved in:

Bibliographic Details
Main Author:	Adamu, Fatima Binta
Format:	Thesis
Language:	en en
Published:	2015
Subjects:	QA299.6-433 Analysis
Online Access:	https://etd.uum.edu.my/5604/1/s817056_01.pdf https://etd.uum.edu.my/5604/2/s817056_02.pdf https://etd.uum.edu.my/5604/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1833436546671312896
author	Adamu, Fatima Binta
author_facet	Adamu, Fatima Binta
author_sort	Adamu, Fatima Binta
building	UUM Library
collection	Institutional Repository
content_provider	Universiti Utara Malaysia
content_source	UUM Electronic Theses
continent	Asia
country	Malaysia
description	With the huge amount of data continuously accumulated and shared by individuals and organizations, it has become necessary to meet the emerging processing and retrieval requirements associated with these large volumes of complex data. This could be achieved by indexing the data sets and reducing heavy computational overhead accustomed to most current indexing strategies during processing of very large amount of data sets. This study proposed a novel Indexing strategy called Big Data INDexing Strategy (BIND), using a concept of high performance parallel computing. BIND supports parallel distribution of data and performs processing in a MapReduce fashion. To develop BIND strategy, Ian foster’s task-scheduling concept for parallel processing is applied. The proposed indexing strategy was first tested on a 2-node cluster environment where varying sizes of datasets were used to note if the performance improves or declines as the size of the data increases. Subsequently, it was tested on a 3-node cluster to note the performance when the number of computation resources are increased. The results demonstrate that BIND minimizes the processing and query time as compared to the current strategy. The findings have significant implication in efficiently managing Big Data and facilitating data storage and information retrieval for users and organizations that manage Big Data.
format	Thesis
id	my.uum.etd-5604
institution	Universiti Utara Malaysia
language	en en
publishDate	2015
record_format	eprints
spelling	my.uum.etd-56042021-03-18T00:24:24Z https://etd.uum.edu.my/5604/ Indexing strategy for big data processing: A case study of PingER Adamu, Fatima Binta QA299.6-433 Analysis With the huge amount of data continuously accumulated and shared by individuals and organizations, it has become necessary to meet the emerging processing and retrieval requirements associated with these large volumes of complex data. This could be achieved by indexing the data sets and reducing heavy computational overhead accustomed to most current indexing strategies during processing of very large amount of data sets. This study proposed a novel Indexing strategy called Big Data INDexing Strategy (BIND), using a concept of high performance parallel computing. BIND supports parallel distribution of data and performs processing in a MapReduce fashion. To develop BIND strategy, Ian foster’s task-scheduling concept for parallel processing is applied. The proposed indexing strategy was first tested on a 2-node cluster environment where varying sizes of datasets were used to note if the performance improves or declines as the size of the data increases. Subsequently, it was tested on a 3-node cluster to note the performance when the number of computation resources are increased. The results demonstrate that BIND minimizes the processing and query time as compared to the current strategy. The findings have significant implication in efficiently managing Big Data and facilitating data storage and information retrieval for users and organizations that manage Big Data. 2015 Thesis NonPeerReviewed text en https://etd.uum.edu.my/5604/1/s817056_01.pdf text en https://etd.uum.edu.my/5604/2/s817056_02.pdf Adamu, Fatima Binta (2015) Indexing strategy for big data processing: A case study of PingER. Masters thesis, Universiti Utara Malaysia.
spellingShingle	QA299.6-433 Analysis Adamu, Fatima Binta Indexing strategy for big data processing: A case study of PingER
title	Indexing strategy for big data processing: A case study of PingER
title_full	Indexing strategy for big data processing: A case study of PingER
title_fullStr	Indexing strategy for big data processing: A case study of PingER
title_full_unstemmed	Indexing strategy for big data processing: A case study of PingER
title_short	Indexing strategy for big data processing: A case study of PingER
title_sort	indexing strategy for big data processing: a case study of pinger
topic	QA299.6-433 Analysis
url	https://etd.uum.edu.my/5604/1/s817056_01.pdf https://etd.uum.edu.my/5604/2/s817056_02.pdf https://etd.uum.edu.my/5604/
url_provider	http://etd.uum.edu.my/

Indexing strategy for big data processing: A case study of PingER

Similar Items