Replica Creation Algorithm for Data Grids

Data grid system is a data management infrastructure that facilitates reliable access and sharing of large amount of data, storage resources, and data transfer services that can be scaled across distributed locations. This thesis presents a new replication algorithm that improves data access perform...

Full description

Saved in:
Bibliographic Details
Main Author: Madi, Mohammed Kamel
Format: Thesis
Language:en
en
Published: 2012
Subjects:
Online Access:https://etd.uum.edu.my/3352/1/MOHAMMED_KAMEL_MADI.pdf
https://etd.uum.edu.my/3352/3/MOHAMMED_KAMEL_MADI.pdf
https://etd.uum.edu.my/3352/
http://sierra.uum.edu.my/record=b1239722~S1
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1833436117613936640
author Madi, Mohammed Kamel
author_facet Madi, Mohammed Kamel
author_sort Madi, Mohammed Kamel
building UUM Library
collection Institutional Repository
content_provider Universiti Utara Malaysia
content_source UUM Electronic Theses
continent Asia
country Malaysia
description Data grid system is a data management infrastructure that facilitates reliable access and sharing of large amount of data, storage resources, and data transfer services that can be scaled across distributed locations. This thesis presents a new replication algorithm that improves data access performance in data grids by distributing relevant data copies around the grid. The new Data Replica Creation Algorithm (DRCM) improves performance of data grid systems by reducing job execution time and making the best use of data grid resources (network bandwidth and storage space). Current algorithms focus on number of accesses in deciding which file to replicate and where to place them, which ignores resources’ capabilities. DRCM differs by considering both user and resource perspectives; strategically placing replicas at locations that provide the lowest transfer cost. The proposed algorithm uses three strategies: Replica Creation and Deletion Strategy (RCDS), Replica Placement Strategy (RPS), and Replica Replacement Strategy (RRS). DRCM was evaluated using network simulation (OptorSim) based on selected performance metrics (mean job execution time, efficient network usage, average storage usage, and computing element usage), scenarios, and topologies. Results revealed better job execution time with lower resource consumption than existing approaches. This research contributes replication strategies embodied in one algorithm that enhances data grid performance, capable of making a decision on creating or deleting more than one file during same decision. Furthermore, dependency-level-between-files criterion was utilized and integrated with the exponential growth/decay model to give an accurate file evaluation.
format Thesis
id my.uum.etd-3352
institution Universiti Utara Malaysia
language en
en
publishDate 2012
record_format eprints
spelling my.uum.etd-33522023-03-01T01:27:46Z https://etd.uum.edu.my/3352/ Replica Creation Algorithm for Data Grids Madi, Mohammed Kamel QA71-90 Instruments and machines Data grid system is a data management infrastructure that facilitates reliable access and sharing of large amount of data, storage resources, and data transfer services that can be scaled across distributed locations. This thesis presents a new replication algorithm that improves data access performance in data grids by distributing relevant data copies around the grid. The new Data Replica Creation Algorithm (DRCM) improves performance of data grid systems by reducing job execution time and making the best use of data grid resources (network bandwidth and storage space). Current algorithms focus on number of accesses in deciding which file to replicate and where to place them, which ignores resources’ capabilities. DRCM differs by considering both user and resource perspectives; strategically placing replicas at locations that provide the lowest transfer cost. The proposed algorithm uses three strategies: Replica Creation and Deletion Strategy (RCDS), Replica Placement Strategy (RPS), and Replica Replacement Strategy (RRS). DRCM was evaluated using network simulation (OptorSim) based on selected performance metrics (mean job execution time, efficient network usage, average storage usage, and computing element usage), scenarios, and topologies. Results revealed better job execution time with lower resource consumption than existing approaches. This research contributes replication strategies embodied in one algorithm that enhances data grid performance, capable of making a decision on creating or deleting more than one file during same decision. Furthermore, dependency-level-between-files criterion was utilized and integrated with the exponential growth/decay model to give an accurate file evaluation. 2012 Thesis NonPeerReviewed text en https://etd.uum.edu.my/3352/1/MOHAMMED_KAMEL_MADI.pdf text en https://etd.uum.edu.my/3352/3/MOHAMMED_KAMEL_MADI.pdf Madi, Mohammed Kamel (2012) Replica Creation Algorithm for Data Grids. PhD. thesis, Universiti Utara Malaysia. http://sierra.uum.edu.my/record=b1239722~S1
spellingShingle QA71-90 Instruments and machines
Madi, Mohammed Kamel
Replica Creation Algorithm for Data Grids
title Replica Creation Algorithm for Data Grids
title_full Replica Creation Algorithm for Data Grids
title_fullStr Replica Creation Algorithm for Data Grids
title_full_unstemmed Replica Creation Algorithm for Data Grids
title_short Replica Creation Algorithm for Data Grids
title_sort replica creation algorithm for data grids
topic QA71-90 Instruments and machines
url https://etd.uum.edu.my/3352/1/MOHAMMED_KAMEL_MADI.pdf
https://etd.uum.edu.my/3352/3/MOHAMMED_KAMEL_MADI.pdf
https://etd.uum.edu.my/3352/
http://sierra.uum.edu.my/record=b1239722~S1
url_provider http://etd.uum.edu.my/