Development of a diversified ensemble data summarization (DDS) tool for learning medical data in a multi relational environment

Medical or scientific data are normally stored in relational databases in which data are stored in multiple tables. A data summarization approach to knowledge discovery in structured medical datasets is often limited due to the complexity of the database schema. Since most of these data are stored i...

Full description

Saved in:
Bibliographic Details
Main Authors: Rayner Alfred, Leau, Yu Beng, Tan, Soo Fun
Format: Research Report
Language:en
Published: Universiti Malaysia Sabah 2012
Subjects:
Online Access:https://eprints.ums.edu.my/id/eprint/24736/1/Development%20of%20a%20diversified%20ensemble%20data%20summarization%20%28DDS%29%20tool%20for%20learning%20medical%20data%20in%20a%20multi%20relational%20environment.pdf
https://eprints.ums.edu.my/id/eprint/24736/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1831793626634518528
author Rayner Alfred
Leau, Yu Beng
Tan, Soo Fun
author_facet Rayner Alfred
Leau, Yu Beng
Tan, Soo Fun
author_sort Rayner Alfred
building UMS Library
collection Institutional Repository
content_provider Universiti Malaysia Sabah
content_source UMS Institutional Repository
continent Asia
country Malaysia
description Medical or scientific data are normally stored in relational databases in which data are stored in multiple tables. A data summarization approach to knowledge discovery in structured medical datasets is often limited due to the complexity of the database schema. Since most of these data are stored in multiple tables, designing a suitable data summarization method for each individual table that is associated with the target table is required in order to get the best result in summarizing the overall data stored in a multi-relational environment. A diversified data summarization ensemble method is best applied in the task of learning data stored in multiple tables since ensemble methods improve quality and robustness of the results. This research investigates the feasibility of combining a few types of data summarization methods ( e.g., DARA) in order to learn data stored in relational databases with high cardinality attributes (one-to-many relations between entities). The proposed algorithm is called a diversified data summarization ensemble method. With this new algorithm, one could facilitate the task of data modelling for data stored in a multi-relational setting by improving the predictive accuracy of the data modelling task. This can be achieved by summarizing each table that exists in the database by using a more appropriate data summarization method depending on the type of data stored in each individual table. This research helps the understandi'ng and development of a diversified data summarization ensemble method that is able to summarize relational data. By applying a subset of data summarization methods to summarize different sets of the relational datasets, more interpretable and useful information can be extracted.
format Research Report
id my.ums.eprints-24736
institution Universiti Malaysia Sabah
language en
publishDate 2012
publisher Universiti Malaysia Sabah
record_format eprints
spelling my.ums.eprints-247362020-01-29T02:47:51Z https://eprints.ums.edu.my/id/eprint/24736/ Development of a diversified ensemble data summarization (DDS) tool for learning medical data in a multi relational environment Rayner Alfred Leau, Yu Beng Tan, Soo Fun QA Mathematics Medical or scientific data are normally stored in relational databases in which data are stored in multiple tables. A data summarization approach to knowledge discovery in structured medical datasets is often limited due to the complexity of the database schema. Since most of these data are stored in multiple tables, designing a suitable data summarization method for each individual table that is associated with the target table is required in order to get the best result in summarizing the overall data stored in a multi-relational environment. A diversified data summarization ensemble method is best applied in the task of learning data stored in multiple tables since ensemble methods improve quality and robustness of the results. This research investigates the feasibility of combining a few types of data summarization methods ( e.g., DARA) in order to learn data stored in relational databases with high cardinality attributes (one-to-many relations between entities). The proposed algorithm is called a diversified data summarization ensemble method. With this new algorithm, one could facilitate the task of data modelling for data stored in a multi-relational setting by improving the predictive accuracy of the data modelling task. This can be achieved by summarizing each table that exists in the database by using a more appropriate data summarization method depending on the type of data stored in each individual table. This research helps the understandi'ng and development of a diversified data summarization ensemble method that is able to summarize relational data. By applying a subset of data summarization methods to summarize different sets of the relational datasets, more interpretable and useful information can be extracted. Universiti Malaysia Sabah 2012 Research Report NonPeerReviewed text en https://eprints.ums.edu.my/id/eprint/24736/1/Development%20of%20a%20diversified%20ensemble%20data%20summarization%20%28DDS%29%20tool%20for%20learning%20medical%20data%20in%20a%20multi%20relational%20environment.pdf Rayner Alfred and Leau, Yu Beng and Tan, Soo Fun (2012) Development of a diversified ensemble data summarization (DDS) tool for learning medical data in a multi relational environment. (Unpublished)
spellingShingle QA Mathematics
Rayner Alfred
Leau, Yu Beng
Tan, Soo Fun
Development of a diversified ensemble data summarization (DDS) tool for learning medical data in a multi relational environment
title Development of a diversified ensemble data summarization (DDS) tool for learning medical data in a multi relational environment
title_full Development of a diversified ensemble data summarization (DDS) tool for learning medical data in a multi relational environment
title_fullStr Development of a diversified ensemble data summarization (DDS) tool for learning medical data in a multi relational environment
title_full_unstemmed Development of a diversified ensemble data summarization (DDS) tool for learning medical data in a multi relational environment
title_short Development of a diversified ensemble data summarization (DDS) tool for learning medical data in a multi relational environment
title_sort development of a diversified ensemble data summarization (dds) tool for learning medical data in a multi relational environment
topic QA Mathematics
url https://eprints.ums.edu.my/id/eprint/24736/1/Development%20of%20a%20diversified%20ensemble%20data%20summarization%20%28DDS%29%20tool%20for%20learning%20medical%20data%20in%20a%20multi%20relational%20environment.pdf
https://eprints.ums.edu.my/id/eprint/24736/
url_provider http://eprints.ums.edu.my/