Enhanced computational methods for detection and interpretation of heart disease based on ensemble learning and autoencoder framework / Abdallah Osama Hamdan Abdellatif

Heart disease remains the primary cause of mortality globally, and its early detection is critical for reducing mortality rates. However, the challenge of class imbalance and high dimensionality in clinical data significantly impedes the efficacy of Machine Learning (ML) models in this domain. This...

Full description

Saved in:
Bibliographic Details
Main Author: Abdalla Osama , Hamdan Abdellatif
Format: Thesis
Published: 2024
Subjects:
Online Access:http://studentsrepo.um.edu.my/15481/2/Abdallah_Osama.pdf
http://studentsrepo.um.edu.my/15481/1/Abdallah_Osama.pdf
http://studentsrepo.um.edu.my/15481/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1831436636959801344
author Abdalla Osama , Hamdan Abdellatif
author_facet Abdalla Osama , Hamdan Abdellatif
author_sort Abdalla Osama , Hamdan Abdellatif
building UM Library
collection Institutional Repository
content_provider Universiti Malaya
content_source UM Student Repository
continent Asia
country Malaysia
description Heart disease remains the primary cause of mortality globally, and its early detection is critical for reducing mortality rates. However, the challenge of class imbalance and high dimensionality in clinical data significantly impedes the efficacy of Machine Learning (ML) models in this domain. This thesis presents two innovative methods that holistically address these challenges at algorithmic and data levels to enhance heart disease detection. The first method introduces an Improved Weighted Random Forest (IWRF) approach, focusing on algorithmic innovation to tackle the imbalance problem. It employs supervised infinite feature selection (Inf-FSs) to identify significant features and Bayesian optimization for fine-tuning hyperparameters. Validated on Statlog and heart disease clinical records datasets, this method demonstrates a notable improvement in prediction accuracy and F-measure, outperforming existing models and marking an accuracy enhancement of 2.4% and 4.6% on these datasets. In contrast, the second method addresses the data-level imbalance through a novel framework named Conditional Autoencoder with Stack Predictor for Heart Disease (CAVE-SPFHD). This approach integrates a conditional variational autoencoder (CVAE) to effectively balance the dataset and a stack predictor (SPFHD) that utilizes tree-based ensemble learning algorithms. The base models' predictions are integrated using a support vector machine, significantly enhancing detection accuracy. Tested across four datasets, CAVE-SPFHD surpasses state-of-the-art methods in f1-score, providing improved not only predictive performance but also critical interpretative insights using the SHapley Additive explanation (SHAP) algorithm. Together, these two methods represent a comprehensive approach to heart disease detection in ML, effectively addressing the dual challenges of class imbalance and high dimensionality. By innovatively tackling these issues at both the algorithm and data levels, this thesis significantly contributes to the field, offering robust, accurate, and interpretable ML solutions for early heart disease detection, which is crucial for proactive healthcare interventions.
format Thesis
id my.um.stud-15481
institution Universiti Malaya
publishDate 2024
record_format eprints
spelling my.um.stud-154812024-11-05T21:55:55Z Enhanced computational methods for detection and interpretation of heart disease based on ensemble learning and autoencoder framework / Abdallah Osama Hamdan Abdellatif Abdalla Osama , Hamdan Abdellatif TK Electrical engineering. Electronics Nuclear engineering Heart disease remains the primary cause of mortality globally, and its early detection is critical for reducing mortality rates. However, the challenge of class imbalance and high dimensionality in clinical data significantly impedes the efficacy of Machine Learning (ML) models in this domain. This thesis presents two innovative methods that holistically address these challenges at algorithmic and data levels to enhance heart disease detection. The first method introduces an Improved Weighted Random Forest (IWRF) approach, focusing on algorithmic innovation to tackle the imbalance problem. It employs supervised infinite feature selection (Inf-FSs) to identify significant features and Bayesian optimization for fine-tuning hyperparameters. Validated on Statlog and heart disease clinical records datasets, this method demonstrates a notable improvement in prediction accuracy and F-measure, outperforming existing models and marking an accuracy enhancement of 2.4% and 4.6% on these datasets. In contrast, the second method addresses the data-level imbalance through a novel framework named Conditional Autoencoder with Stack Predictor for Heart Disease (CAVE-SPFHD). This approach integrates a conditional variational autoencoder (CVAE) to effectively balance the dataset and a stack predictor (SPFHD) that utilizes tree-based ensemble learning algorithms. The base models' predictions are integrated using a support vector machine, significantly enhancing detection accuracy. Tested across four datasets, CAVE-SPFHD surpasses state-of-the-art methods in f1-score, providing improved not only predictive performance but also critical interpretative insights using the SHapley Additive explanation (SHAP) algorithm. Together, these two methods represent a comprehensive approach to heart disease detection in ML, effectively addressing the dual challenges of class imbalance and high dimensionality. By innovatively tackling these issues at both the algorithm and data levels, this thesis significantly contributes to the field, offering robust, accurate, and interpretable ML solutions for early heart disease detection, which is crucial for proactive healthcare interventions. 2024-09 Thesis NonPeerReviewed application/pdf http://studentsrepo.um.edu.my/15481/2/Abdallah_Osama.pdf application/pdf http://studentsrepo.um.edu.my/15481/1/Abdallah_Osama.pdf Abdalla Osama , Hamdan Abdellatif (2024) Enhanced computational methods for detection and interpretation of heart disease based on ensemble learning and autoencoder framework / Abdallah Osama Hamdan Abdellatif. PhD thesis, Universiti Malaya. http://studentsrepo.um.edu.my/15481/
spellingShingle TK Electrical engineering. Electronics Nuclear engineering
Abdalla Osama , Hamdan Abdellatif
Enhanced computational methods for detection and interpretation of heart disease based on ensemble learning and autoencoder framework / Abdallah Osama Hamdan Abdellatif
title Enhanced computational methods for detection and interpretation of heart disease based on ensemble learning and autoencoder framework / Abdallah Osama Hamdan Abdellatif
title_full Enhanced computational methods for detection and interpretation of heart disease based on ensemble learning and autoencoder framework / Abdallah Osama Hamdan Abdellatif
title_fullStr Enhanced computational methods for detection and interpretation of heart disease based on ensemble learning and autoencoder framework / Abdallah Osama Hamdan Abdellatif
title_full_unstemmed Enhanced computational methods for detection and interpretation of heart disease based on ensemble learning and autoencoder framework / Abdallah Osama Hamdan Abdellatif
title_short Enhanced computational methods for detection and interpretation of heart disease based on ensemble learning and autoencoder framework / Abdallah Osama Hamdan Abdellatif
title_sort enhanced computational methods for detection and interpretation of heart disease based on ensemble learning and autoencoder framework / abdallah osama hamdan abdellatif
topic TK Electrical engineering. Electronics Nuclear engineering
url http://studentsrepo.um.edu.my/15481/2/Abdallah_Osama.pdf
http://studentsrepo.um.edu.my/15481/1/Abdallah_Osama.pdf
http://studentsrepo.um.edu.my/15481/
url_provider http://studentsrepo.um.edu.my/