Rotation Forest Ensemble Classifier to Improve the Cardiovascular Disease Risk Prediction Accuracy

Heart disease risk prediction is very important as it is one of the primary causes of sudden death in the world. Early-stage prediction can save the lives by undergoing appropriate diagnosis steps or making necessary changes in their lifestyles. Recent studies have focused on the use of data mining...

Full description

Saved in:
Bibliographic Details
Main Authors: Reddy, K.V.V., Elamvazuthi, I., Aziz, A.A., Paramasivam, S., Chua, H.N., Pranavanand, S.
Format: Conference or Workshop Item
Published: Institute of Electrical and Electronics Engineers Inc. 2021
Online Access:http://scholars.utp.edu.my/id/eprint/33452/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85126971116&doi=10.1109%2fNICS54270.2021.9701455&partnerID=40&md5=37e4cce6662becf0ffaf91d5fcea820c
Tags: Add Tag
No Tags, Be the first to tag this record!
id oai:scholars.utp.edu.my:33452
record_format eprints
spelling oai:scholars.utp.edu.my:334522022-12-28T08:22:07Z http://scholars.utp.edu.my/id/eprint/33452/ Rotation Forest Ensemble Classifier to Improve the Cardiovascular Disease Risk Prediction Accuracy Reddy, K.V.V. Elamvazuthi, I. Aziz, A.A. Paramasivam, S. Chua, H.N. Pranavanand, S. Heart disease risk prediction is very important as it is one of the primary causes of sudden death in the world. Early-stage prediction can save the lives by undergoing appropriate diagnosis steps or making necessary changes in their lifestyles. Recent studies have focused on the use of data mining and machine learning in the detection of diseases based on specific features of a person. The Rotation Forest, a tree-based ensemble classifier that uses Principal Component Analysis for feature extraction, is proposed to improve the prediction accuracy of heart disease risk. The Statlog heart dataset has been selected from the publicly available UCI machine learning repository in this research work. The dataset was trained with a Rotation Forest ensemble classifier with default base classifier J48, and then, Random Forest on full features and selected features obtained from One Rule and Support Vector Machines attribute evaluators. The performance of the Rotation Forest was compared with the standard machine learning classifiers, Naïve Bayes, Logistic Regression, Support Vector Machines, K-Nearest Neighbors, AdaBoostM1, and Bagging. The Rotation Forest algorithm with Random Forest provided the highest accuracy of 94.44 and area under the ROC curve 0.980 on selected features of the Statlog dataset from the One Rule method. © 2021 IEEE. Institute of Electrical and Electronics Engineers Inc. 2021 Conference or Workshop Item NonPeerReviewed Reddy, K.V.V. and Elamvazuthi, I. and Aziz, A.A. and Paramasivam, S. and Chua, H.N. and Pranavanand, S. (2021) Rotation Forest Ensemble Classifier to Improve the Cardiovascular Disease Risk Prediction Accuracy. In: UNSPECIFIED. https://www.scopus.com/inward/record.uri?eid=2-s2.0-85126971116&doi=10.1109%2fNICS54270.2021.9701455&partnerID=40&md5=37e4cce6662becf0ffaf91d5fcea820c
institution Universiti Teknologi Petronas
building UTP Resource Centre
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Petronas
content_source UTP Institutional Repository
url_provider http://eprints.utp.edu.my/
description Heart disease risk prediction is very important as it is one of the primary causes of sudden death in the world. Early-stage prediction can save the lives by undergoing appropriate diagnosis steps or making necessary changes in their lifestyles. Recent studies have focused on the use of data mining and machine learning in the detection of diseases based on specific features of a person. The Rotation Forest, a tree-based ensemble classifier that uses Principal Component Analysis for feature extraction, is proposed to improve the prediction accuracy of heart disease risk. The Statlog heart dataset has been selected from the publicly available UCI machine learning repository in this research work. The dataset was trained with a Rotation Forest ensemble classifier with default base classifier J48, and then, Random Forest on full features and selected features obtained from One Rule and Support Vector Machines attribute evaluators. The performance of the Rotation Forest was compared with the standard machine learning classifiers, Naïve Bayes, Logistic Regression, Support Vector Machines, K-Nearest Neighbors, AdaBoostM1, and Bagging. The Rotation Forest algorithm with Random Forest provided the highest accuracy of 94.44 and area under the ROC curve 0.980 on selected features of the Statlog dataset from the One Rule method. © 2021 IEEE.
format Conference or Workshop Item
author Reddy, K.V.V.
Elamvazuthi, I.
Aziz, A.A.
Paramasivam, S.
Chua, H.N.
Pranavanand, S.
spellingShingle Reddy, K.V.V.
Elamvazuthi, I.
Aziz, A.A.
Paramasivam, S.
Chua, H.N.
Pranavanand, S.
Rotation Forest Ensemble Classifier to Improve the Cardiovascular Disease Risk Prediction Accuracy
author_facet Reddy, K.V.V.
Elamvazuthi, I.
Aziz, A.A.
Paramasivam, S.
Chua, H.N.
Pranavanand, S.
author_sort Reddy, K.V.V.
title Rotation Forest Ensemble Classifier to Improve the Cardiovascular Disease Risk Prediction Accuracy
title_short Rotation Forest Ensemble Classifier to Improve the Cardiovascular Disease Risk Prediction Accuracy
title_full Rotation Forest Ensemble Classifier to Improve the Cardiovascular Disease Risk Prediction Accuracy
title_fullStr Rotation Forest Ensemble Classifier to Improve the Cardiovascular Disease Risk Prediction Accuracy
title_full_unstemmed Rotation Forest Ensemble Classifier to Improve the Cardiovascular Disease Risk Prediction Accuracy
title_sort rotation forest ensemble classifier to improve the cardiovascular disease risk prediction accuracy
publisher Institute of Electrical and Electronics Engineers Inc.
publishDate 2021
url http://scholars.utp.edu.my/id/eprint/33452/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85126971116&doi=10.1109%2fNICS54270.2021.9701455&partnerID=40&md5=37e4cce6662becf0ffaf91d5fcea820c
_version_ 1753790779498692608
score 13.223943