Classification of Diabetes Mellitus using Ensemble Algorithms
Diabetes Mellitus (DM) is one of the most prevalent diseases in the world today which is associated by having high glucose levels in the body either due to inadequate production of insulin or the body cell's not responding towards the produced insulin. Data mining and machine learning technique...
Saved in:
Main Authors: | , , |
---|---|
Format: | Conference or Workshop Item |
Published: |
Institute of Electrical and Electronics Engineers Inc.
2021
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85124145391&doi=10.1109%2fICIAS49414.2021.9642508&partnerID=40&md5=1d27bf9ffd020cabf2625a9327eb2990 http://eprints.utp.edu.my/29205/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.utp.eprints.29205 |
---|---|
record_format |
eprints |
spelling |
my.utp.eprints.292052022-03-25T01:11:49Z Classification of Diabetes Mellitus using Ensemble Algorithms Noor, N.A.B.S. Elamvazuthi, I. Yahya, N. Diabetes Mellitus (DM) is one of the most prevalent diseases in the world today which is associated by having high glucose levels in the body either due to inadequate production of insulin or the body cell's not responding towards the produced insulin. Data mining and machine learning techniques can be extremely useful in classification of DM considering the need to have a shift from current traditional methods which use sharp needles to draw blood towards a non - invasive method. The objective of this study is to perform DM classification using various machine learning algorithms. In this paper, individual classifiers such as Support Vector Machine, Naïve Bayes, Bayes Net, Decision Stump, k - Nearest Neighbors, Logistic Regression, Multilayer Perceptron and Decision Tree are experimented. Apart from that, ensemble methods such as bagging, boosting, hybrid classifier using combinations of Random Forest with other base classifiers and ensemble algorithm which is the Random Forest has also been studied. Proposed DM classification model is chosen based on an optimized model reflected by their accuracy and performance of the model. In this research, it was found that performance of ensemble method using hybrid classifier of Random Forest - Bayes Net model has proven to be the best DM classification model with an accuracy of 83.91 and AUC of 0.904 using the Pima Indian Diabetes Dataset (PIDD). © 2021 IEEE. Institute of Electrical and Electronics Engineers Inc. 2021 Conference or Workshop Item NonPeerReviewed https://www.scopus.com/inward/record.uri?eid=2-s2.0-85124145391&doi=10.1109%2fICIAS49414.2021.9642508&partnerID=40&md5=1d27bf9ffd020cabf2625a9327eb2990 Noor, N.A.B.S. and Elamvazuthi, I. and Yahya, N. (2021) Classification of Diabetes Mellitus using Ensemble Algorithms. In: UNSPECIFIED. http://eprints.utp.edu.my/29205/ |
institution |
Universiti Teknologi Petronas |
building |
UTP Resource Centre |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Petronas |
content_source |
UTP Institutional Repository |
url_provider |
http://eprints.utp.edu.my/ |
description |
Diabetes Mellitus (DM) is one of the most prevalent diseases in the world today which is associated by having high glucose levels in the body either due to inadequate production of insulin or the body cell's not responding towards the produced insulin. Data mining and machine learning techniques can be extremely useful in classification of DM considering the need to have a shift from current traditional methods which use sharp needles to draw blood towards a non - invasive method. The objective of this study is to perform DM classification using various machine learning algorithms. In this paper, individual classifiers such as Support Vector Machine, Naïve Bayes, Bayes Net, Decision Stump, k - Nearest Neighbors, Logistic Regression, Multilayer Perceptron and Decision Tree are experimented. Apart from that, ensemble methods such as bagging, boosting, hybrid classifier using combinations of Random Forest with other base classifiers and ensemble algorithm which is the Random Forest has also been studied. Proposed DM classification model is chosen based on an optimized model reflected by their accuracy and performance of the model. In this research, it was found that performance of ensemble method using hybrid classifier of Random Forest - Bayes Net model has proven to be the best DM classification model with an accuracy of 83.91 and AUC of 0.904 using the Pima Indian Diabetes Dataset (PIDD). © 2021 IEEE. |
format |
Conference or Workshop Item |
author |
Noor, N.A.B.S. Elamvazuthi, I. Yahya, N. |
spellingShingle |
Noor, N.A.B.S. Elamvazuthi, I. Yahya, N. Classification of Diabetes Mellitus using Ensemble Algorithms |
author_facet |
Noor, N.A.B.S. Elamvazuthi, I. Yahya, N. |
author_sort |
Noor, N.A.B.S. |
title |
Classification of Diabetes Mellitus using Ensemble Algorithms |
title_short |
Classification of Diabetes Mellitus using Ensemble Algorithms |
title_full |
Classification of Diabetes Mellitus using Ensemble Algorithms |
title_fullStr |
Classification of Diabetes Mellitus using Ensemble Algorithms |
title_full_unstemmed |
Classification of Diabetes Mellitus using Ensemble Algorithms |
title_sort |
classification of diabetes mellitus using ensemble algorithms |
publisher |
Institute of Electrical and Electronics Engineers Inc. |
publishDate |
2021 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85124145391&doi=10.1109%2fICIAS49414.2021.9642508&partnerID=40&md5=1d27bf9ffd020cabf2625a9327eb2990 http://eprints.utp.edu.my/29205/ |
_version_ |
1738656932874420224 |
score |
13.211869 |