Gene selection and classification of microarray gene expression data based on a new adaptive L1-norm elastic net penalty
The removal of irrelevant and insignificant genes has always been a major step in microarray data analysis. The application of gene selection methods in biological datasets has greatly increased, supporting expert systems in cancer diagnostic capability with high classification accuracy. Penalized l...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier Ltd
2021
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/95607/1/MuhammadHisyamLee2021_GeneSelectionAndClassificationOfMicroarrayGene.pdf http://eprints.utm.my/id/eprint/95607/ http://dx.doi.org/10.1016/j.imu.2021.100622 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.utm.95607 |
---|---|
record_format |
eprints |
spelling |
my.utm.956072022-05-31T13:04:30Z http://eprints.utm.my/id/eprint/95607/ Gene selection and classification of microarray gene expression data based on a new adaptive L1-norm elastic net penalty Alharthi, Aiedh Mrisi Lee, Muhammad Hisyam Algamal, Zakariya Yahya QA Mathematics The removal of irrelevant and insignificant genes has always been a major step in microarray data analysis. The application of gene selection methods in biological datasets has greatly increased, supporting expert systems in cancer diagnostic capability with high classification accuracy. Penalized logistic regression (PLR) using the elastic net (EN) has been widely used in high-dimensional cancer classification in recent years to estimate the gene coefficients and perform gene selection simultaneously. However, the EN estimator does not satisfy the oracle properties. This paper proposes the PLR using the adaptive elastic net (AEN), abbreviated as PLRAEN, to address the inconsistency. Our method employs the ratio (BWR) as an initial weight inside the L1-norm of the EN model. Several experiments were performed on a simulation study for a different number of predictor variables, sample sizes, and correlation coefficients and also on three public gene expression datasets to evaluate the effectiveness. Experimental results demonstrate that the proposed method consistently outperforms two other contemporary penalized methods regarding classification accuracy and the number of selected genes. Therefore, we conclude that PLRAEN is a better method to implement gene selection in the high-dimensional cancer classification field. Elsevier Ltd 2021 Article PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/95607/1/MuhammadHisyamLee2021_GeneSelectionAndClassificationOfMicroarrayGene.pdf Alharthi, Aiedh Mrisi and Lee, Muhammad Hisyam and Algamal, Zakariya Yahya (2021) Gene selection and classification of microarray gene expression data based on a new adaptive L1-norm elastic net penalty. Informatics in Medicine Unlocked, 24 . p. 100622. ISSN 2352-9148 http://dx.doi.org/10.1016/j.imu.2021.100622 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
language |
English |
topic |
QA Mathematics |
spellingShingle |
QA Mathematics Alharthi, Aiedh Mrisi Lee, Muhammad Hisyam Algamal, Zakariya Yahya Gene selection and classification of microarray gene expression data based on a new adaptive L1-norm elastic net penalty |
description |
The removal of irrelevant and insignificant genes has always been a major step in microarray data analysis. The application of gene selection methods in biological datasets has greatly increased, supporting expert systems in cancer diagnostic capability with high classification accuracy. Penalized logistic regression (PLR) using the elastic net (EN) has been widely used in high-dimensional cancer classification in recent years to estimate the gene coefficients and perform gene selection simultaneously. However, the EN estimator does not satisfy the oracle properties. This paper proposes the PLR using the adaptive elastic net (AEN), abbreviated as PLRAEN, to address the inconsistency. Our method employs the ratio (BWR) as an initial weight inside the L1-norm of the EN model. Several experiments were performed on a simulation study for a different number of predictor variables, sample sizes, and correlation coefficients and also on three public gene expression datasets to evaluate the effectiveness. Experimental results demonstrate that the proposed method consistently outperforms two other contemporary penalized methods regarding classification accuracy and the number of selected genes. Therefore, we conclude that PLRAEN is a better method to implement gene selection in the high-dimensional cancer classification field. |
format |
Article |
author |
Alharthi, Aiedh Mrisi Lee, Muhammad Hisyam Algamal, Zakariya Yahya |
author_facet |
Alharthi, Aiedh Mrisi Lee, Muhammad Hisyam Algamal, Zakariya Yahya |
author_sort |
Alharthi, Aiedh Mrisi |
title |
Gene selection and classification of microarray gene expression data based on a new adaptive L1-norm elastic net penalty |
title_short |
Gene selection and classification of microarray gene expression data based on a new adaptive L1-norm elastic net penalty |
title_full |
Gene selection and classification of microarray gene expression data based on a new adaptive L1-norm elastic net penalty |
title_fullStr |
Gene selection and classification of microarray gene expression data based on a new adaptive L1-norm elastic net penalty |
title_full_unstemmed |
Gene selection and classification of microarray gene expression data based on a new adaptive L1-norm elastic net penalty |
title_sort |
gene selection and classification of microarray gene expression data based on a new adaptive l1-norm elastic net penalty |
publisher |
Elsevier Ltd |
publishDate |
2021 |
url |
http://eprints.utm.my/id/eprint/95607/1/MuhammadHisyamLee2021_GeneSelectionAndClassificationOfMicroarrayGene.pdf http://eprints.utm.my/id/eprint/95607/ http://dx.doi.org/10.1016/j.imu.2021.100622 |
_version_ |
1735386825088827392 |
score |
13.211869 |