A comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets

Identification of informative genes is essential for the disease and cancer studies. Metaheuristic algorithms have been widely used for this purpose. However, their performance on various high-dimensional datasets of genomic studies has not been fully addressed. This work was intended to perform a c...

Full description

Saved in:
Bibliographic Details
Main Authors: Hameed, Shilan S., Hassan, Wan Haslina, Abdul Latiff, Liza, Muhammad, Fahmi F.
Format: Article
Published: Springer Science and Business Media Deutschland GmbH 2021
Subjects:
Online Access:http://eprints.utm.my/id/eprint/94946/
http://dx.doi.org/10.1007/s00500-021-05726-0
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.94946
record_format eprints
spelling my.utm.949462022-04-29T22:32:30Z http://eprints.utm.my/id/eprint/94946/ A comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets Hameed, Shilan S. Hassan, Wan Haslina Abdul Latiff, Liza Muhammad, Fahmi F. T Technology (General) Identification of informative genes is essential for the disease and cancer studies. Metaheuristic algorithms have been widely used for this purpose. However, their performance on various high-dimensional datasets of genomic studies has not been fully addressed. This work was intended to perform a comprehensive comparative analysis on three well-known nature-inspired metaheuristic algorithms, namely binary particle swarm optimization (BPSO), genetic algorithm (GA) and cuckoo search algorithm (CS) when they are used in gene selection and classification in twelve high-dimensional cancer datasets. The methodology was carried out through the utilization of a three-phase hybrid approach, considering a pre-processing filtration using Pearson product-moment correlation coefficient (PPMCC) followed by the metaheuristic and classification algorithms. Comparably, five different classification algorithms were used in each phase of analysis. It was seen that the application of PCCMA filter has acted upon reducing the computational complexity of overall analysis. The comparative study showed that BPSO outperformed GA and CS in terms of accuracy. However, CS was able to select fewer attributed genes and was less computationally complex compared to that of GA and BPSO. Springer Science and Business Media Deutschland GmbH 2021 Article PeerReviewed Hameed, Shilan S. and Hassan, Wan Haslina and Abdul Latiff, Liza and Muhammad, Fahmi F. (2021) A comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets. Soft Computing, 25 (13). pp. 8683-8701. ISSN 1432-7643 http://dx.doi.org/10.1007/s00500-021-05726-0
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic T Technology (General)
spellingShingle T Technology (General)
Hameed, Shilan S.
Hassan, Wan Haslina
Abdul Latiff, Liza
Muhammad, Fahmi F.
A comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets
description Identification of informative genes is essential for the disease and cancer studies. Metaheuristic algorithms have been widely used for this purpose. However, their performance on various high-dimensional datasets of genomic studies has not been fully addressed. This work was intended to perform a comprehensive comparative analysis on three well-known nature-inspired metaheuristic algorithms, namely binary particle swarm optimization (BPSO), genetic algorithm (GA) and cuckoo search algorithm (CS) when they are used in gene selection and classification in twelve high-dimensional cancer datasets. The methodology was carried out through the utilization of a three-phase hybrid approach, considering a pre-processing filtration using Pearson product-moment correlation coefficient (PPMCC) followed by the metaheuristic and classification algorithms. Comparably, five different classification algorithms were used in each phase of analysis. It was seen that the application of PCCMA filter has acted upon reducing the computational complexity of overall analysis. The comparative study showed that BPSO outperformed GA and CS in terms of accuracy. However, CS was able to select fewer attributed genes and was less computationally complex compared to that of GA and BPSO.
format Article
author Hameed, Shilan S.
Hassan, Wan Haslina
Abdul Latiff, Liza
Muhammad, Fahmi F.
author_facet Hameed, Shilan S.
Hassan, Wan Haslina
Abdul Latiff, Liza
Muhammad, Fahmi F.
author_sort Hameed, Shilan S.
title A comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets
title_short A comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets
title_full A comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets
title_fullStr A comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets
title_full_unstemmed A comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets
title_sort comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets
publisher Springer Science and Business Media Deutschland GmbH
publishDate 2021
url http://eprints.utm.my/id/eprint/94946/
http://dx.doi.org/10.1007/s00500-021-05726-0
_version_ 1732945413388894208
score 13.211869