Mutable composite firefly algorithm for gene selection in microarray based cancer classification

Cancer classification is critical due to the strenuous effort required in cancer treatment and the rising cancer mortality rate. Recent trends with high throughput technologies have led to discoveries in terms of biomarkers that successfully contributed to cancerrelated issues. A computational appro...

Full description

Saved in:
Bibliographic Details
Main Author: Fajila, Mohamed Nisper Fathima
Format: Thesis
Language:en
Published: 2022
Subjects:
Online Access:https://etd.uum.edu.my/10171/1/s826643_01.pdf
https://etd.uum.edu.my/10171/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841486356403978240
author Fajila, Mohamed Nisper Fathima
author_facet Fajila, Mohamed Nisper Fathima
author_sort Fajila, Mohamed Nisper Fathima
building UUM Library
collection Institutional Repository
content_provider Universiti Utara Malaysia
content_source UUM Electronic Theses
continent Asia
country Malaysia
description Cancer classification is critical due to the strenuous effort required in cancer treatment and the rising cancer mortality rate. Recent trends with high throughput technologies have led to discoveries in terms of biomarkers that successfully contributed to cancerrelated issues. A computational approach for gene selection based on microarray data analysis has been applied in many cancer classification problems. However, the existing hybrid approaches with metaheuristic optimization algorithms in feature selection (specifically in gene selection) are not generalized enough to efficiently classify most cancer microarray data while maintaining a small set of genes. This leads to the classification accuracy and genes subset size problem. Hence, this study proposed to modify the Firefly Algorithm (FA) along with the Correlation-based Feature Selection (CFS) filter for the gene selection task. An improved FA was proposed to overcome FA slow convergence by generating mutable size solutions for the firefly population. In addition, a composite position update strategy was designed for the mutable size solutions. The proposed strategy was to balance FA exploration and exploitation in order to address the local optima problem. The proposed hybrid algorithm known as CFS-Mutable Composite Firefly Algorithm (CFS-MCFA) was evaluated on cancer microarray data for biomarker selection along with the deployment of Support Vector Machine (SVM) as the classifier. Evaluation was performed based on two metrics: classification accuracy and size of feature set. The results showed that the CFS-MCFA-SVM algorithm outperforms benchmark methods in terms of classification accuracy and genes subset size. In particular, 100 percent accuracy was achieved on all four datasets and with only a few biomarkers (between one and four). This result indicates that the proposed algorithm is one of the competitive alternatives in feature selection, which later contributes to the analysis of microarray data.
format Thesis
id my.uum.etd-10171
institution Universiti Utara Malaysia
language en
publishDate 2022
record_format eprints
spelling my.uum.etd-101712025-08-25T07:10:34Z https://etd.uum.edu.my/10171/ Mutable composite firefly algorithm for gene selection in microarray based cancer classification Fajila, Mohamed Nisper Fathima RC0254 Neoplasms. Tumors. Oncology (including Cancer) Cancer classification is critical due to the strenuous effort required in cancer treatment and the rising cancer mortality rate. Recent trends with high throughput technologies have led to discoveries in terms of biomarkers that successfully contributed to cancerrelated issues. A computational approach for gene selection based on microarray data analysis has been applied in many cancer classification problems. However, the existing hybrid approaches with metaheuristic optimization algorithms in feature selection (specifically in gene selection) are not generalized enough to efficiently classify most cancer microarray data while maintaining a small set of genes. This leads to the classification accuracy and genes subset size problem. Hence, this study proposed to modify the Firefly Algorithm (FA) along with the Correlation-based Feature Selection (CFS) filter for the gene selection task. An improved FA was proposed to overcome FA slow convergence by generating mutable size solutions for the firefly population. In addition, a composite position update strategy was designed for the mutable size solutions. The proposed strategy was to balance FA exploration and exploitation in order to address the local optima problem. The proposed hybrid algorithm known as CFS-Mutable Composite Firefly Algorithm (CFS-MCFA) was evaluated on cancer microarray data for biomarker selection along with the deployment of Support Vector Machine (SVM) as the classifier. Evaluation was performed based on two metrics: classification accuracy and size of feature set. The results showed that the CFS-MCFA-SVM algorithm outperforms benchmark methods in terms of classification accuracy and genes subset size. In particular, 100 percent accuracy was achieved on all four datasets and with only a few biomarkers (between one and four). This result indicates that the proposed algorithm is one of the competitive alternatives in feature selection, which later contributes to the analysis of microarray data. 2022 Thesis NonPeerReviewed text en https://etd.uum.edu.my/10171/1/s826643_01.pdf Fajila, Mohamed Nisper Fathima (2022) Mutable composite firefly algorithm for gene selection in microarray based cancer classification. Masters thesis, Universiti Utara Malaysia.
spellingShingle RC0254 Neoplasms. Tumors. Oncology (including Cancer)
Fajila, Mohamed Nisper Fathima
Mutable composite firefly algorithm for gene selection in microarray based cancer classification
title Mutable composite firefly algorithm for gene selection in microarray based cancer classification
title_full Mutable composite firefly algorithm for gene selection in microarray based cancer classification
title_fullStr Mutable composite firefly algorithm for gene selection in microarray based cancer classification
title_full_unstemmed Mutable composite firefly algorithm for gene selection in microarray based cancer classification
title_short Mutable composite firefly algorithm for gene selection in microarray based cancer classification
title_sort mutable composite firefly algorithm for gene selection in microarray based cancer classification
topic RC0254 Neoplasms. Tumors. Oncology (including Cancer)
url https://etd.uum.edu.my/10171/1/s826643_01.pdf
https://etd.uum.edu.my/10171/
url_provider http://etd.uum.edu.my/