Depth linear discrimination-oriented feature selection method based on adaptive sine cosine algorithm for software defect prediction

Software Defect Prediction (SDP) plays a vital role in the software development life cycle as it helps identify and fix software defects. However, predicting software defects with irrelevant features and overlapping classes is challenging and can lead to lengthy training and low model accuracy. To...

Full description

Saved in:
Bibliographic Details
Main Authors: Nasser, Abdullah, H.M. Ghanem, Waheed Ali, H.Y. Saad, Abdul-Malik, Hamed Abdul-Qawy, Antar Shaddad, A. Ghaleb, Sanaa A, Mohammed Alduais, Nayef Abdulwahab, Din, Fakhrud, Ghetas, Mohamed
Format: Article
Language:en
Published: Elsevier 2024
Subjects:
Online Access:http://eprints.uthm.edu.my/12384/1/J17840_db3d7dac97eb7dd5ebfb84944076dced.pdf
http://eprints.uthm.edu.my/12384/
https://doi.org/10.1016/j.eswa.2024.124266
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1833419779780640768
author Nasser, Abdullah
H.M. Ghanem, Waheed Ali
H.Y. Saad, Abdul-Malik
Hamed Abdul-Qawy, Antar Shaddad
A. Ghaleb, Sanaa A
Mohammed Alduais, Nayef Abdulwahab
Din, Fakhrud
Ghetas, Mohamed
author_facet Nasser, Abdullah
H.M. Ghanem, Waheed Ali
H.Y. Saad, Abdul-Malik
Hamed Abdul-Qawy, Antar Shaddad
A. Ghaleb, Sanaa A
Mohammed Alduais, Nayef Abdulwahab
Din, Fakhrud
Ghetas, Mohamed
author_sort Nasser, Abdullah
building UTHM Library
collection Institutional Repository
content_provider Universiti Tun Hussein Onn Malaysia
content_source UTHM Institutional Repository
continent Asia
country Malaysia
description Software Defect Prediction (SDP) plays a vital role in the software development life cycle as it helps identify and fix software defects. However, predicting software defects with irrelevant features and overlapping classes is challenging and can lead to lengthy training and low model accuracy. To address these challenges, this research introduces a novel Depth Linear Discrimination-Oriented Feature Selection Method based on Adaptive Sine Cosine Algorithm, named Depth Adaptive Sine Cosine Feature Selection (DASC-FS). DASC-FS integrates the Adaptive Sine Cosine Algorithm (ASCA) as a search algorithm to determine the relevant features and adopts Depth Linear Discriminant Analysis (D-LDA) to identify the discriminative features that maximize class sepa ration. The paper proposes ASCA which is a metaheuristic algorithm meticulously designed to enhance the search capabilities of the standard Sine Cosine Algorithm (SCA). Combining the simplicity of the SCA with the efficiency of multiple mutation operators inspired by Genetic Algorithms (GA), ASCA enhances the diversity of the solutions and imparts remarkable adaptability to various situations. Furthermore, this study introduces a novel linear discriminant method, called Depth Linear Discriminant Analysis (D-LDA) to enhance the robustness of the original LDA. D-LDA systematically integrates the matrix depth concept into LDA, offering a systematic approach to address the challenges associated with scatter matrix estimation. As matrix depth measures how central or deep a particular matrix is within a distribution with respect to different directions, it is an efficient tool for computing a robust scatter matrix estimator that can handle outliers and complex data structures. The experimental results showed that DASC-FS consistently obtains the highest accuracy compared to most existing methods by integrating ASCA and D-LDA, thereby considering both accuracy optimization and class separation. The results also show that the use of multiple mutation operators in ASCA improves the search process capa bilities. The results also show that the capacity of D-LDA to reduce data dimensionality and increase class separation yields highly competitive results compared to other LDAs. Finally, features related to code size and complexity have emerged as key factors for SDP because they consistently rank as important features across different classifiers and datasets. DASC-FS offers a valuable solution in domain knowledge for enhancing
format Article
id my.uthm.eprints-12384
institution Universiti Tun Hussein Onn Malaysia
language en
publishDate 2024
publisher Elsevier
record_format eprints
spelling my.uthm.eprints-123842025-05-05T05:01:38Z http://eprints.uthm.edu.my/12384/ Depth linear discrimination-oriented feature selection method based on adaptive sine cosine algorithm for software defect prediction Nasser, Abdullah H.M. Ghanem, Waheed Ali H.Y. Saad, Abdul-Malik Hamed Abdul-Qawy, Antar Shaddad A. Ghaleb, Sanaa A Mohammed Alduais, Nayef Abdulwahab Din, Fakhrud Ghetas, Mohamed QA Mathematics Software Defect Prediction (SDP) plays a vital role in the software development life cycle as it helps identify and fix software defects. However, predicting software defects with irrelevant features and overlapping classes is challenging and can lead to lengthy training and low model accuracy. To address these challenges, this research introduces a novel Depth Linear Discrimination-Oriented Feature Selection Method based on Adaptive Sine Cosine Algorithm, named Depth Adaptive Sine Cosine Feature Selection (DASC-FS). DASC-FS integrates the Adaptive Sine Cosine Algorithm (ASCA) as a search algorithm to determine the relevant features and adopts Depth Linear Discriminant Analysis (D-LDA) to identify the discriminative features that maximize class sepa ration. The paper proposes ASCA which is a metaheuristic algorithm meticulously designed to enhance the search capabilities of the standard Sine Cosine Algorithm (SCA). Combining the simplicity of the SCA with the efficiency of multiple mutation operators inspired by Genetic Algorithms (GA), ASCA enhances the diversity of the solutions and imparts remarkable adaptability to various situations. Furthermore, this study introduces a novel linear discriminant method, called Depth Linear Discriminant Analysis (D-LDA) to enhance the robustness of the original LDA. D-LDA systematically integrates the matrix depth concept into LDA, offering a systematic approach to address the challenges associated with scatter matrix estimation. As matrix depth measures how central or deep a particular matrix is within a distribution with respect to different directions, it is an efficient tool for computing a robust scatter matrix estimator that can handle outliers and complex data structures. The experimental results showed that DASC-FS consistently obtains the highest accuracy compared to most existing methods by integrating ASCA and D-LDA, thereby considering both accuracy optimization and class separation. The results also show that the use of multiple mutation operators in ASCA improves the search process capa bilities. The results also show that the capacity of D-LDA to reduce data dimensionality and increase class separation yields highly competitive results compared to other LDAs. Finally, features related to code size and complexity have emerged as key factors for SDP because they consistently rank as important features across different classifiers and datasets. DASC-FS offers a valuable solution in domain knowledge for enhancing Elsevier 2024 Article PeerReviewed text en http://eprints.uthm.edu.my/12384/1/J17840_db3d7dac97eb7dd5ebfb84944076dced.pdf Nasser, Abdullah and H.M. Ghanem, Waheed Ali and H.Y. Saad, Abdul-Malik and Hamed Abdul-Qawy, Antar Shaddad and A. Ghaleb, Sanaa A and Mohammed Alduais, Nayef Abdulwahab and Din, Fakhrud and Ghetas, Mohamed (2024) Depth linear discrimination-oriented feature selection method based on adaptive sine cosine algorithm for software defect prediction. Expert Systems With Applications, 253. pp. 1-27. https://doi.org/10.1016/j.eswa.2024.124266
spellingShingle QA Mathematics
Nasser, Abdullah
H.M. Ghanem, Waheed Ali
H.Y. Saad, Abdul-Malik
Hamed Abdul-Qawy, Antar Shaddad
A. Ghaleb, Sanaa A
Mohammed Alduais, Nayef Abdulwahab
Din, Fakhrud
Ghetas, Mohamed
Depth linear discrimination-oriented feature selection method based on adaptive sine cosine algorithm for software defect prediction
title Depth linear discrimination-oriented feature selection method based on adaptive sine cosine algorithm for software defect prediction
title_full Depth linear discrimination-oriented feature selection method based on adaptive sine cosine algorithm for software defect prediction
title_fullStr Depth linear discrimination-oriented feature selection method based on adaptive sine cosine algorithm for software defect prediction
title_full_unstemmed Depth linear discrimination-oriented feature selection method based on adaptive sine cosine algorithm for software defect prediction
title_short Depth linear discrimination-oriented feature selection method based on adaptive sine cosine algorithm for software defect prediction
title_sort depth linear discrimination-oriented feature selection method based on adaptive sine cosine algorithm for software defect prediction
topic QA Mathematics
url http://eprints.uthm.edu.my/12384/1/J17840_db3d7dac97eb7dd5ebfb84944076dced.pdf
http://eprints.uthm.edu.my/12384/
https://doi.org/10.1016/j.eswa.2024.124266
url_provider http://eprints.uthm.edu.my/