Malware Classification and Detection using Variations of Machine Learning Algorithm Models

Malware attacks are attacks carried out by an attacker by sending malicious codes to various files or even many packages and servers. Therefore, reliable network operations are a factor that needs to be considered to prevent attacks as early as possible in order to avoid more severe system damage...

Full description

Saved in:
Bibliographic Details
Main Authors: Andi Maslan, Andi Maslan, Abdul Hamid, Abdul Hamid
Format: Article
Language:en
Published: 2025
Subjects:
Online Access:http://eprints.uthm.edu.my/12628/1/J19579_eac5d370d2c9829a28ac1bedf6af0f2e.pdf
http://eprints.uthm.edu.my/12628/
https://doi.org/10.26555/jiteki.v11i1.30477
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1834509274616692736
author Andi Maslan, Andi Maslan
Abdul Hamid, Abdul Hamid
author_facet Andi Maslan, Andi Maslan
Abdul Hamid, Abdul Hamid
author_sort Andi Maslan, Andi Maslan
building UTHM Library
collection Institutional Repository
content_provider Universiti Tun Hussein Onn Malaysia
content_source UTHM Institutional Repository
continent Asia
country Malaysia
description Malware attacks are attacks carried out by an attacker by sending malicious codes to various files or even many packages and servers. Therefore, reliable network operations are a factor that needs to be considered to prevent attacks as early as possible in order to avoid more severe system damage. Types of attacks can be Ping of Death, flooding, remote-controlled attacks, UDP flooding, and Smurf Attacks. Attack data was obtained from the ClaMP dataset, which has an unbalanced data set, and has very high noise, so it is necessary to analyze data packets in network logs and optimize feature extraction which is then analyzed statistically with machine learning algorithms. The purpose of the study is to detect, classify malware attacks using a variety of ML Algorithm models such as SVM, KNN and Neural Network and testing detection performance. The research stage starts from pre-Processing, extraction, feature selection and classification processes and performance testing. Training and testing data in the study used a mixed model, namely data division, split model and cross validation. The results of the study concluded that the best algorithm for detecting malware packages is the Neural Network for the Feature Combination category with an accuracy rate of 96.91%, Recall of 97.35% and Precision of 96.78%. So that the study can have implications for cyber experts to be able to prevent malware attacks early. While further research requires a special algorithm to improve malware attack detection, in addition to KNN, SVM and Neural Network. And another research challenge is to focus on feature extraction techniques on datasets that have unbalanced or varied features with the Natural Language Processing (NLP) approach. So this research can be used as a reference for researchers who are conducting research in the same field.
format Article
id my.uthm.eprints-12628
institution Universiti Tun Hussein Onn Malaysia
language en
publishDate 2025
record_format eprints
spelling my.uthm.eprints-126282025-05-30T08:58:23Z http://eprints.uthm.edu.my/12628/ Malware Classification and Detection using Variations of Machine Learning Algorithm Models Andi Maslan, Andi Maslan Abdul Hamid, Abdul Hamid QA Mathematics Malware attacks are attacks carried out by an attacker by sending malicious codes to various files or even many packages and servers. Therefore, reliable network operations are a factor that needs to be considered to prevent attacks as early as possible in order to avoid more severe system damage. Types of attacks can be Ping of Death, flooding, remote-controlled attacks, UDP flooding, and Smurf Attacks. Attack data was obtained from the ClaMP dataset, which has an unbalanced data set, and has very high noise, so it is necessary to analyze data packets in network logs and optimize feature extraction which is then analyzed statistically with machine learning algorithms. The purpose of the study is to detect, classify malware attacks using a variety of ML Algorithm models such as SVM, KNN and Neural Network and testing detection performance. The research stage starts from pre-Processing, extraction, feature selection and classification processes and performance testing. Training and testing data in the study used a mixed model, namely data division, split model and cross validation. The results of the study concluded that the best algorithm for detecting malware packages is the Neural Network for the Feature Combination category with an accuracy rate of 96.91%, Recall of 97.35% and Precision of 96.78%. So that the study can have implications for cyber experts to be able to prevent malware attacks early. While further research requires a special algorithm to improve malware attack detection, in addition to KNN, SVM and Neural Network. And another research challenge is to focus on feature extraction techniques on datasets that have unbalanced or varied features with the Natural Language Processing (NLP) approach. So this research can be used as a reference for researchers who are conducting research in the same field. 2025 Article PeerReviewed text en http://eprints.uthm.edu.my/12628/1/J19579_eac5d370d2c9829a28ac1bedf6af0f2e.pdf Andi Maslan, Andi Maslan and Abdul Hamid, Abdul Hamid (2025) Malware Classification and Detection using Variations of Machine Learning Algorithm Models. Jurnal Ilmiah Teknik Elektro Komputer dan Informatika (JITEKI), 11 (1). pp. 27-41. ISSN 2338-3070 https://doi.org/10.26555/jiteki.v11i1.30477
spellingShingle QA Mathematics
Andi Maslan, Andi Maslan
Abdul Hamid, Abdul Hamid
Malware Classification and Detection using Variations of Machine Learning Algorithm Models
title Malware Classification and Detection using Variations of Machine Learning Algorithm Models
title_full Malware Classification and Detection using Variations of Machine Learning Algorithm Models
title_fullStr Malware Classification and Detection using Variations of Machine Learning Algorithm Models
title_full_unstemmed Malware Classification and Detection using Variations of Machine Learning Algorithm Models
title_short Malware Classification and Detection using Variations of Machine Learning Algorithm Models
title_sort malware classification and detection using variations of machine learning algorithm models
topic QA Mathematics
url http://eprints.uthm.edu.my/12628/1/J19579_eac5d370d2c9829a28ac1bedf6af0f2e.pdf
http://eprints.uthm.edu.my/12628/
https://doi.org/10.26555/jiteki.v11i1.30477
url_provider http://eprints.uthm.edu.my/