Polynomials feature-transformed heap dimensionality reduction and stacking ensemble for spectrometry data classification
Pattern recognition has emerged as a burgeoning field of study with increasing prominence in light of technological advancements, finding applications across various multidisciplinary domains. An essential part of pattern recognition is classification, which involves the categorization of labelled...
Main Author: | Nur Hasshima Hasbi |
---|---|
Format: | Thesis |
Language: | English |
Published: | 2023 |
Subjects: | QD1-65 General Including alchemy |
Online Access: | https://eprints.ums.edu.my/id/eprint/40556/1/24%20PAGES.pdf https://eprints.ums.edu.my/id/eprint/40556/2/FULLTEXT.pdf https://eprints.ums.edu.my/id/eprint/40556/ |
id |
my.ums.eprints.40556 |
---|---|
record_format |
eprints |
spelling |
my.ums.eprints.40556 2024-09-27T00:37:13Z https://eprints.ums.edu.my/id/eprint/40556/ Polynomials feature-transformed heap dimensionality reduction and stacking ensemble for spectrometry data classification Nur Hasshima Hasbi QD1-65 General Including alchemy Pattern recognition has emerged as a burgeoning field of study with increasing prominence in light of technological advancements, finding applications across various multidisciplinary domains. An essential part of pattern recognition is classification, which involves the categorization of labelled samples based on their data features. Fourier Transform Infrared (FTIR) spectroscopy, a well-established spectroscopic technique, has long been used to detect organic, polymeric, and even inorganic materials. This research endeavours to develop an accurate and optimal classification framework for FTIR spectra data using a combination of a heap dimensionality reduction (DR) technique, polynomial feature transformation, and a heuristic stacking ensemble technique. The high-dimensional nature of FTIR data poses a significant challenge for classification. To address this issue, DR techniques are used. However, no DR technique is superior to all others. Depending on the dataset used, one method may produce a better approximation of a dataset than the other techniques. In this study, the high-dimensional data are processed by multiple existing DR techniques. The resulting transformed features are consolidated into a heap and subsequently undergo polynomial feature transformation. The Partial Least Squares Discriminant Analysis (PLS-DA) method is then applied to obtain the final transformed features. The transformed features are then used as input for the stacking ensemble (SE) model, selected through a heuristic SE procedure. Artificial data was employed for the initial two experiments, while the complete framework was tested on six FTIR datasets for the third experiment to assess its applicability to real-world datasets.
The experimental results on these six datasets revealed that the proposed framework outperformed the other examined models. Notably, an average accuracy, sensitivity, and specificity of up to 99% was achieved for the D06 dataset. As a result, this framework holds potential not only for the classification of FTIR data but also for other high-dimensional data in general. 2023 Thesis NonPeerReviewed text en https://eprints.ums.edu.my/id/eprint/40556/1/24%20PAGES.pdf text en https://eprints.ums.edu.my/id/eprint/40556/2/FULLTEXT.pdf Nur Hasshima Hasbi (2023) Polynomials feature-transformed heap dimensionality reduction and stacking ensemble for spectrometry data classification. Masters thesis, Universiti Malaysia Sabah. |
institution |
Universiti Malaysia Sabah |
building |
UMS Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaysia Sabah |
content_source |
UMS Institutional Repository |
url_provider |
http://eprints.ums.edu.my/ |
language |
English |
topic |
QD1-65 General Including alchemy |
description |
Pattern recognition has emerged as a burgeoning field of study with increasing prominence in light of technological advancements, finding applications across various multidisciplinary domains. An essential part of pattern recognition is classification, which involves the categorization of labelled samples based on their data features. Fourier Transform Infrared (FTIR) spectroscopy, a well-established spectroscopic technique, has long been used to detect organic, polymeric, and even inorganic materials. This research endeavours to develop an accurate and optimal classification framework for FTIR spectra data using a combination of a heap dimensionality reduction (DR) technique, polynomial feature transformation, and a heuristic stacking ensemble technique. The high-dimensional nature of FTIR data poses a significant challenge for classification. To address this issue, DR techniques are used. However, no DR technique is superior to all others. Depending on the dataset used, one method may produce a better approximation of a dataset than the other techniques. In this study, the high-dimensional data are processed by multiple existing DR techniques. The resulting transformed features are consolidated into a heap and subsequently undergo polynomial feature transformation. The Partial Least Squares Discriminant Analysis (PLS-DA) method is then applied to obtain the final transformed features. The transformed features are then used as input for the stacking ensemble (SE) model, selected through a heuristic SE procedure. Artificial data was employed for the initial two experiments, while the complete framework was tested on six FTIR datasets for the third experiment to assess its applicability to real-world datasets. The experimental results on these six datasets revealed that the proposed framework outperformed the other examined models. Notably, an average accuracy, sensitivity, and specificity of up to 99% was achieved for the D06 dataset.
As a result, this framework holds potential not only for the classification of FTIR data but also for other high-dimensional data in general. |
format |
Thesis |
author |
Nur Hasshima Hasbi |
title |
Polynomials feature-transformed heap dimensionality reduction and stacking ensemble for spectrometry data classification |
publishDate |
2023 |
url |
https://eprints.ums.edu.my/id/eprint/40556/1/24%20PAGES.pdf https://eprints.ums.edu.my/id/eprint/40556/2/FULLTEXT.pdf https://eprints.ums.edu.my/id/eprint/40556/ |