Document classification based on kNN algorithm by term vector space reduction

Classification (of information); Data handling; Data mining; Information retrieval systems; Learning algorithms; Text processing; Vectors; Document Classification; Space reductions; Text classifiers; Text mining; Textual data; Unstructured data; Vector spaces

Saved in:
Bibliographic Details
Main Authors: Moldagulova A., Sulaiman R.B.
Other Authors: 57160071400
Format: Conference Paper
Published: IEEE Computer Society 2023
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1833412828896165888
author Moldagulova A.
Sulaiman R.B.
author2 57160071400
author_facet 57160071400
Moldagulova A.
Sulaiman R.B.
author_sort Moldagulova A.
building UNITEN Library
collection Institutional Repository
content_provider Universiti Tenaga Nasional
content_source UNITEN Institutional Repository
continent Asia
country Malaysia
description Classification (of information); Data handling; Data mining; Information retrieval systems; Learning algorithms; Text processing; Vectors; Document Classification; Space reductions; Text classifiers; Text mining; Textual data; Unstructured data; Vector spaces
format Conference Paper
id my.uniten.dspace-23499
institution Universiti Tenaga Nasional
publishDate 2023
publisher IEEE Computer Society
record_format dspace
spelling my.uniten.dspace-234992023-05-29T14:49:57Z Document classification based on kNN algorithm by term vector space reduction Moldagulova A. Sulaiman R.B. 57160071400 25825633600 Classification (of information); Data handling; Data mining; Information retrieval systems; Learning algorithms; Text processing; Vectors; Document Classification; Space reductions; Text classifiers; Text mining; Textual data; Unstructured data; Vector spaces Nowadays there is an increasing interest in the area of unstructured data analysis. The vast majority of unstructured data belongs to unstructured text data. Retrieving useful information from huge volume of unstructured text data is very challenging task. Text mining is a thought-provoking research area as it tries to discover knowledge from unstructured text. This paper deals with methods used for handling unstructured text data in particular document classification problems. Most document classification methods based on term vector space model of representation of unstructured textual data. The term vector space model is easy to implement, provides uniform representation for documents. However feature space for a large collection of documents can reach millions and be sparse. One of the issues is to reduce the dimension of the term-document matrix. In this research we proposed an approach for reduction of term vector space in KNN algorithm. � ICROS. Final 2023-05-29T06:49:56Z 2023-05-29T06:49:56Z 2018 Conference Paper 2-s2.0-85060480043 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85060480043&partnerID=40&md5=d52c11efb08aee7a1a10a37b9778cd46 https://irepository.uniten.edu.my/handle/123456789/23499 2018-October 8571540 387 391 IEEE Computer Society Scopus
spellingShingle Moldagulova A.
Sulaiman R.B.
Document classification based on kNN algorithm by term vector space reduction
title Document classification based on kNN algorithm by term vector space reduction
title_full Document classification based on kNN algorithm by term vector space reduction
title_fullStr Document classification based on kNN algorithm by term vector space reduction
title_full_unstemmed Document classification based on kNN algorithm by term vector space reduction
title_short Document classification based on kNN algorithm by term vector space reduction
title_sort document classification based on knn algorithm by term vector space reduction
url_provider http://dspace.uniten.edu.my/