Identification of significant features and machine learning technique in predicting helpful reviews

Consumers nowadays rely heavily on online reviews in making their purchase decisions. However, they are often overwhelmed by the mass amount of product reviews that are being generated on online platforms. Therefore, it is deemed essential to determine the helpful reviews, as it will significantly r...

Full description

Saved in:
Bibliographic Details
Main Authors: Quaderi, Shah Jafor Sadeek, Varathan, Kasturi Dewi
Format: Article
Published: PeerJ 2024
Subjects:
Online Access:http://eprints.um.edu.my/45753/
https://doi.org/10.7717/peerj-cs.1745
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.um.eprints.45753
record_format eprints
spelling my.um.eprints.457532024-11-11T07:09:22Z http://eprints.um.edu.my/45753/ Identification of significant features and machine learning technique in predicting helpful reviews Quaderi, Shah Jafor Sadeek Varathan, Kasturi Dewi QA75 Electronic computers. Computer science Consumers nowadays rely heavily on online reviews in making their purchase decisions. However, they are often overwhelmed by the mass amount of product reviews that are being generated on online platforms. Therefore, it is deemed essential to determine the helpful reviews, as it will significantly reduce the number of reviews that each consumer has to ponder. A review is identified as a helpful review if it has significant information that helps the reader in making a purchase decision. Many reviews posted online are lacking a sufficient amount of information used in the decision -making process. Past research has neglected much useful information that can be utilized in predicting helpful reviews. This research identifies significant information which is represented as features categorized as linguistic, metadata, readability, subjectivity, and polarity that have contributed to predicting helpful online reviews. Five machine learning models were compared on two Amazon open datasets, each consisting of 9,882,619 and 65,222 user reviews. The significant features used in the Random Forest technique managed to outperform other techniques used by previous researchers with an accuracy of 89.36%. PeerJ 2024-01 Article PeerReviewed Quaderi, Shah Jafor Sadeek and Varathan, Kasturi Dewi (2024) Identification of significant features and machine learning technique in predicting helpful reviews. PeerJ Computer Science, 10. e1745. ISSN 2376-5992, DOI https://doi.org/10.7717/peerj-cs.1745 <https://doi.org/10.7717/peerj-cs.1745>. https://doi.org/10.7717/peerj-cs.1745 10.7717/peerj-cs.1745
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Quaderi, Shah Jafor Sadeek
Varathan, Kasturi Dewi
Identification of significant features and machine learning technique in predicting helpful reviews
description Consumers nowadays rely heavily on online reviews in making their purchase decisions. However, they are often overwhelmed by the mass amount of product reviews that are being generated on online platforms. Therefore, it is deemed essential to determine the helpful reviews, as it will significantly reduce the number of reviews that each consumer has to ponder. A review is identified as a helpful review if it has significant information that helps the reader in making a purchase decision. Many reviews posted online are lacking a sufficient amount of information used in the decision -making process. Past research has neglected much useful information that can be utilized in predicting helpful reviews. This research identifies significant information which is represented as features categorized as linguistic, metadata, readability, subjectivity, and polarity that have contributed to predicting helpful online reviews. Five machine learning models were compared on two Amazon open datasets, each consisting of 9,882,619 and 65,222 user reviews. The significant features used in the Random Forest technique managed to outperform other techniques used by previous researchers with an accuracy of 89.36%.
format Article
author Quaderi, Shah Jafor Sadeek
Varathan, Kasturi Dewi
author_facet Quaderi, Shah Jafor Sadeek
Varathan, Kasturi Dewi
author_sort Quaderi, Shah Jafor Sadeek
title Identification of significant features and machine learning technique in predicting helpful reviews
title_short Identification of significant features and machine learning technique in predicting helpful reviews
title_full Identification of significant features and machine learning technique in predicting helpful reviews
title_fullStr Identification of significant features and machine learning technique in predicting helpful reviews
title_full_unstemmed Identification of significant features and machine learning technique in predicting helpful reviews
title_sort identification of significant features and machine learning technique in predicting helpful reviews
publisher PeerJ
publishDate 2024
url http://eprints.um.edu.my/45753/
https://doi.org/10.7717/peerj-cs.1745
_version_ 1816130454686793728
score 13.223943