A review on feature selection methods for sentiment analysis

Text documents are normally represented as a feature-document matrix in sentiment analysis. Features can be single words from the text document or more complex pairs extracted by different schemes that adds information in order to enrich the feature-document matrix representation. Having diverse fea...

Full description

Saved in:
Bibliographic Details
Main Authors: Hung, Lai Po, Rayner Alfred, Mohd Hanafi Ahmad Hijazi
Format: Article
Language:en
Published: American Scientific Publishers 2015
Subjects:
Online Access:https://eprints.ums.edu.my/id/eprint/15367/1/A_Review_on_the_Ensemble_Framework_for_Sentiment_Analysis.pdf
https://eprints.ums.edu.my/id/eprint/15367/
https://doi.org/10.1166/asl.2015.6475
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Text documents are normally represented as a feature-document matrix in sentiment analysis. Features can be single words from the text document or more complex pairs extracted by different schemes that adds information in order to enrich the feature-document matrix representation. Having diverse feature types however creates a problem of high dimensionality due to the vast number of features and relations they hold. Thus, feature selection helps in ensuring that effective and efficient sentiment analysis applications can be developed by selecting features that are relevant and informative to assist classifiers to perform better and to reduce the processing load by narrowing down the feature set. This paper highlights methods used for feature selection, namely filter, wrapper and embedded. Prior to feature selection, preprocessing techniques are performed to reduce the amount of features first. This paper is concluded by summarizing this review and outlining the challenges faced and proposing the ensemble feature selection method for sentiment analysis data.