A review on feature selection methods for sentiment analysis
Text documents are normally represented as a feature-document matrix in sentiment analysis. Features can be single words from the text document or more complex pairs extracted by different schemes that adds information in order to enrich the feature-document matrix representation. Having diverse fea...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | en |
| Published: |
American Scientific Publishers
2015
|
| Subjects: | |
| Online Access: | https://eprints.ums.edu.my/id/eprint/15367/1/A_Review_on_the_Ensemble_Framework_for_Sentiment_Analysis.pdf https://eprints.ums.edu.my/id/eprint/15367/ https://doi.org/10.1166/asl.2015.6475 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Text documents are normally represented as a feature-document matrix in sentiment analysis. Features can be single words from the text document or more complex pairs extracted by different schemes that adds information in order to enrich the feature-document matrix representation. Having diverse feature types however creates a problem of high dimensionality due to the vast number of features and relations they hold. Thus, feature selection helps in ensuring that effective and efficient sentiment analysis applications can be developed by selecting features that are relevant and informative to assist classifiers to perform better and to reduce the processing load by narrowing down the feature set. This paper highlights methods used for feature selection, namely filter, wrapper and embedded. Prior to feature selection, preprocessing techniques are performed to reduce the amount of features first. This paper is concluded by summarizing this review and outlining the challenges faced and proposing the ensemble feature selection method for sentiment analysis data. |
|---|
