An empirical study on CO2 emissions in ASEAN countries

This paper proposes a local feature selection (FS) measure namely, Categorical Descriptor Term (CTD) for text categorization. It is derived based on classic term weighting scheme, TFIDF. The method explicitly chooses feature set for each category by only selecting set of terms from relevant category...

詳細記述

保存先:
書誌詳細
主要な著者: Bong, Chih How, Narayanan, Kulathuramaiyer
フォーマット: Conference or Workshop Item
言語:English
出版事項: 2012
主題:
オンライン・アクセス:http://ir.unimas.my/id/eprint/8478/1/An%20Empirical%20Study%20of%20Feature%20Selection%20for%20Text%20Categorization%20based%20on%20Term%20%28abstract%29.pdf
http://ir.unimas.my/id/eprint/8478/
http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=6396635&tag=1
タグ: タグ追加
タグなし, このレコードへの初めてのタグを付けませんか!
その他の書誌記述
要約:This paper proposes a local feature selection (FS) measure namely, Categorical Descriptor Term (CTD) for text categorization. It is derived based on classic term weighting scheme, TFIDF. The method explicitly chooses feature set for each category by only selecting set of terms from relevant category. Although past literatures have suggested that the use of features from irrelevant categories can improve the measure of text categorization, we believe that by incorporating only relevant feature can be highly effective. The experimental comparison is carried out between CTD and five wellknown feature selection measures: Information Gain, Chi-Square, Correlation Coefficient, Odd Ratio and GSS Coefficient. The results also show that our proposed method can perform comparatively well with other FS measures, especially on collection with highly overlapped topics.