The effect of term weighting measures on feature selection
Feature selection is an important stage in any text mining classification techniques. In this dissertation, we study and analyze Categorical Term Descriptor (CTD) (Bong, C.H., 2001) feature selection method. which gives comparative accuracy results compared to other well-known feature selection meth...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | English |
Published: |
2007
|
Subjects: | |
Online Access: | http://ir.unimas.my/id/eprint/1714/2/Latifah%20Loh%20Abdullah.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Feature selection is an important stage in any text mining classification techniques. In this dissertation, we study and analyze Categorical Term Descriptor (CTD) (Bong, C.H., 2001) feature selection method. which gives comparative accuracy results compared to other well-known feature selection method like Information Gain and Chi-Square. Our goal is to evaluate the significance of each term weighting measure that forms the CTD method. Our experimental results have shown taht CTD does not handle datasets that contain misclassifications. We have proven that CTD performs well in categories which are distinct as opposed to general and miscellaneous categories. |
---|