Latifah Loh, Abdullah (2007) The effect of term weighting measures on feature selection. Masters thesis, Universiti Malaysia Sarawak (UNIMAS).
PDF (Please get the password by email to repository@unimas.my , or call ext: 3914 / 3942 / 3933)
Latifah Loh Abdullah.pdf Restricted to Registered users only Download (7MB) |
Abstract
Feature selection is an important stage in any text mining classification techniques. In this dissertation, we study and analyze Categorical Term Descriptor (CTD) (Bong, C.H., 2001) feature selection method. which gives comparative accuracy results compared to other well-known feature selection method like Information Gain and Chi-Square. Our goal is to evaluate the significance of each term weighting measure that forms the CTD method. Our experimental results have shown taht CTD does not handle datasets that contain misclassifications. We have proven that CTD performs well in categories which are distinct as opposed to general and miscellaneous categories.
Item Type: | Thesis (Masters) |
---|---|
Additional Information: | Thesis (M.Sc.) -- Universiti Malaysia Sarawak, 2007. |
Uncontrolled Keywords: | term weighting, features selection, Universiti Malaysia Sarawak, UNIMAS, postgraduate, research, IPTA, education, kuching, samarahan, sarawak, malaysia, universiti, university |
Subjects: | T Technology > T Technology (General) |
Divisions: | Academic Faculties, Institutes and Centres > Faculty of Computer Science and Information Technology Faculties, Institutes, Centres > Faculty of Computer Science and Information Technology Academic Faculties, Institutes and Centres > Faculty of Computer Science and Information Technology |
Depositing User: | Karen Kornalius |
Date Deposited: | 15 Apr 2014 02:08 |
Last Modified: | 07 Mar 2023 08:09 |
URI: | http://ir.unimas.my/id/eprint/1714 |
Actions (For repository members only: login required)
View Item |