Effective k-Means Clustering in Greedy Prepruned Tree-based Classification for Obstructive Sleep Apnea

Sim, Doreen Ying Ying and Ahmad I., Ismail and Chee Siong, Teh (2022) Effective k-Means Clustering in Greedy Prepruned Tree-based Classification for Obstructive Sleep Apnea. International Journal of Electrical and Electronic Engineering and Telecommunications, 11 (2). pp. 242-248. ISSN 2319-2518 (Online)

[img] PDF
Effective k-Means - Copy.pdf

Download (1MB)
Official URL: http://www.ijeetc.com/index.php?m=content&c=index&...


Incorporation of prepruned decision trees to kmeans clustering through one to three types of tree-depth controllers and cluster partitioning was done to develop a combined algorithm named as Greedy Pre-pruned Treebased Clustering (GPrTC) algorithm. Pre-pruned clustered decision trees are applied in a greedy concerted way to five datasets of obstructive sleep apnea and others from online data repositories. The optimal number of k clusters for kmeans clustering is determined after trees are greedily prepruned by tree-depth controllers of minimum number of leaf nodes, minimum number of parent nodes and maximum number of tree splitting. After applying the GPrTC algorithm to the assigned datasets, when compared with the conventional k-means clustering, results showed that the former has significantly lower average distortion per point and lower average run-time for 2-D and 3-D data over around 30 thousand points. Classification efficiency and speed of the former algorithm is more than two times better the latter algorithm over a higher range of points being run. GPrTC algorithm showed better classification accuracies than k-means clustering in almost all the assigned datasets. This concludes that the proposed algorithm is significantly much more efficient, less distortion and much faster than k-means clustering with moderately better in terms of classification and/or prediction accuracies. © 2022. Int. J. Elec. & Elecn. Eng. & Telcomm

Item Type: Article
Uncontrolled Keywords: Pre-pruned decision trees, k-means clustering, tree-depth controllers, GPrTC algorithm, average distortion per point, average run-time.
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Academic Faculties, Institutes and Centres > Faculty of Cognitive Sciences and Human Development
Depositing User: Siong
Date Deposited: 18 Jul 2022 01:55
Last Modified: 18 Jul 2022 01:55
URI: http://ir.unimas.my/id/eprint/38887

Actions (For repository members only: login required)

View Item View Item