Chee, Keong Ch’ng (2019) Comparing the performance of winsorize tree to other data mining techniques for cases involving outliers. International Journal of Recent Technology and Engineering, 8 (2S2). pp. 197-201. ISSN 2277-3878
PDF
Restricted to Registered users only Download (761kB) |
Abstract
Winsorize tree is a modified tree that reformed from classification and regression tree (CART). It lays on the strategy of handling and accommodating the outliers simultaneously in all nodes while generating the subsequence branches of tree. Normally, due to the existence of outlier, the accuracy rate of most of the classifiers will be affected. Therefore, we propose winsorize tree which could resist to anomaly data. It protects the originality of the data while performing the splitting process. In this study, winsorize tree was compared to other classifiers. The results obtained from five real datasets indicate that the proposed winsorize tree performs as good as or even better compare to the other data mining techniques based on the misclassification rate.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | winsorize tree algorithm; outlier; gini index; misclassification rate; classification; classification and regression tree; winsorized tree. |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | School of Quantitative Sciences |
Depositing User: | Mrs. Norazmilah Yaakub |
Date Deposited: | 19 Mar 2020 06:17 |
Last Modified: | 19 Mar 2020 06:17 |
URI: | https://repo.uum.edu.my/id/eprint/26925 |
Actions (login required)
View Item |