UUM Repository | Universiti Utara Malaysian Institutional Repository
FAQs | Feedback | Search Tips | Sitemap

Comparing the performance of winsorize tree to other data mining techniques for cases involving outliers


Chee, Keong Ch’ng (2019) Comparing the performance of winsorize tree to other data mining techniques for cases involving outliers. International Journal of Recent Technology and Engineering, 8 (2S2). pp. 197-201. ISSN 2277-3878

[img] PDF
Restricted to Registered users only

Download (761kB)

Abstract

Winsorize tree is a modified tree that reformed from classification and regression tree (CART). It lays on the strategy of handling and accommodating the outliers simultaneously in all nodes while generating the subsequence branches of tree. Normally, due to the existence of outlier, the accuracy rate of most of the classifiers will be affected. Therefore, we propose winsorize tree which could resist to anomaly data. It protects the originality of the data while performing the splitting process. In this study, winsorize tree was compared to other classifiers. The results obtained from five real datasets indicate that the proposed winsorize tree performs as good as or even better compare to the other data mining techniques based on the misclassification rate.

Item Type: Article
Uncontrolled Keywords: winsorize tree algorithm; outlier; gini index; misclassification rate; classification; classification and regression tree; winsorized tree.
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: School of Quantitative Sciences
Depositing User: Mrs. Norazmilah Yaakub
Date Deposited: 19 Mar 2020 06:17
Last Modified: 19 Mar 2020 06:17
URI: http://repo.uum.edu.my/id/eprint/26925

Actions (login required)

View Item View Item