mailto:uumlib@uum.edu.my 24x7 Service; AnyTime; AnyWhere

A statistical interestingness measures for XML based association rules

Mohd Shaharanee, Izwan Nizal and Hadzic, Fedja and Dillon, Tharam S. (2010) A statistical interestingness measures for XML based association rules. In: PRICAI 2010: Trends in Artificial Intelligence: 11th Pacific Rim International Conference on Artificial Intelligence, Daegu, Korea, August 30-September 2, 2010. Proceedings. Springer-Verlag Berlin Heidelberg, Berlin, pp. 194-205. ISBN 9783642152450

[thumbnail of En._Izwan_Nizal_Mohd_Shaharanee[1].pdf] PDF
Restricted to Repository staff only

Download (142kB)

Abstract

Recently mining frequent substructures from XML data has gained a considerable amount of interest. Different methods have been proposed and examined for mining frequent patterns from XML documents efficiently and effectively. While many frequent XML patterns generated are useful and interesting, it is common that a large portion of them is not considered as interesting or significant for the application at hand. In this paper, we present a systematic approach to ascertain whether the discovered XML patterns are significant and not just coincidental associations, and provide a precise statistical approach to support this framework. The proposed strategy combines data mining and statistical measurement techniques to discard the non significant patterns. In this paper we considered the "Prions" database that describes the protein instances stored for Human Prions Protein. The proposed unified framework is applied on this dataset to demonstrate its effectiveness in assessing interestingness of discovered XML patterns by statistical means.When the dataset is used for classification/prediction purposes, the proposed approach will discard non significant XML patterns, without the cost of a reduction in the accuracy of the pattern set as a whole.

Item Type: Book Section
Uncontrolled Keywords: data mining, interesting rules, statistical analysis, semi-structured data
Subjects: Q Science > Q Science (General)
Divisions: College of Arts and Sciences
Depositing User: Dr. Izwan Nizal Mohd Shaharanee
Date Deposited: 12 Nov 2010 01:32
Last Modified: 05 Dec 2016 08:52
URI: https://repo.uum.edu.my/id/eprint/1514

Actions (login required)

View Item View Item