mailto:uumlib@uum.edu.my 24x7 Service; AnyTime; AnyWhere

Integrated bisect K-means and firefly algorithm for hierarchical text clustering

Mohammed, Athraa Jasim and Yusof, Yuhanis and Husni, Husniza (2016) Integrated bisect K-means and firefly algorithm for hierarchical text clustering. Journal of Engineering and Applied Sciences, 11 (3). pp. 522-527. ISSN 1816-949X

[thumbnail of JEAS 11 3 2016 522-527.pdf] PDF
Restricted to Registered users only

Download (257kB) | Request a copy

Abstract

Hierarchical text clustering plays a significant role in systematically browsing, summarizing and organizing documents into structure manner. However, the Bisect K-means which is a well-known hierarchical clustering algorithm is only able to generate local optimal solutions due to the employment of K-means as part of its process. In this study, we propose to replace the K-means with firefly algorithm, hence producing a Bisect FA for hierarchical clustering. At each level of the proposed Bisect FA, firefly algorithm works to produce the best clusters. For evaluation purposes, we performed experiments on 20 newsgroups dataset that is commonly used in text clustering studies.The results demonstrate that Bisect FA obtains more accurate and compact clustering than Bisect K-means, K-means and C-firefly algorithms. Such a result indicates that the proposed Bisect FA is a competitive algorithm for unsupervised learning.

Item Type: Article
Uncontrolled Keywords: Hierarchical Text Clustering, firefly algorithm, bisect K-means divisive clustering, documents
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: School of Computing
Depositing User: Dr. Yuhanis Yusof
Date Deposited: 18 Jan 2017 03:34
Last Modified: 18 Jan 2017 03:34
URI: https://repo.uum.edu.my/id/eprint/20644

Actions (login required)

View Item View Item