Mohammed, Athraa Jasim and Yusof, Yuhanis and Husni, Husniza (2013) Weight-based firefly algorithm for document clustering. In: First International Conference on Advanced Data and Information Engineering (DaEng-2013), December 16-18th, 2013, Kuala Lumpur, Malaysia.
Abstract
Existing clustering techniques have many drawbacks, including the tendency to become trapped in local optima. In this paper, we introduce the use of a new metaheuristic algorithm, namely the Firefly algorithm (FA), to increase solution diversity. FA is a nature-inspired algorithm that is used in many optimization problems. The FA is realized in document clustering by executing it on the Reuters-21578 dataset. The algorithm identifies the document that has the highest light intensity in the search space and represents it as a centroid. This is followed by recognizing similar documents using the cosine similarity function. Documents that are similar to the centroid are placed in one cluster, while dissimilar documents are placed in the other. Experiments performed on the chosen dataset produce high values of Purity and F-measure. This suggests that the proposed Firefly algorithm is a possible approach to document clustering.
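The centroid-selection and grouping steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not define light intensity, so the sketch assumes a document's brightness is its total term weight, and the similarity cut-off `threshold`, the function names, and the toy documents are all hypothetical. Purity is computed in its usual form, as the fraction of documents that belong to the majority class of their cluster.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse term-weight dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def cluster_by_brightest(docs, threshold=0.3):
    """Pick the brightest unassigned document as a centroid, then group
    every unassigned document whose similarity to it reaches the threshold.

    ASSUMPTION: brightness is the sum of a document's term weights; the
    abstract does not give the actual light-intensity formula.
    ASSUMPTION: `threshold` is an illustrative cut-off, not a value
    taken from the paper.
    """
    unassigned = set(range(len(docs)))
    clusters = []
    while unassigned:
        # Ties in brightness are broken in favour of the lowest index.
        centroid = max(unassigned, key=lambda i: (sum(docs[i].values()), -i))
        members = [
            i for i in sorted(unassigned)
            if i == centroid or cosine(docs[centroid], docs[i]) >= threshold
        ]
        clusters.append(members)
        unassigned -= set(members)
    return clusters


def purity(clusters, labels):
    """Fraction of documents that share their cluster's majority label."""
    total = sum(len(c) for c in clusters)
    majority = sum(
        Counter(labels[i] for i in c).most_common(1)[0][1] for c in clusters
    )
    return majority / total


# Toy term-weight vectors and class labels (made up for illustration).
docs = [
    {"oil": 2.0, "price": 1.0},
    {"oil": 1.0, "barrel": 1.0},
    {"wheat": 2.0, "crop": 2.0},
    {"wheat": 1.0, "export": 1.0},
]
labels = ["crude", "crude", "grain", "grain"]

clusters = cluster_by_brightest(docs)
score = purity(clusters, labels)
```

In this sketch each pass removes the chosen centroid and its neighbours from the pool, so the loop always terminates; a full firefly algorithm would additionally move fireflies toward brighter ones, which is omitted here.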
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Additional Information: | Series Title: Lecture Notes in Electrical Engineering |
Uncontrolled Keywords: | Firefly algorithm, partitional clustering, hierarchical clustering, text clustering. |
Subjects: | Q Science > QA Mathematics > QA76 Computer software |
Divisions: | College of Arts and Sciences |
Depositing User: | Dr. Yuhanis Yusof |
Date Deposited: | 10 Feb 2016 08:02 |
Last Modified: | 10 Feb 2016 08:02 |
URI: | https://repo.uum.edu.my/id/eprint/17115 |