mailto:uumlib@uum.edu.my 24x7 Service; AnyTime; AnyWhere

A Meta-heuristic Algorithm for the Minimal High-Quality Feature Extraction of Online Reviews

Mat Zin, Harnani and Mustapha, Norwati and Azmi Murad, Masrah Azrifah and Mohd Sharef, Nurfadhlina (2022) A Meta-heuristic Algorithm for the Minimal High-Quality Feature Extraction of Online Reviews. Journal of Information and Communication Technology, 21 (4). pp. 571-593. ISSN 2180-3862

[thumbnail of JICT 21 04 2022 571-593.pdf]
Preview
PDF - Published Version
Available under License Attribution 4.0 International (CC BY 4.0).

Download (1MB) | Preview

Abstract

Feature extraction and selection are critical in sentiment analysis (SA) to extract and select only the appropriate features by removing those deemed redundant. As such, the successful implementation of this process leads to better classification accuracy. Inevitably, selecting high-quality minimal features can be challenging given the inherent complication in dealing with over-fitting issues. Most of the current studies used a heuristic method to perform the classification process that will result in selecting and examining only a single feature subset, while ignoring the other subsets that might give better results. This study explored the effect of using the meta-heuristic method together with the ensemble classification method in the sentiment classification of online reviews. Adding to that point, the extraction and selection of relevant features used feature ranking, hyper-parameter optimization, crossover, and mutation, while the classification process utilized the ensemble classifier. The proposed method was tested on the polarity movie review dataset v2.0 and product review dataset (books, electronics, kitchen, and music). The test results indicated that the proposed method significantly improved the classification results by 94%, which far exceeded the existing method. Therefore, the proposed feature extraction and selection method can help in improving the performance of SA in online reviews and, at the same time, reduce the number of extracted features.

Item Type: Article
Uncontrolled Keywords: Feature extraction, feature selection, online reviews, meta-heuristics, sentiment analysis
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: School of Computing
Depositing User: Mrs Nurin Jazlina Hamid
Date Deposited: 24 Jan 2023 06:15
Last Modified: 09 Feb 2023 03:13
URI: https://repo.uum.edu.my/id/eprint/29111

Actions (login required)

View Item View Item