UUM Repository | Universiti Utara Malaysian Institutional Repository

Malay part-of-speech tagging: An me-based approach


Abu Bakar, Juhaida and Omar, Khairuddin and Nasrudin, Mohammad Faidzul and Murah, Mohd Zamri (2016) Malay part-of-speech tagging: An me-based approach. In: International Conference on ICT for Transformation 2016, 05-07 April 2016, Center for postgraduate UMS Sabah Malaysia. (Unpublished)

Abstract

Research on Malay Part-of-Speech (POS) tagging has greatly increased over the past few years. According to previous literature, POS tagging is the first phase of automated text analysis, and the development of language technologies can barely begin without it. The Malay language can be written in either the Roman or the Jawi script. We review the existing POS tagging approaches and techniques, and propose the development of a Malay Jawi POS tagger using an ME-based approach that draws on specific contextual information from Malay corpora written in Jawi script. We conduct our test on the NUWT Corpus. The ME-based approach reaches an average accuracy of 89.30%, and yields precision and recall rates of 94% at the highest level of accuracy achieved.
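
The abstract does not expand "ME" or list the contextual features used. As a rough illustration only, the sketch below assumes that ME means maximum entropy and shows how such a tagger could turn contextual features into a probability distribution over tags. The feature templates, the tag names (NOUN, VERB, PRON), the weights, and the romanized example words are all hypothetical; they are not taken from the paper or from the NUWT Corpus, and a real model would learn its weights from annotated Jawi text.

```python
import math


def extract_features(words, i, prev_tag):
    """Return the binary contextual features active for the word at position i.

    The templates below are illustrative assumptions, not the paper's feature set.
    """
    word = words[i]
    prev_word = words[i - 1] if i > 0 else "<s>"
    next_word = words[i + 1] if i + 1 < len(words) else "</s>"
    return [
        f"word={word}",
        f"prev_word={prev_word}",
        f"next_word={next_word}",
        f"prev_tag={prev_tag}",
        f"prefix2={word[:2]}",
    ]


def tag_probabilities(features, weights, tags):
    """Maximum-entropy (softmax) distribution over tags given the active features."""
    scores = {t: sum(weights.get((f, t), 0.0) for f in features) for t in tags}
    max_score = max(scores.values())  # subtract the max for numerical stability
    exp_scores = {t: math.exp(s - max_score) for t, s in scores.items()}
    z = sum(exp_scores.values())  # normalizing constant
    return {t: e / z for t, e in exp_scores.items()}


def greedy_tag(words, weights, tags):
    """Tag left to right, feeding each predicted tag into the next decision."""
    prev_tag = "<s>"
    output = []
    for i in range(len(words)):
        probs = tag_probabilities(extract_features(words, i, prev_tag), weights, tags)
        prev_tag = max(probs, key=probs.get)
        output.append(prev_tag)
    return output


# Hypothetical tag set and hand-set weights, for illustration only.
TAGS = ["NOUN", "VERB", "PRON"]
WEIGHTS = {
    ("word=saya", "PRON"): 2.0,
    ("prev_tag=PRON", "VERB"): 1.5,
    ("prefix2=ma", "VERB"): 1.0,
    ("prev_tag=VERB", "NOUN"): 1.5,
    ("word=nasi", "NOUN"): 2.0,
}

# Romanized Malay for readability; the paper works with Jawi script.
sentence = ["saya", "makan", "nasi"]
predicted = greedy_tag(sentence, WEIGHTS, TAGS)
print(predicted)
```

Greedy left-to-right decoding is used here only to keep the sketch short; a full tagger might instead use beam search or Viterbi decoding over the same per-word distributions.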

Item Type: Conference or Workshop Item (Paper)
Additional Information: Organized by: Universiti Utara Malaysia
Uncontrolled Keywords: NLP pipeline task, POS-tags, tagging approach, Malay language, Jawi
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: School of Computing
Depositing User: Mrs. Norazmilah Yaakub
Date Deposited: 28 Feb 2018 02:02
Last Modified: 28 Feb 2018 02:02
URI: http://repo.uum.edu.my/id/eprint/23520
