Abu Bakar, Juhaida and Omar, Khairuddin and Nasrudin, Mohammad Faidzul and Murah, Mohd Zamri (2016) Malay part-of-speech tagging: An me-based approach. In: International Conference on ICT for Transformation 2016, 05-07 April 2016, Center for postgraduate UMS Sabah Malaysia.. (Unpublished)
![]() |
PDF
Restricted to Registered users only Download (681kB) | Request a copy |
Abstract
Research on Malay Part-of-Speech (POS) tagging has greatly increased over the past few years. Based on previous literature, POS-tags are known as the first phase in the automated text analysis; and the development of language technologies can barely initiate without this initial phase.Malay language can be written in either the Roman or Jawi scripts.We highlight the existing POS-tags approaches and techniques; and suggest the development of Malay Jawi POS-tags using ME-based approach – using specific contextual information of Malay corpora that has been written in Jawi script. We conduct our test on NUWT Corpus.It has been found out that the ME-based approach reaches an accuracy level of 89.30% in average; and yields the precision and recall rates of 94% for the highest level of accuracy achieved.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Additional Information: | Organized by: Universiti Utara Malaysia |
Uncontrolled Keywords: | NLP pipeline task, POS-tags, tagging approach, Malay language, Jawi |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | School of Computing |
Depositing User: | Mrs. Norazmilah Yaakub |
Date Deposited: | 28 Feb 2018 02:02 |
Last Modified: | 28 Feb 2018 02:02 |
URI: | https://repo.uum.edu.my/id/eprint/23520 |
Actions (login required)
![]() |
View Item |