UUM Repository | Universiti Utara Malaysian Institutional Repository
FAQs | Feedback | Search Tips | Sitemap

Divergence analysis and processing for Mandarin-English parallel text exploitation


Shun, Chieh Lin and Jhing, Fa Wang (2004) Divergence analysis and processing for Mandarin-English parallel text exploitation. In: Knowledge Management International Conference and Exhibition 2004 (KMICE 2004), 14-15 February 2004, Evergreen Laurel Hotel, Penang.

[img]
Preview
PDF
Download (66kB) | Preview

Abstract

Previous work shows that the process of parallel text exploitation to extract mappings between language pairs raises the capability of language translation. However, while this process can be fully automated, one thorny problem called “divergence” causes indisposed mapping extraction. Therefore, this paper discuss the issues of parallel text exploitation, in general, with special emphasis on divergence analysis and processing. In the experiments on a Mandarin-English travel conversation corpus of 11,885 sentence pairs, the perplexity with the alignments in IBM translation model is reduced averagely from 13.65 to 4.18.

Item Type: Conference or Workshop Item (Paper)
Additional Information: ISBN 983-2865-90-5 Organized by: Faculty of Information Technology, UUM
Uncontrolled Keywords: Divergence Analysis and Processing, Parallel Text Exploitation
Subjects: Q Science > QA Mathematics > QA76 Computer software
Divisions: College of Arts and Sciences
Depositing User: Mrs. Norazmilah Yaakub
Date Deposited: 10 May 2015 06:59
Last Modified: 10 May 2015 06:59
URI: http://repo.uum.edu.my/id/eprint/13872

Actions (login required)

View Item View Item