Improving English-to-Chinese translation for technical terms using morphological information

Xianchao Wu, Naoaki Okazaki, Takashi Tsunakawa, Jun'ichi Tsujii

Research output: Contribution to conferencePaperpeer-review

7 Citations (Scopus)

Abstract

The continuous emergence of new technical terms and the difficulty of keeping up with neologism in parallel corpora deteriorate the performance of statistical machine translation (SMT) systems. This paper explores the use of morphological information to improve English-to-Chinese translation for technical terms. To reduce the morphemelevel translation ambiguity, we group the morphemes into morpheme phrases and propose the use of domain information for translation candidate selection. In order to find correspondences of morpheme phrases between the source and target languages, we propose an algorithm to mine morpheme phrase translation pairs from a bilingual lexicon. We also build a cascaded translation model that dynamically shifts translation units from phrase level to word and morpheme phrase levels. The experimental results show the significant improvements over the current phrase-based SMT systems.

Original languageEnglish
Publication statusPublished - 2008 Dec 1
Event8th Biennial Conference of the Association for Machine Translation in the Americas, AMTA 2008 - Waikiki, HI, United States
Duration: 2008 Oct 212008 Oct 25

Other

Other8th Biennial Conference of the Association for Machine Translation in the Americas, AMTA 2008
CountryUnited States
CityWaikiki, HI
Period08/10/2108/10/25

ASJC Scopus subject areas

  • Language and Linguistics
  • Human-Computer Interaction
  • Software

Fingerprint Dive into the research topics of 'Improving English-to-Chinese translation for technical terms using morphological information'. Together they form a unique fingerprint.

Cite this