A Chinese-Japanese lexical machine translation through a Pivot language

Takashi Tsunakawa, Naoaki Okazaki, Xiao Liu, Junichi Tsujii

Research output: Contribution to journalArticlepeer-review

5 Citations (Scopus)

Abstract

The bilingual lexicon is an expensive but critical resource for multilingual applications in natural language processing. This article proposes an integrated framework for building a bilingual lexicon between the Chinese and Japanese languages. Since the language pair Chinese-Japanese does not include English, which is a central language of the world, few large-scale bilingual resources between Chinese and Japanese have been constructed. One solution to alleviate this problem is to build a Chinese-Japanese bilingual lexicon through English as the pivot language. In addition to the pivotal approach, we can make use of the characteristics of Chinese and Japanese languages that use Han characters. We incorporate a translation model obtained from a small Chinese-Japanese lexicon and use the similarity of the hanzi and kanji characters by using the log-linear model. Our experimental results show that the use of the pivotal approach can improve the translation performance over the translation model built from a small Chinese-Japanese lexicon. The results also demonstrate that the similarity between the hanzi and kanji characters provides a positive effect for translating technical terms.

Original languageEnglish
Article number9
JournalACM Transactions on Asian Language Information Processing
Volume8
Issue number2
DOIs
Publication statusPublished - 2009 May 1

Keywords

  • Bilingual lexicon
  • Han characters
  • Hanzi
  • Kanji
  • Pivot language
  • Statistical machine translation

ASJC Scopus subject areas

  • Computer Science(all)

Cite this