TY - GEN
T1 - Building a bilingual lexicon using phrase-based statistical machine translation via a pivot language
AU - Tsunakawa, Takashi
AU - Okazaki, Naoaki
AU - Tsujii, Jun'ichi
PY - 2008/12/1
Y1 - 2008/12/1
N2 - This paper proposes a novel method for building a bilingual lexicon through a pivot language by using phrase-based statistical machine translation (SMT). Given two bilingual lexicons between language pairs Lf-Lp and Lp-Le, we assume these lexicons as parallel corpora. Then, we merge the extracted two phrase tables into one phrase table between Lf and Le. Finally, we construct a phrase-based SMT system for translating the terms in the lexicon Lf-Lp into terms of Le and, obtain a new lexicon Lf-Le. In our experiments with Chinese-English and Japanese-English lexicons, our system could cover 72.8% of Chinese terms and drastically improve the utilization ratio.
AB - This paper proposes a novel method for building a bilingual lexicon through a pivot language by using phrase-based statistical machine translation (SMT). Given two bilingual lexicons between language pairs Lf-Lp and Lp-Le, we assume these lexicons as parallel corpora. Then, we merge the extracted two phrase tables into one phrase table between Lf and Le. Finally, we construct a phrase-based SMT system for translating the terms in the lexicon Lf-Lp into terms of Le and, obtain a new lexicon Lf-Le. In our experiments with Chinese-English and Japanese-English lexicons, our system could cover 72.8% of Chinese terms and drastically improve the utilization ratio.
UR - http://www.scopus.com/inward/record.url?scp=80053414847&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80053414847&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:80053414847
SN - 9781905593446
T3 - Coling 2008 - 22nd International Conference on Computational Linguistics, Proceedings of the Conference
SP - 127
EP - 130
BT - Coling 2008 - 22nd International Conference on Computational Linguistics, Proceedings of the Conference
T2 - 22nd International Conference on Computational Linguistics, Coling 2008
Y2 - 18 August 2008 through 22 August 2008
ER -