Combinations of various language model technologies including data expansion and adaptation in spontaneous speech recognition

Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito

Research output: Contribution to journalConference articlepeer-review

6 Citations (Scopus)

Abstract

This paper demonstrates combinations of various language model (LM) technologies simultaneously, not only modeling techniques but also those for training data expansion based on external language resources and unsupervised adaptation for spontaneous speech recognition. Although forming combinations of various LM technologies has been examined, previous works focused on only modeling techniques. In fact, the previous works did not consider other important functionalities in practical spontaneous language modeling; a use of external language resources and an unsupervised LM adaptation. There- fore, our examination employs not only manual transcriptions of target domain speech but also out-of-domain text resources for spontaneous language modeling. In addition, the unsupervised LM adaptation based on multi-pass decoding is aggres- sively introduced to the combination. Our experimental results show a significant word error rate reduction by combining various technologies compared to using each technology individu- ally in Japanese spontaneous speech recognition task. Further-more, we also reveal relationships between the technologies.

Original languageEnglish
Pages (from-to)463-467
Number of pages5
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume2015-January
Publication statusPublished - 2015 Jan 1
Event16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015 - Dresden, Germany
Duration: 2015 Sep 62015 Sep 10

Keywords

  • Combination of various language model technologies
  • External language resources
  • Spontaneous language modeling
  • Unsupervised adaptation

ASJC Scopus subject areas

  • Language and Linguistics
  • Human-Computer Interaction
  • Signal Processing
  • Software
  • Modelling and Simulation

Fingerprint Dive into the research topics of 'Combinations of various language model technologies including data expansion and adaptation in spontaneous speech recognition'. Together they form a unique fingerprint.

Cite this