Detailed description of triphone model using SSS-free algorithm

Motoyuki Suzuki, Daisuke Honma, Akinori Ito, Shozo Makino

Research output: Contribution to journal › Conference article › peer-review

Abstract

The triphone model is frequently used as an acoustic model. It is effective for modeling phonetic variations caused by coarticulation. However, the acoustic features of phonemes are known to be affected by other factors as well, such as speaking style and speaking rate. In this paper, a new acoustic model is proposed. All training data that share the same phoneme context are automatically grouped into several clusters based on acoustic similarity, and a "sub-triphone" is trained on the training data belonging to each cluster. In experiments, the sub-triphone model achieved about 5% higher phoneme accuracy than the triphone model.
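
The clustering step described in the abstract can be illustrated with a minimal sketch. It is not the authors' SSS-free algorithm: plain k-means stands in for it, each training sample is assumed to be a fixed-length feature vector tagged with its triphone label, and the function names and the `label#cluster` naming scheme are hypothetical.

```python
from collections import defaultdict
from math import dist  # Python 3.8+


def kmeans(vectors, k, iters=20):
    """Plain k-means with deterministic farthest-first initialisation.

    Stand-in for the paper's SSS-free clustering (an assumption).
    """
    k = min(k, len(vectors))
    centers = [vectors[0]]
    while len(centers) < k:
        # Next center: the vector farthest from its nearest existing center.
        centers.append(
            max(vectors, key=lambda v: min(dist(v, c) for c in centers))
        )
    assign = [0] * len(vectors)
    for _ in range(iters):
        # Assign every vector to its nearest center.
        assign = [
            min(range(k), key=lambda j: dist(v, centers[j])) for v in vectors
        ]
        # Move each center to the mean of its members.
        for j in range(k):
            members = [v for v, a in zip(vectors, assign) if a == j]
            if members:  # keep the old center if a cluster is empty
                centers[j] = tuple(sum(x) / len(members) for x in zip(*members))
    return assign


def split_into_sub_triphones(samples, k=2):
    """Group samples by triphone context, then cluster each group acoustically.

    Returns a dict mapping a hypothetical "label#cluster" name to the
    feature vectors that would train that sub-triphone.
    """
    by_context = defaultdict(list)
    for label, vec in samples:
        by_context[label].append(tuple(vec))
    sub_units = defaultdict(list)
    for label, vectors in by_context.items():
        for vec, cluster in zip(vectors, kmeans(vectors, k)):
            sub_units[f"{label}#{cluster}"].append(vec)
    return dict(sub_units)
```

Each resulting group would then be used to train one sub-triphone model. A context with fewer samples than `k` simply yields fewer sub-units.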

Original language: English
Pages (from-to): 1399-1402
Number of pages: 4
Journal: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Publication status: Published - 2009 Nov 26
Event: 10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009 - Brighton, United Kingdom
Duration: 2009 Sep 6 → 2009 Sep 10

Keywords

  • HMnet
  • SSS-free
  • Sub-triphone model
  • Triphone

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Signal Processing
  • Software
  • Sensory Systems
