Evaluation of prosodic contextual factors for HMM-based speech synthesis

Shuji Yokomizo, Takashi Nose, Takao Kobayashi

研究成果: Conference contribution

10 被引用数 (Scopus)

抄録

This paper explores the effect of prosodic contextual factors for speech synthesis based on hidden Markov model (HMM). In the HMM-based speech synthesis, to model not only the phonetic features but also the prosodic ones, a variety of contextual factors are taken into account in the model training. In a baseline system, a lot of contextual factors are used, and the resultant cost for parameter tying by context clustering becomes relatively high compared to that in the speech recognition. We examine the choice of prosodic contexts by objective measures for English and Japanese speech data which have difference linguistic and prosodic characteristics. Experimental results show that more compact context sets give also comparable or close performance to the conventional full context.

本文言語English
ホスト出版物のタイトルProceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010
出版社International Speech Communication Association
ページ430-433
ページ数4
出版ステータスPublished - 2010
外部発表はい

出版物シリーズ

名前Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010

ASJC Scopus subject areas

  • 言語および言語学
  • 言語聴覚療法

フィンガープリント

「Evaluation of prosodic contextual factors for HMM-based speech synthesis」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル