A technique for controlling voice quality of synthetic speech using multiple regression HSMM

Makoto Tachibana, Takashi Nose, Junichi Yamagishi, Takao Kobayashi

研究成果: Conference contribution

8 被引用数 (Scopus)

抄録

This paper describes a technique for controlling voice quality of synthetic speech using multiple regression hidden semi-Markov model (HSMM). In the technique, we assume that the mean vectors of output and state duration distribution of HSMM are modeled by multiple regression with a parameter vector called voice quality control vector. We first choose three features for controlling voice qualities, that is, "smooth voice - nonsmooth voice," "warm - cold," "high-pitched - low-pitched," and then we attempt to control voice quality of synthetic speech for these features. From the results of several subjective tests, we show that the proposed technique can change these features of voice quality intuitively.

本文言語English
ホスト出版物のタイトルINTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP
出版社International Speech Communication Association
ページ2438-2441
ページ数4
5
ISBN(印刷版)9781604234497
出版ステータスPublished - 2006 1 1
外部発表はい
イベントINTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP - Pittsburgh, PA, United States
継続期間: 2006 9 172006 9 21

Other

OtherINTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP
国/地域United States
CityPittsburgh, PA
Period06/9/1706/9/21

ASJC Scopus subject areas

  • コンピュータ サイエンス(全般)

フィンガープリント

「A technique for controlling voice quality of synthetic speech using multiple regression HSMM」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル