Emotional speech recognition based on style estimation and adaptation with multiple-regression HMM

Yusuke Ijima, Makoto Tachibana, Takashi Nose, Takao Kobayashi

Research output: Conference contribution

10 Citations (Scopus)

Abstract

This paper proposes a technique for emotional speech recognition that extracts paralinguistic information as well as the linguistic information contained in the speech signal. The technique is based on style estimation and style adaptation using a multiple-regression HMM. The recognition process consists of two stages. In the first stage, a style vector representing the emotional expression category of the input speech and the intensity of its variation is estimated on a sentence-by-sentence basis. In the second stage, the acoustic models are adapted using the estimated style vector, and standard HMM-based speech recognition is performed. We assess the performance of the proposed technique on the recognition of acted emotional speech uttered by both professional narrators and non-professional speakers, and show the effectiveness of the technique.
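The two stages described in the abstract can be illustrated with a minimal sketch. The abstract gives no equations, so everything below is an assumption: each Gaussian mean is taken to be a linear function of the style vector, mu = b + A v (a common multiple-regression HMM form), covariances are diagonal, and the frame-to-Gaussian alignment is fixed in advance. The function names and the data layout are hypothetical and are not taken from the paper.

```python
def adapt_mean(b, A, v):
    """Stage 2 (assumed form): adapted mean mu = b + A v for one Gaussian."""
    return [b_d + sum(a * s for a, s in zip(row, v)) for b_d, row in zip(b, A)]


def solve(M, y):
    """Solve M x = y by Gauss-Jordan elimination with partial pivoting."""
    n = len(y)
    aug = [row[:] + [rhs] for row, rhs in zip(M, y)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                f = aug[r][col] / aug[col][col]
                aug[r] = [x - f * p for x, p in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def estimate_style(frames, gaussians):
    """Stage 1 (assumed form): maximum-likelihood style vector for a sentence.

    frames    : list of (observation, gaussian_index) pairs, i.e. a fixed
                alignment (an assumption made to keep the sketch short).
    gaussians : list of dicts with keys "b" (bias), "A" (regression matrix,
                one row per feature dimension) and "var" (diagonal variances).

    Solves the weighted least-squares normal equations
        (sum A^T S^-1 A) v = sum A^T S^-1 (o - b).
    """
    L = len(gaussians[0]["A"][0])  # dimensionality of the style vector
    M = [[0.0] * L for _ in range(L)]
    y = [0.0] * L
    for o, g in frames:
        b, A, var = gaussians[g]["b"], gaussians[g]["A"], gaussians[g]["var"]
        for d in range(len(o)):
            w = 1.0 / var[d]
            for j in range(L):
                y[j] += w * A[d][j] * (o[d] - b[d])
                for k in range(L):
                    M[j][k] += w * A[d][j] * A[d][k]
    return solve(M, y)
```

In this sketch a sentence would first go through `estimate_style`, and the resulting vector would then be passed to `adapt_mean` for every Gaussian before ordinary HMM decoding. A full system would use state occupancy probabilities rather than a hard alignment.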

Original language: English
Title of host publication: 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing - Proceedings, ICASSP 2009
Pages: 4157-4160
Number of pages: 4
DOI
Publication status: Published - 2009
Externally published: Yes
Event: 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2009 - Taipei, Taiwan, Province of China
Duration: 19 Apr 2009 to 24 Apr 2009

Publication series

Name: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print): 1520-6149

Other

Other: 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2009
Country/Territory: Taiwan, Province of China
City: Taipei
Period: 19 Apr 2009 to 24 Apr 2009

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering
