A precise evaluation method of prosodic quality of non-native speakers using average voice and prosody substitution

Hafiyan Prafianto, Takashi Nose, Akinori Ito

Research output: Conference contribution

1 citation (Scopus)

Abstract

We propose a method to improve the consistency of human evaluation of non-native speakers' utterances, with the capability to evaluate features such as accent and rhythm. In this method, human evaluators rate the accent and the rhythm independently by using an average voice model and prosody substitution. We also investigated the advantages of evaluating those features independently. We found that, when the prosodic features are not evaluated independently, the accent scores are affected by the quality of the rhythm and vice versa. The correlation coefficient between the accent score and the rhythm score of identical utterances was 0.23 using the conventional method and -0.026 using the proposed method. This interdependence in the conventional method also leads to greater disagreement between the scores given by different evaluators. Using the conventional method, 23% of the evaluator pairs had an inter-evaluator correlation of the rhythm score greater than 0.5, while using the proposed method, 67% of the pairs had an inter-evaluator correlation greater than 0.5.
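The two statistics reported in the abstract can be illustrated with a minimal sketch. It assumes the Pearson correlation coefficient (the abstract does not say which coefficient was used), and all scores, evaluator names, and function names below are hypothetical, not data or code from the paper.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length score lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def fraction_of_pairs_above(scores_by_evaluator, threshold=0.5):
    """Fraction of evaluator pairs whose score correlation exceeds threshold."""
    pairs = list(combinations(scores_by_evaluator.values(), 2))
    above = sum(1 for a, b in pairs if pearson(a, b) > threshold)
    return above / len(pairs)


# Hypothetical rhythm scores: three evaluators rating the same five utterances.
rhythm_scores = {
    "evaluator_a": [1, 2, 3, 4, 5],
    "evaluator_b": [2, 2, 3, 5, 5],
    "evaluator_c": [5, 4, 3, 2, 1],
}

# Hypothetical accent scores from evaluator_a for the same five utterances.
accent_scores_a = [3, 1, 4, 2, 5]

# Correlation between accent and rhythm scores of identical utterances.
accent_rhythm_r = pearson(accent_scores_a, rhythm_scores["evaluator_a"])

# Share of evaluator pairs whose rhythm scores correlate above 0.5.
agreement = fraction_of_pairs_above(rhythm_scores)
```

In this reading, an accent-rhythm correlation near zero suggests the two features were scored independently, and a larger share of evaluator pairs above the threshold suggests more consistent ratings.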

Original language: English
Title of host publication: ICALIP 2016 - 2016 International Conference on Audio, Language and Image Processing - Proceedings
Editors: Fa-Long Luo, Xiaoqing Yu, Wanggen Wan
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 208-212
Number of pages: 5
ISBN (electronic): 9781509006533
DOI
Publication status: Published - 7 Feb 2017
Event: 5th International Conference on Audio, Language and Image Processing, ICALIP 2016 - Shanghai, China
Duration: 11 Jul 2016 to 12 Jul 2016

Publication series

Name: ICALIP 2016 - 2016 International Conference on Audio, Language and Image Processing - Proceedings

Other

Other: 5th International Conference on Audio, Language and Image Processing, ICALIP 2016
Country/Territory: China
City: Shanghai
Period: 11 Jul 2016 to 12 Jul 2016

ASJC Scopus subject areas

  • Software
  • Computer Vision and Pattern Recognition
  • Computer Graphics and Computer-Aided Design
