A maximum likelihood approach to the detection of moments of maximum excitation and its application to high-quality speech parameterization

Ranniery Maia, Yannis Stylianou, Masami Akamine

研究成果: Conference article査読

1 被引用数 (Scopus)

抄録

This paper presents an algorithm to detect moments of maximum excitation (MME) in speech. It assumes a model in which speech can be represented as a sequence of pulses located at the MME convolved with a time-varying minimum-phase impulse response. By considering that in the glottal cycle speech concentrates more energy at the MME than at other instants, the locations and amplitudes of the excitation pulses are determined through maximum likelihood estimation. The suggested approach provides a fully automatic and consistent method for the detection of MME in speech without relying on ad hoc procedures which usually do not work well across different speech styles without a required amount of adjustments. Experiments with speech parameterization, in the context of complex cepstrum analysis and synthesis, have shown that the proposed MME-based processing can improve signal to error reconstruction ratio up to 10%, when compared to the use of glottal closure instant estimations provided by a well-known algorithm.

本文言語English
ページ(範囲)603-607
ページ数5
ジャーナルProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
2015-January
出版ステータスPublished - 2015 1月 1
外部発表はい
イベント16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015 - Dresden, Germany
継続期間: 2015 9月 62015 9月 10

ASJC Scopus subject areas

  • 言語および言語学
  • 人間とコンピュータの相互作用
  • 信号処理
  • ソフトウェア
  • モデリングとシミュレーション

フィンガープリント

「A maximum likelihood approach to the detection of moments of maximum excitation and its application to high-quality speech parameterization」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル