Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis

Ranniery Maia, Mark J.F. Gales, Yannis Stylianou, Masami Akamine

Research output: Contribution to journalConference articlepeer-review

4 Citations (Scopus)

Abstract

This paper presents an approach for complex cepstrum analysis based on the minimum mean squared error criterion, and describes its application to statistical parametric speech synthesis. The proposed method alleviates some of the issues associated with conventional complex cepstrum analysis, such as choice of the window, phase unwrapping, and the need for accurate pitch marks. Given initial estimates of warped complex cepstra and respective analysis instants, the method iteratively optimizes the complex cepstrum on a warped quefrency domain by minimizing the mean squared error between the natural and the reconstructed speech waveforms. When applied to statistical parametric speech synthesis, the optimized complex cepstrum results in better performance in terms of synthesized speech quality, specially for emotional databases, when compared with the complex cepstrum calculated through conventional methods.

Original languageEnglish
Pages (from-to)2336-2340
Number of pages5
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Publication statusPublished - 2013 Jan 1
Externally publishedYes
Event14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013 - Lyon, France
Duration: 2013 Aug 252013 Aug 29

Keywords

  • Cepstral analysis
  • Complex cepstrum
  • Speech synthesis
  • Statistical parametric speech synthesis

ASJC Scopus subject areas

  • Language and Linguistics
  • Human-Computer Interaction
  • Signal Processing
  • Software
  • Modelling and Simulation

Fingerprint Dive into the research topics of 'Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis'. Together they form a unique fingerprint.

Cite this