A study on minimum spectral error analysis of speech

Takuma Hayasaka, Takashi Nose, Akinori Ito

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Conventional source-filter vocoders such as WORLD can synthesize speech quickly. However, the quality of the synthetic speech is degraded by errors in speech parameter extraction. This paper therefore proposes minimum spectral error analysis, a speech analysis method that extracts speech parameters using Analysis-by-Synthesis (A-b-S), to improve the quality of speech synthesized by WORLD. The speech parameters are updated to minimize the error between the amplitude spectra of natural and synthetic speech. To perform this analysis, we developed a process for calculating the amplitude spectrum of synthetic speech from the speech parameters. A preliminary experiment shows that this calculation process was constructed successfully.
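
The update loop described in the abstract can be illustrated with a small sketch. This is not the authors' method and not WORLD's synthesis procedure: it assumes a toy source-filter model in which the synthetic amplitude spectrum is a per-bin envelope gain multiplied by the excitation amplitude, and it uses plain gradient descent on a squared error. All function names, the step size, and the test signal are hypothetical choices made for illustration.

```python
import cmath
import math


def amplitude_spectrum(frame):
    """Naive DFT magnitude over the non-negative frequency bins of a real frame."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def synth_spectrum(envelope, excitation_amp):
    """Toy source-filter model (assumption): envelope gain times excitation amplitude."""
    return [h * e for h, e in zip(envelope, excitation_amp)]


def spectral_error(target_amp, synth_amp):
    """Squared error between two amplitude spectra."""
    return sum((a - s) ** 2 for a, s in zip(target_amp, synth_amp))


def analyze(target_amp, excitation_amp, steps=200, lr=0.1):
    """Analysis-by-synthesis loop: synthesize, compare, update the parameters."""
    envelope = [1.0] * len(target_amp)
    for _ in range(steps):
        synth = synth_spectrum(envelope, excitation_amp)
        # Gradient of (a_k - h_k * e_k)^2 with respect to h_k is -2 * e_k * (a_k - s_k).
        envelope = [
            h + lr * 2.0 * e * (a - s)
            for h, e, a, s in zip(envelope, excitation_amp, target_amp, synth)
        ]
    return envelope


# Hypothetical "natural" frame: a unit pulse shaped by a short decaying filter.
natural = [1.0, 0.5, 0.25, 0.0, 0.0, 0.0, 0.0, 0.0]
pulse = [1.0] + [0.0] * 7

target = amplitude_spectrum(natural)
excitation = amplitude_spectrum(pulse)

initial_error = spectral_error(target, synth_spectrum([1.0] * len(target), excitation))
envelope = analyze(target, excitation)
final_error = spectral_error(target, synth_spectrum(envelope, excitation))
```

In this toy setting the error shrinks geometrically because each bin is updated independently. A real vocoder couples the parameters through its synthesis procedure, which is why the abstract highlights the calculation of the synthetic amplitude spectrum from the speech parameters as the step that had to be developed.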

Original language: English
Title of host publication: 2020 IEEE 9th Global Conference on Consumer Electronics, GCCE 2020
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 362-363
Number of pages: 2
ISBN (Electronic): 9781728198026
Publication status: Published - 2020 Oct 13
Event: 9th IEEE Global Conference on Consumer Electronics, GCCE 2020 - Kobe, Japan
Duration: 2020 Oct 13 to 2020 Oct 16

Publication series

Name: 2020 IEEE 9th Global Conference on Consumer Electronics, GCCE 2020

Conference

Conference: 9th IEEE Global Conference on Consumer Electronics, GCCE 2020
Country: Japan
City: Kobe
Period: 20/10/13 to 20/10/16

Keywords

  • Analysis-by-Synthesis
  • source-filter model
  • speech analysis
  • vocoder

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering
  • Media Technology
  • Instrumentation
  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition
