HMM-based speech synthesis using sub-band basis spectrum model

Yamato Ohtani, Masatsune Tamura, Masahiro Morita, Takehiko Kagoshima, Masami Akamine

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

In this paper, we propose HMM-based text-to-speech (TTS) using sub-band basis spectrum model (SBM). SBM can represent vocal tract spectra and phase char-acteristics by a linear combination of sub-band basis vectors. Some reports suggest that analysis-synthesized speech based on SBM is close to natural speech and SBM can perform effectively in TTS. Therefore, the SBM framework is expected to have good effects on HMM-based TTS by improving speech quality. Subjective ex-perimental results show that the proposed method im-proves speech quality in some conditions.

Original languageEnglish
Title of host publication13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Pages1438-1441
Number of pages4
Publication statusPublished - 2012
Event13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 - Portland, OR, United States
Duration: 2012 Sep 92012 Sep 13

Publication series

Name13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Volume2

Conference

Conference13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
CountryUnited States
CityPortland, OR
Period12/9/912/9/13

Keywords

  • Hidden Markov model
  • Phase feature
  • Speech synthesis
  • Sub-band basis spectrum model

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Communication

Fingerprint Dive into the research topics of 'HMM-based speech synthesis using sub-band basis spectrum model'. Together they form a unique fingerprint.

Cite this