Vector quantization of speech signals using principal component analysis

Minoru Kohata, Hideaki Sone, Hiroshi Echigo, Tasuku Takagis

Research output: Contribution to journalArticle

Abstract

A new method of designing a vector quantizer is presented for bandwidth compression of speech signals, and some experimental results are shown. In this method, spectral parameters are extracted first from the DFT spectrum of input speech signals using a psychological frequency scale‐the so‐called Mel scale. This parameter is called the Mel‐scaled spectrum. The number of Mel‐scaled spectrum is reduced and the cepstrum of this parameter is calculated. Then this Mel‐scaled cepstrum is vector quantized and the codebook‐vector of the vector quantizer is determined by the algorithm using principal component analysis. With this algorithm, codebook‐vectors can be designed considering the statistical characteristics of the Mel‐scaled cepstrum. Also, the reduction of parameters by Mel‐scale can decrease the size of the codebook memory without greatly degrading the synthesized speech quality. Using the forementioned method two codebooks are designed: one contains 256 vectors, and the other contains 2048 vectors. The quantization error is compared with those designed by the well‐known LBG algorithm. The simulation results show that the codebooks designed by the proposed method present less quantization error and degradation of synthesized speech quality than those designed by LBG algorithm.

Original languageEnglish
Pages (from-to)16-26
Number of pages11
JournalElectronics and Communications in Japan (Part I: Communications)
Volume70
Issue number5
DOIs
Publication statusPublished - 1987

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'Vector quantization of speech signals using principal component analysis'. Together they form a unique fingerprint.

  • Cite this