Improving Pronunciation Clarity of Dysarthric Speech Using CycleGAN with Multiple Speakers

Shuhei Imai, Takashi Nose, Aoi Kanagaki, Satoshi Watanabe, Akinori Ito

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

In this paper, we propose a method that improves the pronunciation clarity of dysarthric speech using CycleGAN-based non-parallel voice conversion. The method uses CycleGAN to convert dysarthric speech into healthy speech. We considered both a single speaker and multiple speakers as the source of healthy speech. Subjective evaluations showed that using multiple speakers as healthy speech is effective.
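As a rough illustration of why CycleGAN suits non-parallel data, the sketch below computes the standard L1 cycle-consistency term over two unpaired sets of feature frames. It is a generic toy example, not the authors' implementation: the frame representation, the function names, and the affine stand-in generators (`toy_d2h`, `toy_h2d`, `toy_h2d_bad`) are all assumptions made for illustration, and the adversarial loss that a full CycleGAN also needs is omitted.

```python
from typing import Callable, List, Sequence

Frame = List[float]  # one acoustic feature vector (hypothetical representation)
Generator = Callable[[Frame], Frame]


def l1_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Mean absolute difference between two feature vectors."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def cycle_consistency_loss(
    dysarthric: Sequence[Frame],
    healthy: Sequence[Frame],
    g_d2h: Generator,
    g_h2d: Generator,
) -> float:
    """L1 cycle loss over two unpaired frame sets.

    Each frame is mapped to the other domain and back, then compared
    with itself, so no dysarthric/healthy frame pairing is required.
    """
    forward = sum(l1_distance(g_h2d(g_d2h(x)), x) for x in dysarthric) / len(dysarthric)
    backward = sum(l1_distance(g_d2h(g_h2d(y)), y) for y in healthy) / len(healthy)
    return forward + backward


# Toy stand-ins for trained generators (assumptions, not real models).
def toy_d2h(frame: Frame) -> Frame:
    return [2.0 * v + 1.0 for v in frame]


def toy_h2d(frame: Frame) -> Frame:
    return [(v - 1.0) / 2.0 for v in frame]  # exact inverse of toy_d2h


def toy_h2d_bad(frame: Frame) -> Frame:
    return [v / 2.0 for v in frame]  # not an inverse, so the cycle drifts


# Unpaired data: the two sets differ in size and content.
dysarthric_frames = [[0.5, -1.0], [0.0, 2.0]]
healthy_frames = [[3.0, 1.0]]

loss_consistent = cycle_consistency_loss(dysarthric_frames, healthy_frames, toy_d2h, toy_h2d)
loss_inconsistent = cycle_consistency_loss(dysarthric_frames, healthy_frames, toy_d2h, toy_h2d_bad)
print(loss_consistent, loss_inconsistent)
```

The loss is zero only when the two mappings undo each other, which is what lets training proceed without utterance-level alignment between dysarthric and healthy recordings.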

Original language: English
Title of host publication: 2020 IEEE 9th Global Conference on Consumer Electronics, GCCE 2020
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 366-367
Number of pages: 2
ISBN (Electronic): 9781728198026
Publication status: Published - 2020 Oct 13
Event: 9th IEEE Global Conference on Consumer Electronics, GCCE 2020 - Kobe, Japan
Duration: 2020 Oct 13 → 2020 Oct 16

Publication series

Name: 2020 IEEE 9th Global Conference on Consumer Electronics, GCCE 2020

Conference

Conference: 9th IEEE Global Conference on Consumer Electronics, GCCE 2020
Country: Japan
City: Kobe
Period: 20/10/13 → 20/10/16

Keywords

  • CycleGAN
  • Dysarthria
  • Pronunciation clarity
  • Voice conversion

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering
  • Media Technology
  • Instrumentation
  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition
