Construction and analysis of phonetically and prosodically balanced emotional speech database

Emika Takeishi, Takashi Nose, Yuya Chiba, Akinori Ito

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

11 Citations (Scopus)

Abstract

We designed an emotional speech database that can be used for emotion recognition as well as for the recognition and synthesis of speech with various emotions. The database was built by collecting tweets from Twitter and selecting emotion-dependent tweets with phonetic and prosodic balance in mind. We classified the gathered tweets into four emotion categories (joy, anger, sadness, and neutral) and then selected 50 sentences per emotion using an entropy-based algorithm. We compared the selected sentence sets with randomly selected sentence sets in terms of phonetic balance, prosodic balance, and sentence length, and confirmed that the sets selected by the algorithm were better balanced. Next, we recorded emotional speech based on the selected sentences. Finally, we evaluated the recorded speech from the viewpoints of emotion recognition and emotional speech recognition.
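The abstract does not spell out the entropy-based selection algorithm, so the following is only a minimal sketch of one common way such a selection could work: greedily adding the sentence that most increases the Shannon entropy of the pooled unit distribution. The function names `entropy` and `select_balanced`, the greedy strategy, and the representation of a sentence as a list of unit labels (for example phonemes) are all assumptions made for illustration, not the authors' method.

```python
import math
from collections import Counter


def entropy(counts):
    """Shannon entropy (in bits) of a Counter of unit frequencies."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total)
                for c in counts.values() if c > 0)


def select_balanced(sentences, k):
    """Greedily pick up to k sentences that maximize pooled-unit entropy.

    `sentences` is a list of unit sequences (hypothetical input format,
    e.g. one list of phoneme labels per sentence).
    Returns the indices of the chosen sentences in selection order.
    """
    chosen = []
    pooled = Counter()                 # unit counts of the sentences picked so far
    remaining = set(range(len(sentences)))
    for _ in range(min(k, len(sentences))):
        # Ties are broken in favor of the lowest index (sorted + max).
        best = max(sorted(remaining),
                   key=lambda i: entropy(pooled + Counter(sentences[i])))
        chosen.append(best)
        pooled.update(sentences[best])
        remaining.remove(best)
    return chosen


if __name__ == "__main__":
    toy = [["a", "a", "a"], ["a", "b"], ["c", "d"], ["a", "a"]]
    print(select_balanced(toy, 2))
```

In this toy setting, a sentence that introduces units not yet covered raises the entropy more than one that repeats frequent units, which is the intuition behind calling the resulting set "balanced". A prosodic variant could pool accent or mora-type labels instead of phonemes, again as an assumption.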

Original language: English
Title of host publication: 2016 Conference of the Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques, O-COCOSDA 2016
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 16-21
Number of pages: 6
ISBN (Electronic): 9781509035168
Publication status: Published - 2017 May 3
Event: 19th Annual Conference of the Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques, O-COCOSDA 2016 - Bali, Indonesia
Duration: 2016 Oct 26 → 2016 Oct 28

Keywords

  • emotion recognition
  • emotional speech database
  • emotional speech recognition
  • speech corpus

ASJC Scopus subject areas

  • Information Systems
  • Signal Processing
  • Information Systems and Management
  • Linguistics and Language
