Abstract
This study investigated effects, on a speech intelligibility, of asynchronicity between a speech signal and a talker's moving image induced by time-expansion of the speech signal. First, a word intelligibility test (Exp. 1) was administered to younger listeners. Words were processed using STRAIGHT software to expand the speech signal by 0 to 400 ms. The word intelligibility test was administered under three conditions: visual-only, auditory-only, and auditory-visual (AV) conditions. Results showed that intelligibility scores under the AV condition were statistically higher than those under the auditory-only condition, even when the speech signal was expanded by 400 ms. Second, a sentence intelligibility test (Exp. 2) was administered to older adults. For all sentences, each phrase was expanded by 0 to 400 ms. This test was administered under the same conditions as those used for Exp. 1. Results showed that sentence intelligibility scores under the AV condition were statistically higher than those under the audio-only condition when the length of expansion was less than or equal to 200 ms. The results of Exp. 1 and Exp. 2 suggest that the talker's moving image is effective to enhance speech intelligibility if the lag between the speech signal and the talker's moving image is less than or equal to 200 ms.
Original language | English |
---|---|
Publication status | Published - 2007 |
Event | 2007 International Conference on Auditory-Visual Speech Processing, AVSP 2007 - Hilvarenbeek, Netherlands Duration: 2007 Aug 31 → 2007 Sept 3 |
Conference
Conference | 2007 International Conference on Auditory-Visual Speech Processing, AVSP 2007 |
---|---|
Country/Territory | Netherlands |
City | Hilvarenbeek |
Period | 07/8/31 → 07/9/3 |
Keywords
- audio-visual integration
- speech rate conversion
- word or sentence intelligibility
ASJC Scopus subject areas
- Language and Linguistics
- Speech and Hearing
- Otorhinolaryngology