Influence of large system latency of virtual auditory display on behavior of head movement in sound localization task

Satoshi Yairi, Yukio Iwaya, Yoiti Suzuki

Research output: Contribution to journalArticlepeer-review

11 Citations (Scopus)

Abstract

Virtual Auditory Display (VAD) is expected to be applied as a new communication tool in the near future. However, in computer network communication, a large latency may sometimes occur. Therefore, a listener's head movement and localization accuracy during sound localization tasks in which system latency (SL) of VAD up to 2 s were investigated. Listeners were asked to localize a virtual sound source and to face the direction of the perceived sound image. A virtual sound source with one of seven kinds of different system latencies (12, 50, 100, 200, 500, 1000 and 2000 ms) was presented to a listener. A software VAD system on a Linux personal computer developed by the authors (SL: 12 ms) was used in the experiments. While the detection threshold of SL is about 75 ms, no significant influence of the accuracy of sound localization was observed with the tested SLs. This result agreed with previous studies. On the other hand, the time needed for the sound localization increased as the SL increased. Moreover, a remarkable overshoot was observed in the listener's head movement, particularly when the system latency was greater than 500 ms. It was found that the overshoot might make the time needed for the localization increase in proportion to twice the duration of SL. Consequently, it is important to keep the SL of VAD smaller than 500 ms, even if there is a large latency in network communications.

Original languageEnglish
Pages (from-to)1016-1023
Number of pages8
JournalActa Acustica united with Acustica
Volume94
Issue number6
DOIs
Publication statusPublished - 2008 Nov 1

ASJC Scopus subject areas

  • Music
  • Acoustics and Ultrasonics

Fingerprint

Dive into the research topics of 'Influence of large system latency of virtual auditory display on behavior of head movement in sound localization task'. Together they form a unique fingerprint.

Cite this