Proposal of a sound source separation method using image signal processing of a spatio-temporal sound pressure distribution image

Kenji Ozawa, Masaaki Ito, Genya Shimizu, Masanori Morise, Shuichi Sakamoto

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

This paper proposes a sound source separation method using image signal processing and a microphone array. First, a spatio-temporal sound pressure distribution (STSPD) image is formed based on microphone outputs. Two-dimensional fast Fourier transform (2D FFT) transforms this image into a spectrum, in which sounds from different directions are separated into the components on different lines naturally. To separate sound sources, every line in the spectrum is extracted and 2D inverse FFT is applied. A method to restore a fine STSPD image from the sparse-microphone array is also proposed. Although the basic performance of the proposed method is comparable to a conventional delay and sum array, methods that are more sophisticated can be applied for improved performance.

Original languageEnglish
Title of host publicationAES International Conference on Spatial Reproduction 2018
Subtitle of host publicationAesthetics and Science
PublisherAudio Engineering Society
Pages148-154
Number of pages7
ISBN (Electronic)9781510870406
Publication statusPublished - 2018 Jan 1
EventAES International Conference on Spatial Reproduction 2018: Aesthetics and Science - Tokyo, Japan
Duration: 2018 Aug 72018 Aug 9

Publication series

NameProceedings of the AES International Conference

Other

OtherAES International Conference on Spatial Reproduction 2018: Aesthetics and Science
CountryJapan
CityTokyo
Period18/8/718/8/9

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Acoustics and Ultrasonics

Fingerprint Dive into the research topics of 'Proposal of a sound source separation method using image signal processing of a spatio-temporal sound pressure distribution image'. Together they form a unique fingerprint.

  • Cite this

    Ozawa, K., Ito, M., Shimizu, G., Morise, M., & Sakamoto, S. (2018). Proposal of a sound source separation method using image signal processing of a spatio-temporal sound pressure distribution image. In AES International Conference on Spatial Reproduction 2018: Aesthetics and Science (pp. 148-154). (Proceedings of the AES International Conference). Audio Engineering Society.