TY - GEN
T1 - Proposal of a sound source separation method using image signal processing of a spatio-temporal sound pressure distribution image
AU - Ozawa, Kenji
AU - Ito, Masaaki
AU - Shimizu, Genya
AU - Morise, Masanori
AU - Sakamoto, Shuichi
N1 - Funding Information:
This work was supported by JSPS KAKENHI Grant (JP16K06384) and the Cooperative Research Project (H28/A10) of the Research Institute of Electrical Communication, Tohoku University.
Publisher Copyright:
© (2018) by the Audio Engineering Society All rights reserved.
PY - 2018
Y1 - 2018
N2 - This paper proposes a sound source separation method using image signal processing and a microphone array. First, a spatio-temporal sound pressure distribution (STSPD) image is formed based on microphone outputs. Two-dimensional fast Fourier transform (2D FFT) transforms this image into a spectrum, in which sounds from different directions are separated into the components on different lines naturally. To separate sound sources, every line in the spectrum is extracted and 2D inverse FFT is applied. A method to restore a fine STSPD image from the sparse-microphone array is also proposed. Although the basic performance of the proposed method is comparable to a conventional delay and sum array, methods that are more sophisticated can be applied for improved performance.
AB - This paper proposes a sound source separation method using image signal processing and a microphone array. First, a spatio-temporal sound pressure distribution (STSPD) image is formed based on microphone outputs. Two-dimensional fast Fourier transform (2D FFT) transforms this image into a spectrum, in which sounds from different directions are separated into the components on different lines naturally. To separate sound sources, every line in the spectrum is extracted and 2D inverse FFT is applied. A method to restore a fine STSPD image from the sparse-microphone array is also proposed. Although the basic performance of the proposed method is comparable to a conventional delay and sum array, methods that are more sophisticated can be applied for improved performance.
UR - http://www.scopus.com/inward/record.url?scp=85060798646&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85060798646&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85060798646
T3 - Proceedings of the AES International Conference
SP - 148
EP - 154
BT - AES International Conference on Spatial Reproduction 2018
PB - Audio Engineering Society
T2 - AES International Conference on Spatial Reproduction 2018: Aesthetics and Science
Y2 - 7 August 2018 through 9 August 2018
ER -