TY - GEN
T1 - Separation of Two Sound Sources in the Same Direction by Image Signal Processing
AU - Ozawa, Kenji
AU - Koshimizu, Yuta
AU - Morise, Masanori
AU - Sakamoto, Shuichi
N1 - Funding Information:
Work supported by JSPS KAKENHI (JP16K06384) and the Cooperative Res. Project (H28/A10) of the Res. Inst. of Electr. Commun., Tohoku Univ.
Publisher Copyright:
© 2018 IEEE.
PY - 2018/12/12
Y1 - 2018/12/12
N2 - This paper proposes a novel method to separate a target sound from another noise in the same direction using a microphone array. Four microphones are arranged to focus the target sound, and their outputs are converted to a spatio-temporal sound pressure distribution (STSPD) image. The focused source forms vertical stripes in the image; thus, their two-dimensional fast Fourier transform (2D-FFT) spectral components are localized as the direct current (DC) components along the spatial axis. Meanwhile, a noise signal forms parabola stripes in the STSPD image and their 2D-FFT components are spread into the target spectrum. However, the leaked DC components can be estimated from the other components, and subtracted from the target spectrum. A computational experiment shows that the noise is suppressed by approximately 25 dB, when the positions of the target and noise sources are known.
AB - This paper proposes a novel method to separate a target sound from another noise in the same direction using a microphone array. Four microphones are arranged to focus the target sound, and their outputs are converted to a spatio-temporal sound pressure distribution (STSPD) image. The focused source forms vertical stripes in the image; thus, their two-dimensional fast Fourier transform (2D-FFT) spectral components are localized as the direct current (DC) components along the spatial axis. Meanwhile, a noise signal forms parabola stripes in the STSPD image and their 2D-FFT components are spread into the target spectrum. However, the leaked DC components can be estimated from the other components, and subtracted from the target spectrum. A computational experiment shows that the noise is suppressed by approximately 25 dB, when the positions of the target and noise sources are known.
KW - Spatio-temporal sound pressure distribution image
KW - Spectral subtraction
KW - Spherical wave
KW - Two-dimensional FFT
UR - http://www.scopus.com/inward/record.url?scp=85060289662&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85060289662&partnerID=8YFLogxK
U2 - 10.1109/GCCE.2018.8574514
DO - 10.1109/GCCE.2018.8574514
M3 - Conference contribution
AN - SCOPUS:85060289662
T3 - 2018 IEEE 7th Global Conference on Consumer Electronics, GCCE 2018
SP - 663
EP - 664
BT - 2018 IEEE 7th Global Conference on Consumer Electronics, GCCE 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 7th IEEE Global Conference on Consumer Electronics, GCCE 2018
Y2 - 9 October 2018 through 12 October 2018
ER -