TY - GEN
T1 - Muting machine speech using audio watermarking
AU - Ito, Akinori
N1 - Funding Information:
Part of this work was supported by JSPS Kakenhi JP17H00823.
Publisher Copyright:
© Springer Nature Switzerland AG 2019.
PY - 2019
Y1 - 2019
N2 - Spoken dialog systems have become popular and are used in home environments, for example as smart speakers. A problem occurs when two or more smart speakers share the same environment: one dialog system mistakes another dialog system's voice for a user's voice. This paper proposes a method for muting synthesized speech so that a speech recognizer does not recognize speech uttered by a machine. An audio watermark indicates that the speech was uttered by a machine, and the speech recognizer attenuates the observed speech if it contains the watermark. The watermark is embedded in the high-frequency band so that humans cannot perceive it and so that it can be extracted robustly. The experimental results show that the proposed method robustly determines the existence of the watermark when the SNR is no less than 0 dB.
AB - Spoken dialog systems have become popular and are used in home environments, for example as smart speakers. A problem occurs when two or more smart speakers share the same environment: one dialog system mistakes another dialog system's voice for a user's voice. This paper proposes a method for muting synthesized speech so that a speech recognizer does not recognize speech uttered by a machine. An audio watermark indicates that the speech was uttered by a machine, and the speech recognizer attenuates the observed speech if it contains the watermark. The watermark is embedded in the high-frequency band so that humans cannot perceive it and so that it can be extracted robustly. The experimental results show that the proposed method robustly determines the existence of the watermark when the SNR is no less than 0 dB.
KW - Muting
KW - Spoken dialog systems
KW - Watermarking
UR - http://www.scopus.com/inward/record.url?scp=85057105458&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85057105458&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-03748-2_9
DO - 10.1007/978-3-030-03748-2_9
M3 - Conference contribution
AN - SCOPUS:85057105458
SN - 9783030037475
T3 - Smart Innovation, Systems and Technologies
SP - 74
EP - 81
BT - Recent Advances in Intelligent Information Hiding and Multimedia Signal Processing - Proceeding of the Fourteenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing
A2 - Jain, Lakhmi C.
A2 - Tsai, Pei-Wei
A2 - Ito, Akinori
A2 - Pan, Jeng-Shyang
PB - Springer Science and Business Media Deutschland GmbH
T2 - 14th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2018
Y2 - 26 November 2018 through 28 November 2018
ER -