Low latency and high quality two-stage human-voice-enhancement system for a hose-shaped rescue robot

Yoshiaki Bando, Hiroshi Saruwatari, Nobutaka Ono, Shoji Makino, Katustoshi Itoyama, Daichi Kitamura, Masaru Ishimura, Moe Takakusaki, Narumi Mae, Kouei Yamaoka, Yutaro Matsui, Yuichi Ambe, Masashi Konyo, Satoshi Tadokoro, Kazuyoshi Yoshii, Hiroshi G. Okuno

Research output: Contribution to journalArticle

7 Citations (Scopus)

Abstract

This paper presents the design and implementation of a two-stage human-voice enhancement system for a hose-shaped rescue robot. When a microphoneequipped hose-shaped robot is used to search for a victim under a collapsed building, human-voice enhancement is crucial because the sound captured by a microphone array is contaminated by the ego-noise of the robot. For achieving both low latency and high quality, our system combines online and offline human-voice enhancement, providing an overview first and then details on demand. The online enhancement is used for searching for a victim in real time, while the offline one facilitates scrutiny by listening to highly enhanced human voices. Our online enhancement is based on an online robust principal component analysis, and our offline enhancement is based on an independent lowrank matrix analysis. The two enhancement methods are integrated with Robot Operating System (ROS). Experimental results showed that both the online and offline enhancement methods outperformed conventional methods.

Original languageEnglish
Pages (from-to)198-212
Number of pages15
JournalJournal of Robotics and Mechatronics
Volume29
Issue number1
DOIs
Publication statusPublished - 2017 Feb

Keywords

  • Blind human-voice enhancement
  • Hose-shaped rescue robot
  • Robot audition
  • Search and rescue

ASJC Scopus subject areas

  • Computer Science(all)
  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'Low latency and high quality two-stage human-voice-enhancement system for a hose-shaped rescue robot'. Together they form a unique fingerprint.

  • Cite this

    Bando, Y., Saruwatari, H., Ono, N., Makino, S., Itoyama, K., Kitamura, D., Ishimura, M., Takakusaki, M., Mae, N., Yamaoka, K., Matsui, Y., Ambe, Y., Konyo, M., Tadokoro, S., Yoshii, K., & Okuno, H. G. (2017). Low latency and high quality two-stage human-voice-enhancement system for a hose-shaped rescue robot. Journal of Robotics and Mechatronics, 29(1), 198-212. https://doi.org/10.20965/jrm.2017.p0198