TY - GEN
T1 - Novel scheme of real-time direction finding and tracking of multiple speakers by robot-embedded microphone array
AU - Su, Daobilige
AU - Sekikawa, Masashi
AU - Nakazawa, Kazuo
AU - Hamada, Nozomu
PY - 2013/1/1
Y1 - 2013/1/1
N2 - Recently, interest on artificial robot audition is growing for developing human-robot interaction. The main purposes of an artificial audio system mounted on mobile robot are localizing sound sources, separating speech signal that is relevant to a particular speaker such as robot's master, and processing speech sources to extract useful information such as master's uttering commands. This paper reports a novel proposed method of a speaker's direction tracking algorithm, and a realization of the real tracking system on a mobile robot. Basic approach of this study belongs to a category of direction finding known as sparseness-based one which employs time-frequency decomposition and disjoint property between different speech signals. The novel points in the proposed source tracking exist on a reliable data selection from time-frequency cells and the application of mean shift tracking to the kernel density estimator derived from these reliable time-frequency components. A wheel-based mobile robot is developed and built-in audio processing system. Experiments are conducted and demonstrate the ability to localize in real environments.
AB - Recently, interest on artificial robot audition is growing for developing human-robot interaction. The main purposes of an artificial audio system mounted on mobile robot are localizing sound sources, separating speech signal that is relevant to a particular speaker such as robot's master, and processing speech sources to extract useful information such as master's uttering commands. This paper reports a novel proposed method of a speaker's direction tracking algorithm, and a realization of the real tracking system on a mobile robot. Basic approach of this study belongs to a category of direction finding known as sparseness-based one which employs time-frequency decomposition and disjoint property between different speech signals. The novel points in the proposed source tracking exist on a reliable data selection from time-frequency cells and the application of mean shift tracking to the kernel density estimator derived from these reliable time-frequency components. A wheel-based mobile robot is developed and built-in audio processing system. Experiments are conducted and demonstrate the ability to localize in real environments.
KW - kernel density estimator
KW - mean shift tracking
KW - microphone array
KW - robot audition
KW - sound source localization
UR - http://www.scopus.com/inward/record.url?scp=84876209228&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84876209228&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-37374-9_44
DO - 10.1007/978-3-642-37374-9_44
M3 - Conference contribution
AN - SCOPUS:84876209228
SN - 9783642373732
T3 - Advances in Intelligent Systems and Computing
SP - 453
EP - 462
BT - An Edition of the Presented Papers from the 1st International Conference on Robot Intelligence Technology and Applications
PB - Springer Verlag
T2 - 1st International Conference on Robot Intelligence Technology and Applications, RiTA 2012
Y2 - 16 December 2012 through 18 December 2012
ER -