Novel scheme of real-time direction finding and tracking of multiple speakers by robot-embedded microphone array

Daobilige Su, Masashi Sekikawa, Kazuo Nakazawa, Nozomu Hamada

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Recently, interest on artificial robot audition is growing for developing human-robot interaction. The main purposes of an artificial audio system mounted on mobile robot are localizing sound sources, separating speech signal that is relevant to a particular speaker such as robot's master, and processing speech sources to extract useful information such as master's uttering commands. This paper reports a novel proposed method of a speaker's direction tracking algorithm, and a realization of the real tracking system on a mobile robot. Basic approach of this study belongs to a category of direction finding known as sparseness-based one which employs time-frequency decomposition and disjoint property between different speech signals. The novel points in the proposed source tracking exist on a reliable data selection from time-frequency cells and the application of mean shift tracking to the kernel density estimator derived from these reliable time-frequency components. A wheel-based mobile robot is developed and built-in audio processing system. Experiments are conducted and demonstrate the ability to localize in real environments.

Original languageEnglish
Title of host publicationAdvances in Intelligent Systems and Computing
Pages453-462
Number of pages10
Volume208 AISC
DOIs
Publication statusPublished - 2013
Event1st International Conference on Robot Intelligence Technology and Applications, RiTA 2012 - Gwangju, Korea, Republic of
Duration: 2012 Dec 162012 Dec 18

Publication series

NameAdvances in Intelligent Systems and Computing
Volume208 AISC
ISSN (Print)21945357

Other

Other1st International Conference on Robot Intelligence Technology and Applications, RiTA 2012
CountryKorea, Republic of
CityGwangju
Period12/12/1612/12/18

Fingerprint

Microphones
Mobile robots
Robots
Audio systems
Speech processing
Human robot interaction
Audition
Wheels
Acoustic waves
Decomposition
Processing
Experiments

Keywords

  • kernel density estimator
  • mean shift tracking
  • microphone array
  • robot audition
  • sound source localization

ASJC Scopus subject areas

  • Computer Science(all)
  • Control and Systems Engineering

Cite this

Su, D., Sekikawa, M., Nakazawa, K., & Hamada, N. (2013). Novel scheme of real-time direction finding and tracking of multiple speakers by robot-embedded microphone array. In Advances in Intelligent Systems and Computing (Vol. 208 AISC, pp. 453-462). (Advances in Intelligent Systems and Computing; Vol. 208 AISC). https://doi.org/10.1007/978-3-642-37374-9_44

Novel scheme of real-time direction finding and tracking of multiple speakers by robot-embedded microphone array. / Su, Daobilige; Sekikawa, Masashi; Nakazawa, Kazuo; Hamada, Nozomu.

Advances in Intelligent Systems and Computing. Vol. 208 AISC 2013. p. 453-462 (Advances in Intelligent Systems and Computing; Vol. 208 AISC).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Su, D, Sekikawa, M, Nakazawa, K & Hamada, N 2013, Novel scheme of real-time direction finding and tracking of multiple speakers by robot-embedded microphone array. in Advances in Intelligent Systems and Computing. vol. 208 AISC, Advances in Intelligent Systems and Computing, vol. 208 AISC, pp. 453-462, 1st International Conference on Robot Intelligence Technology and Applications, RiTA 2012, Gwangju, Korea, Republic of, 12/12/16. https://doi.org/10.1007/978-3-642-37374-9_44
Su D, Sekikawa M, Nakazawa K, Hamada N. Novel scheme of real-time direction finding and tracking of multiple speakers by robot-embedded microphone array. In Advances in Intelligent Systems and Computing. Vol. 208 AISC. 2013. p. 453-462. (Advances in Intelligent Systems and Computing). https://doi.org/10.1007/978-3-642-37374-9_44
Su, Daobilige ; Sekikawa, Masashi ; Nakazawa, Kazuo ; Hamada, Nozomu. / Novel scheme of real-time direction finding and tracking of multiple speakers by robot-embedded microphone array. Advances in Intelligent Systems and Computing. Vol. 208 AISC 2013. pp. 453-462 (Advances in Intelligent Systems and Computing).
@inproceedings{0f8f9c6bf45b4ee89f799fa31c65e9a3,
title = "Novel scheme of real-time direction finding and tracking of multiple speakers by robot-embedded microphone array",
abstract = "Recently, interest on artificial robot audition is growing for developing human-robot interaction. The main purposes of an artificial audio system mounted on mobile robot are localizing sound sources, separating speech signal that is relevant to a particular speaker such as robot's master, and processing speech sources to extract useful information such as master's uttering commands. This paper reports a novel proposed method of a speaker's direction tracking algorithm, and a realization of the real tracking system on a mobile robot. Basic approach of this study belongs to a category of direction finding known as sparseness-based one which employs time-frequency decomposition and disjoint property between different speech signals. The novel points in the proposed source tracking exist on a reliable data selection from time-frequency cells and the application of mean shift tracking to the kernel density estimator derived from these reliable time-frequency components. A wheel-based mobile robot is developed and built-in audio processing system. Experiments are conducted and demonstrate the ability to localize in real environments.",
keywords = "kernel density estimator, mean shift tracking, microphone array, robot audition, sound source localization",
author = "Daobilige Su and Masashi Sekikawa and Kazuo Nakazawa and Nozomu Hamada",
year = "2013",
doi = "10.1007/978-3-642-37374-9_44",
language = "English",
isbn = "9783642373732",
volume = "208 AISC",
series = "Advances in Intelligent Systems and Computing",
pages = "453--462",
booktitle = "Advances in Intelligent Systems and Computing",

}

TY - GEN

T1 - Novel scheme of real-time direction finding and tracking of multiple speakers by robot-embedded microphone array

AU - Su, Daobilige

AU - Sekikawa, Masashi

AU - Nakazawa, Kazuo

AU - Hamada, Nozomu

PY - 2013

Y1 - 2013

N2 - Recently, interest on artificial robot audition is growing for developing human-robot interaction. The main purposes of an artificial audio system mounted on mobile robot are localizing sound sources, separating speech signal that is relevant to a particular speaker such as robot's master, and processing speech sources to extract useful information such as master's uttering commands. This paper reports a novel proposed method of a speaker's direction tracking algorithm, and a realization of the real tracking system on a mobile robot. Basic approach of this study belongs to a category of direction finding known as sparseness-based one which employs time-frequency decomposition and disjoint property between different speech signals. The novel points in the proposed source tracking exist on a reliable data selection from time-frequency cells and the application of mean shift tracking to the kernel density estimator derived from these reliable time-frequency components. A wheel-based mobile robot is developed and built-in audio processing system. Experiments are conducted and demonstrate the ability to localize in real environments.

AB - Recently, interest on artificial robot audition is growing for developing human-robot interaction. The main purposes of an artificial audio system mounted on mobile robot are localizing sound sources, separating speech signal that is relevant to a particular speaker such as robot's master, and processing speech sources to extract useful information such as master's uttering commands. This paper reports a novel proposed method of a speaker's direction tracking algorithm, and a realization of the real tracking system on a mobile robot. Basic approach of this study belongs to a category of direction finding known as sparseness-based one which employs time-frequency decomposition and disjoint property between different speech signals. The novel points in the proposed source tracking exist on a reliable data selection from time-frequency cells and the application of mean shift tracking to the kernel density estimator derived from these reliable time-frequency components. A wheel-based mobile robot is developed and built-in audio processing system. Experiments are conducted and demonstrate the ability to localize in real environments.

KW - kernel density estimator

KW - mean shift tracking

KW - microphone array

KW - robot audition

KW - sound source localization

UR - http://www.scopus.com/inward/record.url?scp=84876209228&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84876209228&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-37374-9_44

DO - 10.1007/978-3-642-37374-9_44

M3 - Conference contribution

SN - 9783642373732

VL - 208 AISC

T3 - Advances in Intelligent Systems and Computing

SP - 453

EP - 462

BT - Advances in Intelligent Systems and Computing

ER -