Mean shift based direction finding and separation of multiple sources using microphone array in the presence of spatial aliasing

Riyo Ishida, Masashi Sekikawa, Yasue Mitsukura, Nozomu Hamada

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper proposes a mean shift based method of blind source separation as well as source direction estimation through time-frequency (T-F) decomposition. The basic cue of T-F cell adopted here is the wave propagation vector and the method is applicable to arbitrary array configuration in 3-D space. In addition, it also applicable to wide extent array, thus the method cope with spatial aliasing problem. In order to solve such aliasing problem causing ambiguity of phase, norm normalization property of propagation direction vector is employed. By applying mean shift for both mode search and clustering of probability density estimation, efficient separation procedure is realized as a binary masking in T-F domain. The proposed approach is implemented in four microphones with wide sensor spacing where spatial aliasing may occur. Efficient separation results even for multiple sources are demonstrated.

Original languageEnglish
Title of host publication2015 10th Asian Control Conference: Emerging Control Techniques for a Sustainable World, ASCC 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Print)9781479978625
DOIs
Publication statusPublished - 2015 Sept 8
Event10th Asian Control Conference, ASCC 2015 - Kota Kinabalu, Malaysia
Duration: 2015 May 312015 Jun 3

Other

Other10th Asian Control Conference, ASCC 2015
Country/TerritoryMalaysia
CityKota Kinabalu
Period15/5/3115/6/3

Keywords

  • DOA estimation
  • kernel density estimation
  • mean shift
  • microphone array
  • source separation
  • spatial aliasing
  • time-frequency masking

ASJC Scopus subject areas

  • Control and Systems Engineering

Fingerprint

Dive into the research topics of 'Mean shift based direction finding and separation of multiple sources using microphone array in the presence of spatial aliasing'. Together they form a unique fingerprint.

Cite this