Robust speaker direction estimation with particle filtering
E. Warsitz, R. Haeb-Umbach, in: IEEE Workshop on Multimedia Signal Processing (MMSP 2004), 2004, pp. 367–370.
Download (ext.)
Conference Paper
| English
Author
Warsitz, Ernst;
Haeb-Umbach, ReinholdLibreCat
Abstract
The paper is concerned with binaural signal processing for a bimodal human-robot interface with hearing and vision. The two microphone signals are processed to obtain an enhanced single-channel input signal for the subsequent speech recognizer and to localize the acoustic source, an important information for establishing a natural human-robot communication. We utilize a robust adaptive algorithm for filter-and-sum beamforming (FSB) and extract speaker direction information from the resulting FIR filter coefficients. Further, particle filtering is applied which conducts a nonlinear Bayesian tracking of speaker movement. Good location accuracy can be achieved even in highly reverberant environments. The results obtained outperform the conventional generalized cross correlation (GCC) method.
Keywords
bimodal human-robot interface;
binaural signal processing;
enhanced single-channel input signal;
filter-and-sum beamforming;
filtering theory;
FIR filter coefficient;
generalized cross correlation method;
microphones;
microphone signal;
nonlinear Bayesian tracking;
particle filtering;
robust adaptive algorithm;
robust speaker direction estimation;
signal processing;
speech enhancement;
speech recognition;
speech recognizer;
user interfaces
Publishing Year
Proceedings Title
IEEE Workshop on Multimedia Signal Processing (MMSP 2004)
Page
367-370
LibreCat-ID
Cite this
Warsitz E, Haeb-Umbach R. Robust speaker direction estimation with particle filtering. In: IEEE Workshop on Multimedia Signal Processing (MMSP 2004). ; 2004:367-370. doi:10.1109/MMSP.2004.1436569
Warsitz, E., & Haeb-Umbach, R. (2004). Robust speaker direction estimation with particle filtering. In IEEE Workshop on Multimedia Signal Processing (MMSP 2004) (pp. 367–370). https://doi.org/10.1109/MMSP.2004.1436569
@inproceedings{Warsitz_Haeb-Umbach_2004, title={Robust speaker direction estimation with particle filtering}, DOI={10.1109/MMSP.2004.1436569}, booktitle={IEEE Workshop on Multimedia Signal Processing (MMSP 2004)}, author={Warsitz, Ernst and Haeb-Umbach, Reinhold}, year={2004}, pages={367–370} }
Warsitz, Ernst, and Reinhold Haeb-Umbach. “Robust Speaker Direction Estimation with Particle Filtering.” In IEEE Workshop on Multimedia Signal Processing (MMSP 2004), 367–70, 2004. https://doi.org/10.1109/MMSP.2004.1436569.
E. Warsitz and R. Haeb-Umbach, “Robust speaker direction estimation with particle filtering,” in IEEE Workshop on Multimedia Signal Processing (MMSP 2004), 2004, pp. 367–370.
Warsitz, Ernst, and Reinhold Haeb-Umbach. “Robust Speaker Direction Estimation with Particle Filtering.” IEEE Workshop on Multimedia Signal Processing (MMSP 2004), 2004, pp. 367–70, doi:10.1109/MMSP.2004.1436569.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Link(s) to Main File(s)
Access Level
Closed Access