Blind speech separation employing directional statistics in an Expectation Maximization framework

D.H. Tran Vu, R. Haeb-Umbach, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010), 2010, pp. 241–244.

Conference Paper | English
Author
Tran Vu, Dang Hai; Haeb-Umbach, ReinholdLibreCat
Abstract
In this paper we propose to employ directional statistics in a complex vector space to approach the problem of blind speech separation in the presence of spatially correlated noise. We interpret the values of the short time Fourier transform of the microphone signals to be draws from a mixture of complex Watson distributions, a probabilistic model which naturally accounts for spatial aliasing. The parameters of the density are related to the a priori source probabilities, the power of the sources and the transfer function ratios from sources to sensors. Estimation formulas are derived for these parameters by employing the Expectation Maximization (EM) algorithm. The E-step corresponds to the estimation of the source presence probabilities for each time-frequency bin, while the M-step leads to a maximum signal-to-noise ratio (MaxSNR) beamformer in the presence of uncertainty about the source activity. Experimental results are reported for an implementation in a generalized sidelobe canceller (GSC) like spatial beamforming configuration for 3 speech sources with significant coherent noise in reverberant environments, demonstrating the usefulness of the novel modeling framework.
Publishing Year
Proceedings Title
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010)
Page
241-244
LibreCat-ID

Cite this

Tran Vu DH, Haeb-Umbach R. Blind speech separation employing directional statistics in an Expectation Maximization framework. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010). ; 2010:241-244. doi:10.1109/ICASSP.2010.5495994
Tran Vu, D. H., & Haeb-Umbach, R. (2010). Blind speech separation employing directional statistics in an Expectation Maximization framework. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010) (pp. 241–244). https://doi.org/10.1109/ICASSP.2010.5495994
@inproceedings{Tran Vu_Haeb-Umbach_2010, title={Blind speech separation employing directional statistics in an Expectation Maximization framework}, DOI={10.1109/ICASSP.2010.5495994}, booktitle={IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010)}, author={Tran Vu, Dang Hai and Haeb-Umbach, Reinhold}, year={2010}, pages={241–244} }
Tran Vu, Dang Hai, and Reinhold Haeb-Umbach. “Blind Speech Separation Employing Directional Statistics in an Expectation Maximization Framework.” In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010), 241–44, 2010. https://doi.org/10.1109/ICASSP.2010.5495994.
D. H. Tran Vu and R. Haeb-Umbach, “Blind speech separation employing directional statistics in an Expectation Maximization framework,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010), 2010, pp. 241–244.
Tran Vu, Dang Hai, and Reinhold Haeb-Umbach. “Blind Speech Separation Employing Directional Statistics in an Expectation Maximization Framework.” IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010), 2010, pp. 241–44, doi:10.1109/ICASSP.2010.5495994.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]

Link(s) to Main File(s)
Access Level
Restricted Closed Access

Export

Marked Publications

Open Data LibreCat

Search this title in

Google Scholar