Source Counting in Speech Mixtures Using a Variational EM Approach for Complexwatson Mixture Models

L. Drude, A. Chinaev, D.H. Tran Vu, R. Haeb-Umbach, in: 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014), 2014.

Conference Paper | English
Author
Drude, LukasLibreCat; Chinaev, Aleksej; Tran Vu, Dang Hai; Haeb-Umbach, ReinholdLibreCat
Abstract
"In this contribution we derive a variational EM (VEM) algorithm for model selection in complex Watson mixture models, which have been recently proposed as a model of the distribution of normalized microphone array signals in the short-time Fourier transform domain. The VEM algorithm is applied to count the number of active sources in a speech mixture by iteratively estimating the mode vectors of the Watson distributions and suppressing the signals from the corresponding directions. A key theoretical contribution is the derivation of the MMSE estimate of a quadratic form involving the mode vector of the Watson distribution. The experimental results demonstrate the effectiveness of the source counting approach at moderately low SNR. It is further shown that the VEM algorithm is more robust w.r.t. used threshold values."
Publishing Year
Proceedings Title
39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014)
LibreCat-ID

Cite this

Drude L, Chinaev A, Tran Vu DH, Haeb-Umbach R. Source Counting in Speech Mixtures Using a Variational EM Approach for Complexwatson Mixture Models. In: 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014). ; 2014.
Drude, L., Chinaev, A., Tran Vu, D. H., & Haeb-Umbach, R. (2014). Source Counting in Speech Mixtures Using a Variational EM Approach for Complexwatson Mixture Models. In 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014).
@inproceedings{Drude_Chinaev_Tran Vu_Haeb-Umbach_2014, title={Source Counting in Speech Mixtures Using a Variational EM Approach for Complexwatson Mixture Models}, booktitle={39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014)}, author={Drude, Lukas and Chinaev, Aleksej and Tran Vu, Dang Hai and Haeb-Umbach, Reinhold}, year={2014} }
Drude, Lukas, Aleksej Chinaev, Dang Hai Tran Vu, and Reinhold Haeb-Umbach. “Source Counting in Speech Mixtures Using a Variational EM Approach for Complexwatson Mixture Models.” In 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014), 2014.
L. Drude, A. Chinaev, D. H. Tran Vu, and R. Haeb-Umbach, “Source Counting in Speech Mixtures Using a Variational EM Approach for Complexwatson Mixture Models,” in 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014), 2014.
Drude, Lukas, et al. “Source Counting in Speech Mixtures Using a Variational EM Approach for Complexwatson Mixture Models.” 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014), 2014.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]

Link(s) to Main File(s)
Access Level
Restricted Closed Access
External material:
Supplementary Material
Description
Poster

Export

Marked Publications

Open Data LibreCat

Search this title in

Google Scholar