Source Counting in Speech Mixtures Using a Variational EM Approach for Complexwatson Mixture Models

Drude, Lukas; Chinaev, Aleksej; Tran Vu, Dang Hai; Haeb-Umbach, Reinhold

Source Counting in Speech Mixtures Using a Variational EM Approach for Complexwatson Mixture Models

L. Drude, A. Chinaev, D.H. Tran Vu, R. Haeb-Umbach, in: 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014), 2014.

Download (ext.)

https://groups.uni-paderborn.de/nt/pubs/2014/DrChTrHa2014.pdf

Conference Paper | English

Author

Drude, Lukas^LibreCat; Chinaev, Aleksej; Tran Vu, Dang Hai; Haeb-Umbach, Reinhold^LibreCat

Department

Nachrichtentechnik (NT) / Heinz Nixdorf Institut

Abstract

"In this contribution we derive a variational EM (VEM) algorithm for model selection in complex Watson mixture models, which have been recently proposed as a model of the distribution of normalized microphone array signals in the short-time Fourier transform domain. The VEM algorithm is applied to count the number of active sources in a speech mixture by iteratively estimating the mode vectors of the Watson distributions and suppressing the signals from the corresponding directions. A key theoretical contribution is the derivation of the MMSE estimate of a quadratic form involving the mode vector of the Watson distribution. The experimental results demonstrate the effectiveness of the source counting approach at moderately low SNR. It is further shown that the VEM algorithm is more robust w.r.t. used threshold values."

Publishing Year

2014

Proceedings Title

39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014)

LibreCat-ID

11752

Cite this

Drude L, Chinaev A, Tran Vu DH, Haeb-Umbach R. Source Counting in Speech Mixtures Using a Variational EM Approach for Complexwatson Mixture Models. In: 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014). ; 2014.

Drude, L., Chinaev, A., Tran Vu, D. H., & Haeb-Umbach, R. (2014). Source Counting in Speech Mixtures Using a Variational EM Approach for Complexwatson Mixture Models. In 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014).

@inproceedings{Drude_Chinaev_Tran Vu_Haeb-Umbach_2014, title={Source Counting in Speech Mixtures Using a Variational EM Approach for Complexwatson Mixture Models}, booktitle={39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014)}, author={Drude, Lukas and Chinaev, Aleksej and Tran Vu, Dang Hai and Haeb-Umbach, Reinhold}, year={2014} }

Drude, Lukas, Aleksej Chinaev, Dang Hai Tran Vu, and Reinhold Haeb-Umbach. “Source Counting in Speech Mixtures Using a Variational EM Approach for Complexwatson Mixture Models.” In 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014), 2014.

L. Drude, A. Chinaev, D. H. Tran Vu, and R. Haeb-Umbach, “Source Counting in Speech Mixtures Using a Variational EM Approach for Complexwatson Mixture Models,” in 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014), 2014.

Drude, Lukas, et al. “Source Counting in Speech Mixtures Using a Variational EM Approach for Complexwatson Mixture Models.” 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014), 2014.

All files available under the following license(s):

Copyright Statement: