Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation
L. Drude, R. Haeb-Umbach, IEEE Journal of Selected Topics in Signal Processing (2019).
Download
IEEE Jounal_2019_Drude_Paper.pdf
967.42 KB
Journal Article
| English
Abstract
We formulate a generic framework for blind source separation (BSS), which allows integrating data-driven spectro-temporal methods, such as deep clustering and deep attractor networks, with physically motivated probabilistic spatial methods, such as complex angular central Gaussian mixture models. The integrated model exploits the complementary strengths of the two approaches to BSS: the strong modeling power of neural networks, which, however, is based on supervised learning, and the ease of unsupervised learning of the spatial mixture models whose few parameters can be estimated on as little as a single segment of a real mixture of speech. Experiments are carried out on both artificially mixed speech and true recordings of speech mixtures. The experiments verify that the integrated models consistently outperform the individual components. We further extend the models to cope with noisy, reverberant speech and introduce a cross-domain teacher–student training where the mixture model serves as the teacher to provide training targets for the student neural network.
Publishing Year
Journal Title
IEEE Journal of Selected Topics in Signal Processing
eISSN
LibreCat-ID
Cite this
Drude L, Haeb-Umbach R. Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation. IEEE Journal of Selected Topics in Signal Processing. 2019. doi:10.1109/JSTSP.2019.2912565
Drude, L., & Haeb-Umbach, R. (2019). Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation. IEEE Journal of Selected Topics in Signal Processing. https://doi.org/10.1109/JSTSP.2019.2912565
@article{Drude_Haeb-Umbach_2019, title={Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation}, DOI={10.1109/JSTSP.2019.2912565}, journal={IEEE Journal of Selected Topics in Signal Processing}, author={Drude, Lukas and Haeb-Umbach, Reinhold}, year={2019} }
Drude, Lukas, and Reinhold Haeb-Umbach. “Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation.” IEEE Journal of Selected Topics in Signal Processing, 2019. https://doi.org/10.1109/JSTSP.2019.2912565.
L. Drude and R. Haeb-Umbach, “Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation,” IEEE Journal of Selected Topics in Signal Processing, 2019.
Drude, Lukas, and Reinhold Haeb-Umbach. “Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation.” IEEE Journal of Selected Topics in Signal Processing, 2019, doi:10.1109/JSTSP.2019.2912565.
All files available under the following license(s):
Creative Commons Public Domain Dedication (CC0 1.0):
Main File(s)
File Name
IEEE Jounal_2019_Drude_Paper.pdf
967.42 KB
Access Level
Open Access
Last Uploaded
2019-08-14T07:11:22Z