Unsupervised training of neural mask-based beamforming

L. Drude, J. Heymann, R. Haeb-Umbach, in: INTERSPEECH 2019, Graz, Austria, 2019.

Download
OA 223.41 KB
Conference Paper | English
Abstract
We present an unsupervised training approach for a neural network-based mask estimator in an acoustic beamforming application. The network is trained to maximize a likelihood criterion derived from a spatial mixture model of the observations. It is trained from scratch without requiring any parallel data consisting of degraded input and clean training targets. Thus, training can be carried out on real recordings of noisy speech rather than simulated ones. In contrast to previous work on unsupervised training of neural mask estimators, our approach avoids the need for a possibly pre-trained teacher model entirely. We demonstrate the effectiveness of our approach by speech recognition experiments on two different datasets: one mainly deteriorated by noise (CHiME 4) and one by reverberation (REVERB). The results show that the performance of the proposed system is on par with a supervised system using oracle target masks for training and with a system trained using a model-based teacher.
Publishing Year
Proceedings Title
INTERSPEECH 2019, Graz, Austria
LibreCat-ID

Cite this

Drude L, Heymann J, Haeb-Umbach R. Unsupervised training of neural mask-based beamforming. In: INTERSPEECH 2019, Graz, Austria. ; 2019.
Drude, L., Heymann, J., & Haeb-Umbach, R. (2019). Unsupervised training of neural mask-based beamforming. In INTERSPEECH 2019, Graz, Austria.
@inproceedings{Drude_Heymann_Haeb-Umbach_2019, title={Unsupervised training of neural mask-based beamforming}, booktitle={INTERSPEECH 2019, Graz, Austria}, author={Drude, Lukas and Heymann, Jahn and Haeb-Umbach, Reinhold}, year={2019} }
Drude, Lukas, Jahn Heymann, and Reinhold Haeb-Umbach. “Unsupervised Training of Neural Mask-Based Beamforming.” In INTERSPEECH 2019, Graz, Austria, 2019.
L. Drude, J. Heymann, and R. Haeb-Umbach, “Unsupervised training of neural mask-based beamforming,” in INTERSPEECH 2019, Graz, Austria, 2019.
Drude, Lukas, et al. “Unsupervised Training of Neural Mask-Based Beamforming.” INTERSPEECH 2019, Graz, Austria, 2019.
All files available under the following license(s):
Creative Commons License:
CC0Creative Commons Public Domain Dedication (CC0 1.0)
Main File(s)
Access Level
OA Open Access
Last Uploaded
2019-08-13T06:41:35Z


Export

Marked Publications

Open Data LibreCat

Search this title in

Google Scholar