Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming

Heitkaemper, Jens; Heymann, Jahn; Haeb-Umbach, Reinhold

Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming

J. Heitkaemper, J. Heymann, R. Haeb-Umbach, in: ITG 2018, Oldenburg, Germany, 2018.

Download (ext.)

https://groups.uni-paderborn.de/nt/pubs/2018/ITG_2018_Heitkaemper_Paper.pdf

Conference Paper | English

Author

Heitkaemper, Jens^LibreCat; Heymann, Jahn^LibreCat; Haeb-Umbach, Reinhold^LibreCat

Department

Nachrichtentechnik (NT) / Heinz Nixdorf Institut

Abstract

We present a block-online multi-channel front end for automatic speech recognition in noisy and reverberated environments. It is an online version of our earlier proposed neural network supported acoustic beamformer, whose coefficients are calculated from noise and speech spatial covariance matrices which are estimated utilizing a neural mask estimator. However, the sparsity of speech in the STFT domain causes problems for the initial beamformer coefficients estimation in some frequency bins due to lack of speech observations. We propose two methods to mitigate this issue. The first is to lower the frequency resolution of the STFT, which comes with the additional advantage of a reduced time window, thus lowering the latency introduced by block processing. The second approach is to smooth beamforming coefficients along the frequency axis, thus exploiting their high interfrequency correlation. With both approaches the gap between offline and block-online beamformer performance, as measured by the word error rate achieved by a downstream speech recognizer, is significantly reduced. Experiments are carried out on two copora, representing noisy (CHiME-4) and noisy reverberant (voiceHome) environments.

Publishing Year

2018

Proceedings Title

ITG 2018, Oldenburg, Germany

LibreCat-ID

11837

Cite this

Heitkaemper J, Heymann J, Haeb-Umbach R. Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming. In: ITG 2018, Oldenburg, Germany. ; 2018.

Heitkaemper, J., Heymann, J., & Haeb-Umbach, R. (2018). Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming. In ITG 2018, Oldenburg, Germany.

@inproceedings{Heitkaemper_Heymann_Haeb-Umbach_2018, title={Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming}, booktitle={ITG 2018, Oldenburg, Germany}, author={Heitkaemper, Jens and Heymann, Jahn and Haeb-Umbach, Reinhold}, year={2018} }

Heitkaemper, Jens, Jahn Heymann, and Reinhold Haeb-Umbach. “Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming.” In ITG 2018, Oldenburg, Germany, 2018.

J. Heitkaemper, J. Heymann, and R. Haeb-Umbach, “Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming,” in ITG 2018, Oldenburg, Germany, 2018.

Heitkaemper, Jens, et al. “Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming.” ITG 2018, Oldenburg, Germany, 2018.

All files available under the following license(s):

Copyright Statement: