Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming

J. Heitkaemper, J. Heymann, R. Haeb-Umbach, in: ITG 2018, Oldenburg, Germany, 2018.

Conference Paper | English
Abstract
We present a block-online multi-channel front end for automatic speech recognition in noisy and reverberant environments. It is an online version of our previously proposed neural network supported acoustic beamformer, whose coefficients are calculated from the noise and speech spatial covariance matrices, which in turn are estimated using a neural mask estimator. However, because speech is sparse in the STFT domain, some frequency bins contain too few speech observations for a reliable initial estimate of the beamformer coefficients. We propose two methods to mitigate this issue. The first is to lower the frequency resolution of the STFT, which has the additional advantage of a shorter time window and thus lowers the latency introduced by block processing. The second is to smooth the beamforming coefficients along the frequency axis, exploiting their high inter-frequency correlation. Both approaches significantly reduce the gap between offline and block-online beamformer performance, as measured by the word error rate of a downstream speech recognizer. Experiments are carried out on two corpora, representing noisy (CHiME-4) and noisy reverberant (voiceHome) environments.
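The second approach can be illustrated with a minimal sketch. The abstract does not state the smoothing rule used in the paper, so the plain moving average below is only an assumed stand-in; the function name `smooth_along_frequency`, the parameter `half_width`, and the toy weights are all hypothetical. The sketch also assumes the weights of neighbouring bins are already phase-aligned (for example, normalized to a reference channel), since averaging complex weights with arbitrary per-bin phase would not be meaningful.

```python
def smooth_along_frequency(weights, half_width=1):
    """Moving-average smoothing of beamformer weights over neighbouring bins.

    weights: list of F frequency bins, each a list of C complex channel weights.
    half_width: number of neighbouring bins on each side included in the average
                (the window is truncated at the lowest and highest bin).
    """
    num_bins = len(weights)
    smoothed = []
    for f in range(num_bins):
        lo = max(0, f - half_width)
        hi = min(num_bins, f + half_width + 1)
        window = weights[lo:hi]
        num_channels = len(weights[f])
        # Average each channel's weight over the bins in the window.
        smoothed.append([
            sum(bin_w[c] for bin_w in window) / len(window)
            for c in range(num_channels)
        ])
    return smoothed


# Toy example (made-up values): 4 bins, 2 channels. Bin 2 is an outlier, as
# might happen when no speech was observed in that bin during initialisation.
weights = [
    [1 + 0j, 1 + 0j],
    [1 + 0j, 1 + 0j],
    [4 + 0j, -2 + 0j],
    [1 + 0j, 1 + 0j],
]
smoothed = smooth_along_frequency(weights, half_width=1)
```

In this toy case the outlier bin is pulled toward its neighbours, which is the effect the abstract attributes to inter-frequency correlation. A wider window smooths more strongly but also blurs genuine differences between bins.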
Publishing Year
2018
Proceedings Title
ITG 2018, Oldenburg, Germany

Cite this

Heitkaemper J, Heymann J, Haeb-Umbach R. Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming. In: ITG 2018, Oldenburg, Germany. ; 2018.
Heitkaemper, J., Heymann, J., & Haeb-Umbach, R. (2018). Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming. In ITG 2018, Oldenburg, Germany.
@inproceedings{Heitkaemper_Heymann_Haeb-Umbach_2018, title={Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming}, booktitle={ITG 2018, Oldenburg, Germany}, author={Heitkaemper, Jens and Heymann, Jahn and Haeb-Umbach, Reinhold}, year={2018} }
Heitkaemper, Jens, Jahn Heymann, and Reinhold Haeb-Umbach. “Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming.” In ITG 2018, Oldenburg, Germany, 2018.
J. Heitkaemper, J. Heymann, and R. Haeb-Umbach, “Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming,” in ITG 2018, Oldenburg, Germany, 2018.
Heitkaemper, Jens, et al. “Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming.” ITG 2018, Oldenburg, Germany, 2018.