Bayesian Feature Enhancement for ASR of Noisy Reverberant Real-World Data

A. Krueger, O. Walter, V. Leutnant, R. Haeb-Umbach, in: Proc. Interspeech, Portland, USA, 2012.

Conference Paper | English
Author
Krueger, Alexander; Walter, Oliver; Leutnant, Volker; Haeb-Umbach, Reinhold
Abstract
In this contribution, we investigate the effectiveness of Bayesian feature enhancement (BFE) on a medium-sized recognition task containing real-world recordings of noisy reverberant speech. BFE employs a very coarse model of the acoustic impulse response (AIR) from the source to the microphone. This model has been shown to be effective when the speech to be recognized was generated by artificially convolving nonreverberant speech with a constant AIR. Here we demonstrate that the model is also appropriate for feature enhancement of true recordings of noisy reverberant speech. On the Multi-Channel Wall Street Journal Audio Visual corpus (MC-WSJ-AV), the word error rate is cut in half, to 41.9 percent, compared to the ETSI Standard Front-End, using the signal of a single distant microphone as input and a single recognition pass.
Publishing Year
2012
Proceedings Title
Proc. Interspeech
LibreCat-ID

Cite this

Krueger A, Walter O, Leutnant V, Haeb-Umbach R. Bayesian Feature Enhancement for ASR of Noisy Reverberant Real-World Data. In: Proc. Interspeech. Portland, USA; 2012.
Krueger, A., Walter, O., Leutnant, V., & Haeb-Umbach, R. (2012). Bayesian Feature Enhancement for ASR of Noisy Reverberant Real-World Data. In Proc. Interspeech. Portland, USA.
@inproceedings{Krueger_Walter_Leutnant_Haeb-Umbach_2012, place={Portland, USA}, title={Bayesian Feature Enhancement for ASR of Noisy Reverberant Real-World Data}, booktitle={Proc. Interspeech}, author={Krueger, Alexander and Walter, Oliver and Leutnant, Volker and Haeb-Umbach, Reinhold}, year={2012} }
Krueger, Alexander, Oliver Walter, Volker Leutnant, and Reinhold Haeb-Umbach. “Bayesian Feature Enhancement for ASR of Noisy Reverberant Real-World Data.” In Proc. Interspeech. Portland, USA, 2012.
A. Krueger, O. Walter, V. Leutnant, and R. Haeb-Umbach, “Bayesian Feature Enhancement for ASR of Noisy Reverberant Real-World Data,” in Proc. Interspeech, 2012.
Krueger, Alexander, et al. “Bayesian Feature Enhancement for ASR of Noisy Reverberant Real-World Data.” Proc. Interspeech, 2012.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]

Link(s) to Main File(s)
Access Level
Restricted Closed Access
