A Statistical Observation Model For Noisy Reverberant Speech Features and its Application to Robust ASR

Leutnant, Volker; Krueger, Alexander; Haeb-Umbach, Reinhold

V. Leutnant, A. Krueger, R. Haeb-Umbach, in: Signal Processing, Communications and Computing (ICSPCC), 2012 IEEE International Conference On, 2012.

Conference Paper | English

Author

Leutnant, Volker; Krueger, Alexander; Haeb-Umbach, Reinhold

Department

Nachrichtentechnik (NT) / Heinz Nixdorf Institut

Abstract

In this work, an observation model for the joint compensation of noise and reverberation in the logarithmic mel power spectral density domain is considered. It relates the features of the noisy reverberant speech to those of the non-reverberant speech and the noise. In contrast to enhancement of features only corrupted by reverberation (reverberant features), enhancement of noisy reverberant features requires a more sophisticated model for the error introduced by the proposed observation model. In a first consideration, it will be shown that this error is highly dependent on the instantaneous ratio of the power of reverberant speech to the power of the noise and, moreover, sensitive to the phase between reverberant speech and noise in the short-time discrete Fourier domain. Afterwards, a statistically motivated approach will be presented allowing for the model of the observation error to be inferred from the error model previously used for the reverberation only case. Finally, the developed observation error model will be utilized in a Bayesian feature enhancement scheme, leading to improvements in word accuracy on the AURORA5 database.
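The dependence of the observation error on the power ratio and on the phase can be illustrated with a minimal sketch. It is not taken from the paper: it assumes a single short-time DFT bin in which reverberant speech X and noise N add, so that |X + N|^2 = |X|^2 + |N|^2 + 2|X||N|cos(theta), and it measures the error made by the phase-blind approximation log(e^x + e^n), where x and n are the log powers. The function names and the single-bin setting are assumptions made for illustration; the paper itself works in the logarithmic mel power spectral density domain and develops a statistical model of this error.

```python
import math


def log_power_sum_error(x_log_power, n_log_power, phase):
    """Error of the phase-blind approximation for one DFT bin (illustrative).

    x_log_power: log power of the reverberant speech (hypothetical input)
    n_log_power: log power of the noise (hypothetical input)
    phase:       angle in radians between speech and noise in that bin

    Note: total cancellation (equal powers, phase = pi) gives log(0)
    and is deliberately not handled in this sketch.
    """
    px = math.exp(x_log_power)
    pn = math.exp(n_log_power)
    # Exact log power of the sum, including the cross term.
    exact = math.log(px + pn + 2.0 * math.sqrt(px * pn) * math.cos(phase))
    # Approximation that ignores the phase-dependent cross term.
    approx = math.log(px + pn)
    return exact - approx


def closed_form_error(x_log_power, n_log_power, phase):
    """Same error written as log(1 + cos(phase) / cosh((x - n) / 2))."""
    half_ratio = 0.5 * (x_log_power - n_log_power)
    return math.log1p(math.cos(phase) / math.cosh(half_ratio))


if __name__ == "__main__":
    # The error is largest when both powers are equal and shrinks
    # as one component dominates the other.
    for log_ratio in (0.0, 2.0, 10.0):
        err = log_power_sum_error(log_ratio, 0.0, 0.0)
        print(f"x - n = {log_ratio:5.1f}, phase = 0: error = {err:.4f}")
```

Under these assumptions the error depends only on the log power ratio x - n and on the phase: it vanishes when speech and noise are in quadrature (phase = pi/2) and decays as either component dominates, which matches the qualitative behaviour described in the abstract.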

Keywords

Robust Automatic Speech Recognition; Bayesian feature enhancement; observation model for reverberant and noisy speech

Publishing Year

2012

Proceedings Title

Signal Processing, Communications and Computing (ICSPCC), 2012 IEEE International Conference on

LibreCat-ID

11864

Cite this

Leutnant V, Krueger A, Haeb-Umbach R. A Statistical Observation Model For Noisy Reverberant Speech Features and its Application to Robust ASR. In: Signal Processing, Communications and Computing (ICSPCC), 2012 IEEE International Conference On. 2012.

Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2012). A Statistical Observation Model For Noisy Reverberant Speech Features and its Application to Robust ASR. In Signal Processing, Communications and Computing (ICSPCC), 2012 IEEE International Conference on.

@inproceedings{Leutnant_Krueger_Haeb-Umbach_2012, title={A Statistical Observation Model For Noisy Reverberant Speech Features and its Application to Robust ASR}, booktitle={Signal Processing, Communications and Computing (ICSPCC), 2012 IEEE International Conference on}, author={Leutnant, Volker and Krueger, Alexander and Haeb-Umbach, Reinhold}, year={2012} }

Leutnant, Volker, Alexander Krueger, and Reinhold Haeb-Umbach. “A Statistical Observation Model For Noisy Reverberant Speech Features and Its Application to Robust ASR.” In Signal Processing, Communications and Computing (ICSPCC), 2012 IEEE International Conference On, 2012.

V. Leutnant, A. Krueger, and R. Haeb-Umbach, “A Statistical Observation Model For Noisy Reverberant Speech Features and its Application to Robust ASR,” in Signal Processing, Communications and Computing (ICSPCC), 2012 IEEE International Conference on, 2012.

Leutnant, Volker, et al. “A Statistical Observation Model For Noisy Reverberant Speech Features and Its Application to Robust ASR.” Signal Processing, Communications and Computing (ICSPCC), 2012 IEEE International Conference On, 2012.

All files available under the following license(s):

Copyright Statement:

This Item is protected by copyright and/or related rights. [...]

Link(s) to Main File(s)

URL

http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6335731

Access Level

Closed Access
