book chapter
Conditional Bayesian Estimation Employing a Phase-Sensitive Observation Model for Noise Robust Speech Recognition
Volker
Leutnant
author
Reinhold
Haeb-Umbach
author 242
ReinholdHaeb-Umbach
editor
DorotheaKolossa
editor
54
department
In this contribution, conditional Bayesian estimation employing a phase-sensitive observation model for noise robust speech recognition will be studied. After a review of speech recognition under the presence of corrupted features, termed uncertainty decoding, the estimation of the posterior distribution of the uncorrupted (clean) feature vector will be shown to be a key element of noise robust speech recognition. The estimation process will be based on three major components: an a priori model of the unobservable data, an observation model relating the unobservable data to the corrupted observation and an inference algorithm, finally allowing for a computationally tractable solution. Special stress will be laid on a detailed derivation of the phase-sensitive observation model and the required moments of the phase factor distribution. Thereby, it will not only be proven analytically that the phase factor distribution is non-Gaussian but also that all central moments can (approximately) be computed solely based on the used mel filter bank, finally rendering the moments independent of noise type and signal-to-noise ratio. The phase-sensitive observation model will then be incorporated into a model-based feature enhancement scheme and recognition experiments will be carried out on the Aurora~2 and Aurora~4 databases. The importance of incorporating phase factor information into the enhancement scheme is pointed out by all recognition results. Application of the proposed scheme under the derived uncertainty decoding framework further leads to significant improvements in both recognition tasks, eventually reaching the performance achieved with the ETSI advanced front-end.
Springer2011
eng
Robust Speech Recognition of Uncertain or Missing Data
Leutnant, V., & Haeb-Umbach, R. (2011). Conditional Bayesian Estimation Employing a Phase-Sensitive Observation Model for Noise Robust Speech Recognition. In R. Haeb-Umbach & D. Kolossa (Eds.), <i>Robust Speech Recognition of Uncertain or Missing Data</i>. Springer.
Leutnant V, Haeb-Umbach R. Conditional Bayesian Estimation Employing a Phase-Sensitive Observation Model for Noise Robust Speech Recognition. In: Haeb-Umbach R, Kolossa D, eds. <i>Robust Speech Recognition of Uncertain or Missing Data</i>. Springer; 2011.
Leutnant, Volker, and Reinhold Haeb-Umbach. “Conditional Bayesian Estimation Employing a Phase-Sensitive Observation Model for Noise Robust Speech Recognition.” In <i>Robust Speech Recognition of Uncertain or Missing Data</i>, edited by Reinhold Haeb-Umbach and Dorothea Kolossa. Springer, 2011.
V. Leutnant and R. Haeb-Umbach, “Conditional Bayesian Estimation Employing a Phase-Sensitive Observation Model for Noise Robust Speech Recognition,” in <i>Robust Speech Recognition of Uncertain or Missing Data</i>, R. Haeb-Umbach and D. Kolossa, Eds. Springer, 2011.
Leutnant, Volker, and Reinhold Haeb-Umbach. “Conditional Bayesian Estimation Employing a Phase-Sensitive Observation Model for Noise Robust Speech Recognition.” <i>Robust Speech Recognition of Uncertain or Missing Data</i>, edited by Reinhold Haeb-Umbach and Dorothea Kolossa, Springer, 2011.
V. Leutnant, R. Haeb-Umbach, in: R. Haeb-Umbach, D. Kolossa (Eds.), Robust Speech Recognition of Uncertain or Missing Data, Springer, 2011.
@inbook{Leutnant_Haeb-Umbach_2011, title={Conditional Bayesian Estimation Employing a Phase-Sensitive Observation Model for Noise Robust Speech Recognition}, booktitle={Robust Speech Recognition of Uncertain or Missing Data}, publisher={Springer}, author={Leutnant, Volker and Haeb-Umbach, Reinhold}, editor={Haeb-Umbach, Reinhold and Kolossa, DorotheaEditors}, year={2011} }
118562019-07-12T05:29:35Z2022-01-06T06:51:11Z