A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition
V. Ion, R. Haeb-Umbach, IEEE Transactions on Audio, Speech, and Language Processing 16 (2008) 1047–1060.
Download (ext.)
Journal Article
| English
Author
Ion, Valentin;
Haeb-Umbach, ReinholdLibreCat
Abstract
In this paper, we derive an uncertainty decoding rule for automatic speech recognition (ASR), which accounts for both corrupted observations and inter-frame correlation. The conditional independence assumption, prevalent in hidden Markov model-based ASR, is relaxed to obtain a clean speech posterior that is conditioned on the complete observed feature vector sequence. This is a more informative posterior than one conditioned only on the current observation. The novel decoding is used to obtain a transmission-error robust remote ASR system, where the speech capturing unit is connected to the decoder via an error-prone communication network. We show how the clean speech posterior can be computed for communication links being characterized by either bit errors or packet loss. Recognition results are presented for both distributed and network speech recognition, where in the latter case common voice-over-IP codecs are employed.
Keywords
automatic speech recognition;
bit errors;
codecs;
communication links;
corrupted observations;
decoding;
distributed speech recognition;
error-prone communication network;
feature vector sequence;
hidden Markov model-based ASR;
hidden Markov models;
inter-frame correlation;
Internet telephony;
network speech recognition;
packet loss;
speech posterior;
speech recognition;
transmission error robust speech recognition;
uncertainty decoding;
voice-over-IP codecs
Publishing Year
Journal Title
IEEE Transactions on Audio, Speech, and Language Processing
Volume
16
Issue
5
Page
1047-1060
LibreCat-ID
Cite this
Ion V, Haeb-Umbach R. A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing. 2008;16(5):1047-1060. doi:10.1109/TASL.2008.925879
Ion, V., & Haeb-Umbach, R. (2008). A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 16(5), 1047–1060. https://doi.org/10.1109/TASL.2008.925879
@article{Ion_Haeb-Umbach_2008, title={A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition}, volume={16}, DOI={10.1109/TASL.2008.925879}, number={5}, journal={IEEE Transactions on Audio, Speech, and Language Processing}, author={Ion, Valentin and Haeb-Umbach, Reinhold}, year={2008}, pages={1047–1060} }
Ion, Valentin, and Reinhold Haeb-Umbach. “A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition.” IEEE Transactions on Audio, Speech, and Language Processing 16, no. 5 (2008): 1047–60. https://doi.org/10.1109/TASL.2008.925879.
V. Ion and R. Haeb-Umbach, “A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 16, no. 5, pp. 1047–1060, 2008.
Ion, Valentin, and Reinhold Haeb-Umbach. “A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition.” IEEE Transactions on Audio, Speech, and Language Processing, vol. 16, no. 5, 2008, pp. 1047–60, doi:10.1109/TASL.2008.925879.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Link(s) to Main File(s)
Access Level
Closed Access