Uncertainty decoding for distributed speech recognition over error-prone networks
V. Ion, R. Haeb-Umbach, Speech Communication 48 (2006) 1435–1446.
Download (ext.)
Journal Article
| English
Author
Ion, Valentin;
Haeb-Umbach, ReinholdLibreCat
Abstract
In this paper, we propose an enhanced error concealment strategy at the server side of a distributed speech recognition (DSR) system, which is fully compatible with the existing DSR standard. It is based on a Bayesian approach, where the a posteriori probability density of the error-free feature vector is computed, given all received feature vectors which are possibly corrupted by transmission errors. Rather than computing a point estimate, such as the MMSE estimate, and plugging it into the Bayesian decision rule, we employ uncertainty decoding, which results in an integration over the uncertainty in the feature domain. In a typical scenario the communication between the thin client, often a mobile device, and the recognition server spreads across heterogeneous networks. Both bit errors on circuit-switched links and lost data packets on IP connections are mitigated by our approach in a unified manner. The experiments reveal improved robustness both for small- and large-vocabulary recognition tasks.
Keywords
Publishing Year
Journal Title
Speech Communication
Volume
48
Issue
11
Page
1435-1446
LibreCat-ID
Cite this
Ion V, Haeb-Umbach R. Uncertainty decoding for distributed speech recognition over error-prone networks. Speech Communication. 2006;48(11):1435-1446. doi:10.1016/j.specom.2006.03.007
Ion, V., & Haeb-Umbach, R. (2006). Uncertainty decoding for distributed speech recognition over error-prone networks. Speech Communication, 48(11), 1435–1446. https://doi.org/10.1016/j.specom.2006.03.007
@article{Ion_Haeb-Umbach_2006, title={Uncertainty decoding for distributed speech recognition over error-prone networks}, volume={48}, DOI={10.1016/j.specom.2006.03.007}, number={11}, journal={Speech Communication}, author={Ion, Valentin and Haeb-Umbach, Reinhold}, year={2006}, pages={1435–1446} }
Ion, Valentin, and Reinhold Haeb-Umbach. “Uncertainty Decoding for Distributed Speech Recognition over Error-Prone Networks.” Speech Communication 48, no. 11 (2006): 1435–46. https://doi.org/10.1016/j.specom.2006.03.007.
V. Ion and R. Haeb-Umbach, “Uncertainty decoding for distributed speech recognition over error-prone networks,” Speech Communication, vol. 48, no. 11, pp. 1435–1446, 2006.
Ion, Valentin, and Reinhold Haeb-Umbach. “Uncertainty Decoding for Distributed Speech Recognition over Error-Prone Networks.” Speech Communication, vol. 48, no. 11, 2006, pp. 1435–46, doi:10.1016/j.specom.2006.03.007.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Link(s) to Main File(s)
Access Level
Closed Access