---
_id: '11820'
abstract:
- lang: eng
  text: In this paper, we derive an uncertainty decoding rule for automatic speech
    recognition (ASR), which accounts for both corrupted observations and inter-frame
    correlation. The conditional independence assumption, prevalent in hidden Markov
    model-based ASR, is relaxed to obtain a clean speech posterior that is conditioned
    on the complete observed feature vector sequence. This is a more informative posterior
    than one conditioned only on the current observation. The novel decoding is used
    to obtain a transmission-error robust remote ASR system, where the speech capturing
    unit is connected to the decoder via an error-prone communication network. We
    show how the clean speech posterior can be computed for communication links
    characterized by either bit errors or packet loss. Recognition results are presented
    for both distributed and network speech recognition, where in the latter case
    common voice-over-IP codecs are employed.
author:
- first_name: Valentin
  full_name: Ion, Valentin
  last_name: Ion
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: Ion V, Haeb-Umbach R. A Novel Uncertainty Decoding Rule With Applications to
    Transmission Error Robust Speech Recognition. <i>IEEE Transactions on Audio, Speech,
    and Language Processing</i>. 2008;16(5):1047-1060. doi:<a href="https://doi.org/10.1109/TASL.2008.925879">10.1109/TASL.2008.925879</a>
  apa: Ion, V., &#38; Haeb-Umbach, R. (2008). A Novel Uncertainty Decoding Rule With
    Applications to Transmission Error Robust Speech Recognition. <i>IEEE Transactions
    on Audio, Speech, and Language Processing</i>, <i>16</i>(5), 1047–1060. <a href="https://doi.org/10.1109/TASL.2008.925879">https://doi.org/10.1109/TASL.2008.925879</a>
  bibtex: '@article{Ion_Haeb-Umbach_2008, title={A Novel Uncertainty Decoding Rule
    With Applications to Transmission Error Robust Speech Recognition}, volume={16},
    DOI={<a href="https://doi.org/10.1109/TASL.2008.925879">10.1109/TASL.2008.925879</a>},
    number={5}, journal={IEEE Transactions on Audio, Speech, and Language Processing},
    author={Ion, Valentin and Haeb-Umbach, Reinhold}, year={2008}, pages={1047–1060}
    }'
  chicago: 'Ion, Valentin, and Reinhold Haeb-Umbach. “A Novel Uncertainty Decoding
    Rule With Applications to Transmission Error Robust Speech Recognition.” <i>IEEE
    Transactions on Audio, Speech, and Language Processing</i> 16, no. 5 (2008): 1047–60.
    <a href="https://doi.org/10.1109/TASL.2008.925879">https://doi.org/10.1109/TASL.2008.925879</a>.'
  ieee: V. Ion and R. Haeb-Umbach, “A Novel Uncertainty Decoding Rule With Applications
    to Transmission Error Robust Speech Recognition,” <i>IEEE Transactions on Audio,
    Speech, and Language Processing</i>, vol. 16, no. 5, pp. 1047–1060, 2008.
  mla: Ion, Valentin, and Reinhold Haeb-Umbach. “A Novel Uncertainty Decoding Rule
    With Applications to Transmission Error Robust Speech Recognition.” <i>IEEE Transactions
    on Audio, Speech, and Language Processing</i>, vol. 16, no. 5, 2008, pp. 1047–60,
    doi:<a href="https://doi.org/10.1109/TASL.2008.925879">10.1109/TASL.2008.925879</a>.
  short: V. Ion, R. Haeb-Umbach, IEEE Transactions on Audio, Speech, and Language
    Processing 16 (2008) 1047–1060.
date_created: 2019-07-12T05:28:53Z
date_updated: 2022-01-06T06:51:10Z
department:
- _id: '54'
doi: 10.1109/TASL.2008.925879
intvolume: '        16'
issue: '5'
keyword:
- automatic speech recognition
- bit errors
- codecs
- communication links
- corrupted observations
- decoding
- distributed speech recognition
- error-prone communication network
- feature vector sequence
- hidden Markov model-based ASR
- hidden Markov models
- inter-frame correlation
- Internet telephony
- network speech recognition
- packet loss
- speech posterior
- speech recognition
- transmission error robust speech recognition
- uncertainty decoding
- voice-over-IP codecs
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2008/IoHa08-1.pdf
oa: '1'
page: 1047-1060
publication: IEEE Transactions on Audio, Speech, and Language Processing
status: public
title: A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust
  Speech Recognition
type: journal_article
user_id: '44006'
volume: 16
year: '2008'
...
---
_id: '11824'
abstract:
- lang: eng
  text: Soft-feature based speech recognition, which is an example of uncertainty
    decoding, has been proven to be a robust error mitigation method for distributed
    speech recognition over wireless channels exhibiting bit errors. In this paper
    we extend this concept to packet-oriented transmissions. The a posteriori probability
    density function of the lost feature vector, given the closest received neighbours,
    is computed. In the experiments, the nearest frame repetition, which is shown
    to be equivalent to the MAP estimate, outperforms the MMSE estimate for long bursts.
    Taking the variance into account at the speech recognition stage results in superior
    performance compared to classical schemes using point estimates. A computationally
    and memory efficient implementation of the proposed packet loss compensation scheme
    based on table lookup is presented.
author:
- first_name: Valentin
  full_name: Ion, Valentin
  last_name: Ion
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Ion V, Haeb-Umbach R. An Inexpensive Packet Loss Compensation Scheme for Distributed
    Speech Recognition Based on Soft-Features. In: <i>IEEE International Conference
    on Acoustics, Speech and Signal Processing (ICASSP 2006)</i>. Vol 1. ; 2006:I.
    doi:<a href="https://doi.org/10.1109/ICASSP.2006.1659984">10.1109/ICASSP.2006.1659984</a>'
  apa: Ion, V., &#38; Haeb-Umbach, R. (2006). An Inexpensive Packet Loss Compensation
    Scheme for Distributed Speech Recognition Based on Soft-Features. In <i>IEEE International
    Conference on Acoustics, Speech and Signal Processing (ICASSP 2006)</i> (Vol.
    1, p. I). <a href="https://doi.org/10.1109/ICASSP.2006.1659984">https://doi.org/10.1109/ICASSP.2006.1659984</a>
  bibtex: '@inproceedings{Ion_Haeb-Umbach_2006, title={An Inexpensive Packet Loss
    Compensation Scheme for Distributed Speech Recognition Based on Soft-Features},
    volume={1}, DOI={<a href="https://doi.org/10.1109/ICASSP.2006.1659984">10.1109/ICASSP.2006.1659984</a>},
    booktitle={IEEE International Conference on Acoustics, Speech and Signal Processing
    (ICASSP 2006)}, author={Ion, Valentin and Haeb-Umbach, Reinhold}, year={2006},
    pages={I} }'
  chicago: Ion, Valentin, and Reinhold Haeb-Umbach. “An Inexpensive Packet Loss Compensation
    Scheme for Distributed Speech Recognition Based on Soft-Features.” In <i>IEEE
    International Conference on Acoustics, Speech and Signal Processing (ICASSP 2006)</i>,
    1:I, 2006. <a href="https://doi.org/10.1109/ICASSP.2006.1659984">https://doi.org/10.1109/ICASSP.2006.1659984</a>.
  ieee: V. Ion and R. Haeb-Umbach, “An Inexpensive Packet Loss Compensation Scheme
    for Distributed Speech Recognition Based on Soft-Features,” in <i>IEEE International
    Conference on Acoustics, Speech and Signal Processing (ICASSP 2006)</i>, 2006,
    vol. 1, p. I.
  mla: Ion, Valentin, and Reinhold Haeb-Umbach. “An Inexpensive Packet Loss Compensation
    Scheme for Distributed Speech Recognition Based on Soft-Features.” <i>IEEE International
    Conference on Acoustics, Speech and Signal Processing (ICASSP 2006)</i>, vol.
    1, 2006, p. I, doi:<a href="https://doi.org/10.1109/ICASSP.2006.1659984">10.1109/ICASSP.2006.1659984</a>.
  short: 'V. Ion, R. Haeb-Umbach, in: IEEE International Conference on Acoustics,
    Speech and Signal Processing (ICASSP 2006), 2006, p. I.'
date_created: 2019-07-12T05:28:58Z
date_updated: 2022-01-06T06:51:10Z
department:
- _id: '54'
doi: 10.1109/ICASSP.2006.1659984
intvolume: '         1'
keyword:
- distributed speech recognition
- least mean squares methods
- MAP estimate
- maximum likelihood estimation
- MMSE estimate
- packet loss compensation scheme
- packet switched communication
- posteriori probability density function
- robust error mitigation method
- soft-features
- speech recognition
- table lookup
- voice communication
- wireless channels
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2006/IoHa06-2.pdf
oa: '1'
page: I
publication: IEEE International Conference on Acoustics, Speech and Signal Processing
  (ICASSP 2006)
status: public
title: An Inexpensive Packet Loss Compensation Scheme for Distributed Speech Recognition
  Based on Soft-Features
type: conference
user_id: '44006'
volume: 1
year: '2006'
...
---
_id: '11825'
abstract:
- lang: eng
  text: In this paper, we propose an enhanced error concealment strategy at the server
    side of a distributed speech recognition (DSR) system, which is fully compatible
    with the existing DSR standard. It is based on a Bayesian approach, where the
    a posteriori probability density of the error-free feature vector is computed,
    given all received feature vectors which are possibly corrupted by transmission
    errors. Rather than computing a point estimate, such as the MMSE estimate, and
    plugging it into the Bayesian decision rule, we employ uncertainty decoding, which
    results in an integration over the uncertainty in the feature domain. In a typical
    scenario the communication between the thin client, often a mobile device, and
    the recognition server spreads across heterogeneous networks. Both bit errors
    on circuit-switched links and lost data packets on IP connections are mitigated
    by our approach in a unified manner. The experiments reveal improved robustness
    both for small- and large-vocabulary recognition tasks.
author:
- first_name: Valentin
  full_name: Ion, Valentin
  last_name: Ion
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: Ion V, Haeb-Umbach R. Uncertainty decoding for distributed speech recognition
    over error-prone networks. <i>Speech Communication</i>. 2006;48(11):1435-1446.
    doi:<a href="https://doi.org/10.1016/j.specom.2006.03.007">10.1016/j.specom.2006.03.007</a>
  apa: Ion, V., &#38; Haeb-Umbach, R. (2006). Uncertainty decoding for distributed
    speech recognition over error-prone networks. <i>Speech Communication</i>, <i>48</i>(11),
    1435–1446. <a href="https://doi.org/10.1016/j.specom.2006.03.007">https://doi.org/10.1016/j.specom.2006.03.007</a>
  bibtex: '@article{Ion_Haeb-Umbach_2006, title={Uncertainty decoding for distributed
    speech recognition over error-prone networks}, volume={48}, DOI={<a href="https://doi.org/10.1016/j.specom.2006.03.007">10.1016/j.specom.2006.03.007</a>},
    number={11}, journal={Speech Communication}, author={Ion, Valentin and Haeb-Umbach,
    Reinhold}, year={2006}, pages={1435–1446} }'
  chicago: 'Ion, Valentin, and Reinhold Haeb-Umbach. “Uncertainty Decoding for Distributed
    Speech Recognition over Error-Prone Networks.” <i>Speech Communication</i> 48,
    no. 11 (2006): 1435–46. <a href="https://doi.org/10.1016/j.specom.2006.03.007">https://doi.org/10.1016/j.specom.2006.03.007</a>.'
  ieee: V. Ion and R. Haeb-Umbach, “Uncertainty decoding for distributed speech recognition
    over error-prone networks,” <i>Speech Communication</i>, vol. 48, no. 11, pp.
    1435–1446, 2006.
  mla: Ion, Valentin, and Reinhold Haeb-Umbach. “Uncertainty Decoding for Distributed
    Speech Recognition over Error-Prone Networks.” <i>Speech Communication</i>, vol.
    48, no. 11, 2006, pp. 1435–46, doi:<a href="https://doi.org/10.1016/j.specom.2006.03.007">10.1016/j.specom.2006.03.007</a>.
  short: V. Ion, R. Haeb-Umbach, Speech Communication 48 (2006) 1435–1446.
date_created: 2019-07-12T05:28:59Z
date_updated: 2022-01-06T06:51:10Z
department:
- _id: '54'
doi: 10.1016/j.specom.2006.03.007
intvolume: '        48'
issue: '11'
keyword:
- Channel error robustness
- Distributed speech recognition
- Soft features
- Uncertainty decoding
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2006/IoHa06-3.pdf
oa: '1'
page: 1435-1446
publication: Speech Communication
status: public
title: Uncertainty decoding for distributed speech recognition over error-prone networks
type: journal_article
user_id: '44006'
volume: 48
year: '2006'
...
---
_id: '11828'
abstract:
- lang: eng
  text: 'In this paper we present a comparison of the recently proposed Soft-Feature
    Distributed Speech Recognition (SFDSR) with the two evaluated candidate codecs
    for Speech Enabled Services over wireless networks: Adaptive Multirate Codec (AMR)
    and the ETSI Extended Advanced Front-End for Distributed Speech Recognition (XAFE).
    It is shown that SFDSR achieves the best recognition performance on a simulated
    GSM transmission, followed by XAFE and AMR. We also present some new results concerning
    SFDSR which demonstrate the versatility of the approach. Further, a simple method
    is introduced which considerably reduces the computational effort.'
author:
- first_name: Valentin
  full_name: Ion, Valentin
  last_name: Ion
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Ion V, Haeb-Umbach R. A Comparison of Soft-Feature Distributed Speech Recognition
    with Candidate Codecs for Speech Enabled Mobile Services. In: <i>IEEE International
    Conference on Acoustics, Speech and Signal Processing (ICASSP 2005)</i>. Vol 1.
    ; 2005:333-336. doi:<a href="https://doi.org/10.1109/ICASSP.2005.1415118">10.1109/ICASSP.2005.1415118</a>'
  apa: Ion, V., &#38; Haeb-Umbach, R. (2005). A Comparison of Soft-Feature Distributed
    Speech Recognition with Candidate Codecs for Speech Enabled Mobile Services. In
    <i>IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP
    2005)</i> (Vol. 1, pp. 333–336). <a href="https://doi.org/10.1109/ICASSP.2005.1415118">https://doi.org/10.1109/ICASSP.2005.1415118</a>
  bibtex: '@inproceedings{Ion_Haeb-Umbach_2005, title={A Comparison of Soft-Feature
    Distributed Speech Recognition with Candidate Codecs for Speech Enabled Mobile
    Services}, volume={1}, DOI={<a href="https://doi.org/10.1109/ICASSP.2005.1415118">10.1109/ICASSP.2005.1415118</a>},
    booktitle={IEEE International Conference on Acoustics, Speech and Signal Processing
    (ICASSP 2005)}, author={Ion, Valentin and Haeb-Umbach, Reinhold}, year={2005},
    pages={333–336} }'
  chicago: Ion, Valentin, and Reinhold Haeb-Umbach. “A Comparison of Soft-Feature
    Distributed Speech Recognition with Candidate Codecs for Speech Enabled Mobile
    Services.” In <i>IEEE International Conference on Acoustics, Speech and Signal
    Processing (ICASSP 2005)</i>, 1:333–36, 2005. <a href="https://doi.org/10.1109/ICASSP.2005.1415118">https://doi.org/10.1109/ICASSP.2005.1415118</a>.
  ieee: V. Ion and R. Haeb-Umbach, “A Comparison of Soft-Feature Distributed Speech
    Recognition with Candidate Codecs for Speech Enabled Mobile Services,” in <i>IEEE
    International Conference on Acoustics, Speech and Signal Processing (ICASSP 2005)</i>,
    2005, vol. 1, pp. 333–336.
  mla: Ion, Valentin, and Reinhold Haeb-Umbach. “A Comparison of Soft-Feature Distributed
    Speech Recognition with Candidate Codecs for Speech Enabled Mobile Services.”
    <i>IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP
    2005)</i>, vol. 1, 2005, pp. 333–36, doi:<a href="https://doi.org/10.1109/ICASSP.2005.1415118">10.1109/ICASSP.2005.1415118</a>.
  short: 'V. Ion, R. Haeb-Umbach, in: IEEE International Conference on Acoustics,
    Speech and Signal Processing (ICASSP 2005), 2005, pp. 333–336.'
date_created: 2019-07-12T05:29:02Z
date_updated: 2022-01-06T06:51:10Z
department:
- _id: '54'
doi: 10.1109/ICASSP.2005.1415118
intvolume: '         1'
keyword:
- adaptive codes
- adaptive multirate codec
- AMR
- distributed speech recognition
- ETSI
- extended advanced front-end
- recognition performance
- SFDSR
- simulated GSM transmission
- soft-feature distributed speech recognition
- speech codecs
- speech coding
- speech recognition
- variable rate codes
- XAFE
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2005/IoHa05-2.pdf
oa: '1'
page: 333-336
publication: IEEE International Conference on Acoustics, Speech and Signal Processing
  (ICASSP 2005)
status: public
title: A Comparison of Soft-Feature Distributed Speech Recognition with Candidate
  Codecs for Speech Enabled Mobile Services
type: conference
user_id: '44006'
volume: 1
year: '2005'
...
