---
_id: '11917'
abstract:
- lang: eng
text: In this paper we present a speech presence probability (SPP) estimation algorithmwhich
exploits both temporal and spectral correlations of speech. To this end, the SPP
estimation is formulated as the posterior probability estimation of the states
of a two-dimensional (2D) Hidden Markov Model (HMM). We derive an iterative algorithm
to decode the 2D-HMM which is based on the turbo principle. The experimental results
show that indeed the SPP estimates improve from iteration to iteration, and further
clearly outperform another state-of-the-art SPP estimation algorithm.
author:
- first_name: Dang Hai Tran
full_name: Vu, Dang Hai Tran
last_name: Vu
- first_name: Reinhold
full_name: Haeb-Umbach, Reinhold
id: '242'
last_name: Haeb-Umbach
citation:
ama: 'Vu DHT, Haeb-Umbach R. Using the turbo principle for exploiting temporal and
spectral correlations in speech presence probability estimation. In: 38th International
Conference on Acoustics, Speech and Signal Processing (ICASSP 2013). ; 2013:863-867.
doi:10.1109/ICASSP.2013.6637771'
apa: Vu, D. H. T., & Haeb-Umbach, R. (2013). Using the turbo principle for exploiting
temporal and spectral correlations in speech presence probability estimation.
In 38th International Conference on Acoustics, Speech and Signal Processing
(ICASSP 2013) (pp. 863–867). https://doi.org/10.1109/ICASSP.2013.6637771
bibtex: '@inproceedings{Vu_Haeb-Umbach_2013, title={Using the turbo principle for
exploiting temporal and spectral correlations in speech presence probability estimation},
DOI={10.1109/ICASSP.2013.6637771},
booktitle={38th International Conference on Acoustics, Speech and Signal Processing
(ICASSP 2013)}, author={Vu, Dang Hai Tran and Haeb-Umbach, Reinhold}, year={2013},
pages={863–867} }'
chicago: Vu, Dang Hai Tran, and Reinhold Haeb-Umbach. “Using the Turbo Principle
for Exploiting Temporal and Spectral Correlations in Speech Presence Probability
Estimation.” In 38th International Conference on Acoustics, Speech and Signal
Processing (ICASSP 2013), 863–67, 2013. https://doi.org/10.1109/ICASSP.2013.6637771.
ieee: D. H. T. Vu and R. Haeb-Umbach, “Using the turbo principle for exploiting
temporal and spectral correlations in speech presence probability estimation,”
in 38th International Conference on Acoustics, Speech and Signal Processing
(ICASSP 2013), 2013, pp. 863–867.
mla: Vu, Dang Hai Tran, and Reinhold Haeb-Umbach. “Using the Turbo Principle for
Exploiting Temporal and Spectral Correlations in Speech Presence Probability Estimation.”
38th International Conference on Acoustics, Speech and Signal Processing (ICASSP
2013), 2013, pp. 863–67, doi:10.1109/ICASSP.2013.6637771.
short: 'D.H.T. Vu, R. Haeb-Umbach, in: 38th International Conference on Acoustics,
Speech and Signal Processing (ICASSP 2013), 2013, pp. 863–867.'
date_created: 2019-07-12T05:30:45Z
date_updated: 2022-01-06T06:51:12Z
department:
- _id: '54'
doi: 10.1109/ICASSP.2013.6637771
keyword:
- correlation methods
- estimation theory
- hidden Markov models
- iterative methods
- probability
- spectral analysis
- speech processing
- 2D HMM
- SPP estimates
- iterative algorithm
- posterior probability estimation
- spectral correlation
- speech presence probability estimation
- state-of-the-art SPP estimation algorithm
- temporal correlation
- turbo principle
- two-dimensional hidden Markov model
- Correlation
- Decoding
- Estimation
- Iterative decoding
- Noise
- Speech
- Vectors
language:
- iso: eng
page: 863-867
publication: 38th International Conference on Acoustics, Speech and Signal Processing
(ICASSP 2013)
publication_identifier:
issn:
- 1520-6149
status: public
title: Using the turbo principle for exploiting temporal and spectral correlations
in speech presence probability estimation
type: conference
user_id: '44006'
year: '2013'
...
---
_id: '11937'
abstract:
- lang: eng
text: In automatic speech recognition, hidden Markov models (HMMs) are commonly
used for speech decoding, while switching linear dynamic models (SLDMs) can be
employed for a preceding model-based speech feature enhancement. In this paper,
these model types are combined in order to obtain a novel iterative speech feature
enhancement and recognition architecture. It is shown that speech feature enhancement
with SLDMs can be improved by feeding back information from the HMM to the enhancement
stage. Two different feedback structures are derived. In the first, the posteriors
of the HMM states are used to control the model probabilities of the SLDMs, while
in the second they are employed to directly influence the estimate of the speech
feature distribution. Both approaches lead to improvements in recognition accuracy
both on the AURORA2 and AURORA4 databases compared to non-iterative speech feature
enhancement with SLDMs. It is also shown that a combination with uncertainty decoding
further enhances performance.
author:
- first_name: Stefan
full_name: Windmann, Stefan
last_name: Windmann
- first_name: Reinhold
full_name: Haeb-Umbach, Reinhold
id: '242'
last_name: Haeb-Umbach
citation:
ama: Windmann S, Haeb-Umbach R. Approaches to Iterative Speech Feature Enhancement
and Recognition. IEEE Transactions on Audio, Speech, and Language Processing.
2009;17(5):974-984. doi:10.1109/TASL.2009.2014894
apa: Windmann, S., & Haeb-Umbach, R. (2009). Approaches to Iterative Speech
Feature Enhancement and Recognition. IEEE Transactions on Audio, Speech, and
Language Processing, 17(5), 974–984. https://doi.org/10.1109/TASL.2009.2014894
bibtex: '@article{Windmann_Haeb-Umbach_2009, title={Approaches to Iterative Speech
Feature Enhancement and Recognition}, volume={17}, DOI={10.1109/TASL.2009.2014894},
number={5}, journal={IEEE Transactions on Audio, Speech, and Language Processing},
author={Windmann, Stefan and Haeb-Umbach, Reinhold}, year={2009}, pages={974–984}
}'
chicago: 'Windmann, Stefan, and Reinhold Haeb-Umbach. “Approaches to Iterative Speech
Feature Enhancement and Recognition.” IEEE Transactions on Audio, Speech, and
Language Processing 17, no. 5 (2009): 974–84. https://doi.org/10.1109/TASL.2009.2014894.'
ieee: S. Windmann and R. Haeb-Umbach, “Approaches to Iterative Speech Feature Enhancement
and Recognition,” IEEE Transactions on Audio, Speech, and Language Processing,
vol. 17, no. 5, pp. 974–984, 2009.
mla: Windmann, Stefan, and Reinhold Haeb-Umbach. “Approaches to Iterative Speech
Feature Enhancement and Recognition.” IEEE Transactions on Audio, Speech, and
Language Processing, vol. 17, no. 5, 2009, pp. 974–84, doi:10.1109/TASL.2009.2014894.
short: S. Windmann, R. Haeb-Umbach, IEEE Transactions on Audio, Speech, and Language
Processing 17 (2009) 974–984.
date_created: 2019-07-12T05:31:08Z
date_updated: 2022-01-06T06:51:12Z
department:
- _id: '54'
doi: 10.1109/TASL.2009.2014894
intvolume: ' 17'
issue: '5'
keyword:
- AURORA2 databases
- AURORA4 databases
- automatic speech recognition
- feedback structures
- hidden Markov models
- HMM
- iterative methods
- iterative speech feature enhancement
- model probabilities
- speech decoding
- speech enhancement
- speech feature distribution
- speech recognition
- switching linear dynamic models
language:
- iso: eng
main_file_link:
- open_access: '1'
url: https://groups.uni-paderborn.de/nt/pubs/2009/WiHa09-1.pdf
oa: '1'
page: 974-984
publication: IEEE Transactions on Audio, Speech, and Language Processing
status: public
title: Approaches to Iterative Speech Feature Enhancement and Recognition
type: journal_article
user_id: '44006'
volume: 17
year: '2009'
...