---
_id: '11861'
abstract:
- lang: eng
text: 'In this contribution we present a theoretical and experimental investigation
into the effects of reverberation and noise on features in the logarithmic mel
power spectral domain, an intermediate stage in the computation of the mel frequency
cepstral coefficients, prevalent in automatic speech recognition (ASR). Gaining
insight into the complex interaction between clean speech, noise, and noisy reverberant
speech features is essential for any ASR system to be robust against noise and
reverberation present in distant microphone input signals. The findings are gathered
in a probabilistic formulation of an observation model which may be used in model-based
feature compensation schemes. The proposed observation model extends previous
models in three major directions: First, the contribution of additive background
noise to the observation error is explicitly taken into account. Second, an energy
compensation constant is introduced which ensures an unbiased estimate of the
reverberant speech features, and, third, a recursive variant of the observation
model is developed resulting in reduced computational complexity when used in
model-based feature compensation. The experimental section is used to evaluate
the accuracy of the model and to describe how its parameters can be determined
from test data.'
author:
- first_name: Volker
full_name: Leutnant, Volker
last_name: Leutnant
- first_name: Alexander
full_name: Krueger, Alexander
last_name: Krueger
- first_name: Reinhold
full_name: Haeb-Umbach, Reinhold
id: '242'
last_name: Haeb-Umbach
citation:
ama: Leutnant V, Krueger A, Haeb-Umbach R. A New Observation Model in the Logarithmic
Mel Power Spectral Domain for the Automatic Recognition of Noisy Reverberant Speech.
IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2014;22(1):95-109.
doi:10.1109/TASLP.2013.2285480
apa: Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2014). A New Observation
Model in the Logarithmic Mel Power Spectral Domain for the Automatic Recognition
of Noisy Reverberant Speech. IEEE/ACM Transactions on Audio, Speech, and Language
Processing, 22(1), 95–109. https://doi.org/10.1109/TASLP.2013.2285480
bibtex: '@article{Leutnant_Krueger_Haeb-Umbach_2014, title={A New Observation Model
in the Logarithmic Mel Power Spectral Domain for the Automatic Recognition of
Noisy Reverberant Speech}, volume={22}, DOI={10.1109/TASLP.2013.2285480},
number={1}, journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing},
author={Leutnant, Volker and Krueger, Alexander and Haeb-Umbach, Reinhold}, year={2014},
pages={95–109} }'
chicago: 'Leutnant, Volker, Alexander Krueger, and Reinhold Haeb-Umbach. “A New
Observation Model in the Logarithmic Mel Power Spectral Domain for the Automatic
Recognition of Noisy Reverberant Speech.” IEEE/ACM Transactions on Audio, Speech,
and Language Processing 22, no. 1 (2014): 95–109. https://doi.org/10.1109/TASLP.2013.2285480.'
ieee: V. Leutnant, A. Krueger, and R. Haeb-Umbach, “A New Observation Model in the
Logarithmic Mel Power Spectral Domain for the Automatic Recognition of Noisy Reverberant
Speech,” IEEE/ACM Transactions on Audio, Speech, and Language Processing,
vol. 22, no. 1, pp. 95–109, 2014.
mla: Leutnant, Volker, et al. “A New Observation Model in the Logarithmic Mel Power
Spectral Domain for the Automatic Recognition of Noisy Reverberant Speech.” IEEE/ACM
Transactions on Audio, Speech, and Language Processing, vol. 22, no. 1, 2014,
pp. 95–109, doi:10.1109/TASLP.2013.2285480.
short: V. Leutnant, A. Krueger, R. Haeb-Umbach, IEEE/ACM Transactions on Audio,
Speech, and Language Processing 22 (2014) 95–109.
date_created: 2019-07-12T05:29:41Z
date_updated: 2022-01-06T06:51:11Z
department:
- _id: '54'
doi: 10.1109/TASLP.2013.2285480
intvolume: ' 22'
issue: '1'
keyword:
- computational complexity
- reverberation
- speech recognition
- automatic speech recognition
- background noise
- clean speech
- computational complexity
- energy compensation
- logarithmic mel power spectral domain
- mel frequency cepstral coefficients
- microphone input signals
- model-based feature compensation schemes
- noisy reverberant speech automatic recognition
- noisy reverberant speech features
- reverberation
- Atmospheric modeling
- Computational modeling
- Noise
- Noise measurement
- Reverberation
- Speech
- Vectors
- Model-based feature compensation
- observation model for reverberant and noisy speech
- recursive observation model
- robust automatic speech recognition
language:
- iso: eng
page: 95-109
publication: IEEE/ACM Transactions on Audio, Speech, and Language Processing
publication_identifier:
issn:
- 2329-9290
status: public
title: A New Observation Model in the Logarithmic Mel Power Spectral Domain for the
Automatic Recognition of Noisy Reverberant Speech
type: journal_article
user_id: '44006'
volume: 22
year: '2014'
...