An Investigation of Cepstral Parameterisations for Large Vocabulary Speech Recognition

R. Haeb-Umbach, M. Loog, in: Eurospeech, 1999.

Conference Paper | English
Author
Abstract
We examined variants of MFCC and PLP cepstral parameterisations in the context of large vocabulary continuous speech recognition under different acous-tical environmental conditions: Compared to MFCC, mel-frequency PLP uses a cubic root intensity-to-loudness law, and an LPC analysis is applied to the mel-warped spectrum. In LPC-smoothed MFCC, the only difference to MFCC is the additional LPC smoothing of the warped spectrum. While neither technique was able to significantly outperform the MFCC parameterisation in our setup which includes an LDA feature transformation, feature set combination via DMC at the acoustic likelihood level and via ROVER at the recognized word level delivered small but consistent improvements.
Publishing Year
Proceedings Title
Eurospeech
LibreCat-ID

Cite this

Haeb-Umbach R, Loog M. An Investigation of Cepstral Parameterisations for Large Vocabulary Speech Recognition. In: Eurospeech. ; 1999.
Haeb-Umbach, R., & Loog, M. (1999). An Investigation of Cepstral Parameterisations for Large Vocabulary Speech Recognition. In Eurospeech.
@inproceedings{Haeb-Umbach_Loog_1999, title={An Investigation of Cepstral Parameterisations for Large Vocabulary Speech Recognition}, booktitle={Eurospeech}, author={Haeb-Umbach, Reinhold and Loog, Marco}, year={1999} }
Haeb-Umbach, Reinhold, and Marco Loog. “An Investigation of Cepstral Parameterisations for Large Vocabulary Speech Recognition.” In Eurospeech, 1999.
R. Haeb-Umbach and M. Loog, “An Investigation of Cepstral Parameterisations for Large Vocabulary Speech Recognition,” in Eurospeech, 1999.
Haeb-Umbach, Reinhold, and Marco Loog. “An Investigation of Cepstral Parameterisations for Large Vocabulary Speech Recognition.” Eurospeech, 1999.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]

Link(s) to Main File(s)
Access Level
Restricted Closed Access

Export

Marked Publications

Open Data LibreCat

Search this title in

Google Scholar