Acoustic Modeling in the Philips Hub-4 Continuous-Speech Recognition System

Haeb-Umbach, Reinhold; Aubert, Xavier L.; Beyerlein, Peter; Klakow, Dietrich; Ullrich, Meinhard; Wendemuth, Andreas; Wilcox, Patricia

Acoustic Modeling in the Philips Hub-4 Continuous-Speech Recognition System

R. Haeb-Umbach, X.L. Aubert, P. Beyerlein, D. Klakow, M. Ullrich, A. Wendemuth, P. Wilcox, in: DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne, 1998.

Download (ext.)

https://groups.uni-paderborn.de/nt/pubs/1998/Workshop_Landsdowne_1998_Haeb2_pape[...]

Conference Paper | English

Author

Haeb-Umbach, Reinhold^LibreCat; Aubert, Xavier L.; Beyerlein, Peter; Klakow, Dietrich; Ullrich, Meinhard; Wendemuth, Andreas; Wilcox, Patricia

Department

Nachrichtentechnik (NT) / Heinz Nixdorf Institut

Abstract

In this paper we describe some characteristics of the acoustic modeling used in the Philips continuous-speech recognition system for the DARPA Hub-4 1997 evaluation, which are related to robustness issues. We aimed at a conceptually simple system: We trained two model sets on 70 hours of the Hub-4 training data, one for within-word and one for cross-word decoding. These model sets were used for both genders and all environmental conditions. In order to be able to do so, channel normalization (mean, variance normalization) and speaker normalization (vocal tract length normalization, realized by an appropriate shift of the center frequencies of the mel filter bank) have been applied, as well as adaptation techniques. MLLR-based unsupervised batch adaptation on clusters of segments was conducted both after a first within-word decoding and a cross-word decoding pass. The training strategy and the effects of the various normalization and adaptation techniques will be discussed in the paper.

Publishing Year

1998

Proceedings Title

DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne

LibreCat-ID

11784

Cite this

Haeb-Umbach R, Aubert XL, Beyerlein P, et al. Acoustic Modeling in the Philips Hub-4 Continuous-Speech Recognition System. In: DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne. ; 1998.

Haeb-Umbach, R., Aubert, X. L., Beyerlein, P., Klakow, D., Ullrich, M., Wendemuth, A., & Wilcox, P. (1998). Acoustic Modeling in the Philips Hub-4 Continuous-Speech Recognition System. In DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne.

@inproceedings{Haeb-Umbach_Aubert_Beyerlein_Klakow_Ullrich_Wendemuth_Wilcox_1998, title={Acoustic Modeling in the Philips Hub-4 Continuous-Speech Recognition System}, booktitle={DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne}, author={Haeb-Umbach, Reinhold and Aubert, Xavier L. and Beyerlein, Peter and Klakow, Dietrich and Ullrich, Meinhard and Wendemuth, Andreas and Wilcox, Patricia}, year={1998} }

Haeb-Umbach, Reinhold, Xavier L. Aubert, Peter Beyerlein, Dietrich Klakow, Meinhard Ullrich, Andreas Wendemuth, and Patricia Wilcox. “Acoustic Modeling in the Philips Hub-4 Continuous-Speech Recognition System.” In DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne, 1998.

R. Haeb-Umbach et al., “Acoustic Modeling in the Philips Hub-4 Continuous-Speech Recognition System,” in DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne, 1998.

Haeb-Umbach, Reinhold, et al. “Acoustic Modeling in the Philips Hub-4 Continuous-Speech Recognition System.” DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne, 1998.

All files available under the following license(s):

Copyright Statement:

This Item is protected by copyright and/or related rights. [...]

Link(s) to Main File(s)

URL

https://groups.uni-paderborn.de/nt/pubs/1998/Workshop_Landsdowne_1998_Haeb2_paper.pdf

Access Level

Closed Access

Export

Marked Publications

Open Data LibreCat

Search this title in

Google Scholar