A Study on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive Training

Welling, L.; Haeb-Umbach, Reinhold; Aubert, X.; Haberland, N.

A Study on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive Training

L. Welling, R. Haeb-Umbach, X. Aubert, N. Haberland, in: ICASSP 1998, Seattle, 1998.

Download (ext.)

https://groups.uni-paderborn.de/nt/pubs/1998/ICASSP_1998_Haeb_paper.pdf

Conference Paper | English

Author

Welling, L.; Haeb-Umbach, Reinhold^LibreCat; Aubert, X.; Haberland, N.

Department

Nachrichtentechnik (NT) / Heinz Nixdorf Institut

Abstract

Although speaker normalization is attempted in very different manners, vocal tract normalization (VTN) and speaker adaptive training (SAT) share many common properties. We show that both lead to more compact representations of the phonetically relevant variations of the training data and that both achieve improved error rate performance only if a complementary normalization or adaptation operation is conducted on the test data. Algorithms for fast test speaker enrollment are presented for both normalization methods: in the framework of SAT, a pre-transformation step is proposed, which alone, i.e. without subsequent unsupervised MLLR adaption, reduces the error rate by almost 10% on the WSJ 5k test sets. For VTN, the use of a Gaussian mixture model makes obsolete a first recognition pass to obtain a preliminary transcription of the test utterance at hardly and loss in performance.

Publishing Year

1998

Proceedings Title

ICASSP 1998, Seattle

LibreCat-ID

11936

Cite this

Welling L, Haeb-Umbach R, Aubert X, Haberland N. A Study on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive Training. In: ICASSP 1998, Seattle. ; 1998.

Welling, L., Haeb-Umbach, R., Aubert, X., & Haberland, N. (1998). A Study on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive Training. In ICASSP 1998, Seattle.

@inproceedings{Welling_Haeb-Umbach_Aubert_Haberland_1998, title={A Study on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive Training}, booktitle={ICASSP 1998, Seattle}, author={Welling, L. and Haeb-Umbach, Reinhold and Aubert, X. and Haberland, N.}, year={1998} }

Welling, L., Reinhold Haeb-Umbach, X. Aubert, and N. Haberland. “A Study on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive Training.” In ICASSP 1998, Seattle, 1998.

L. Welling, R. Haeb-Umbach, X. Aubert, and N. Haberland, “A Study on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive Training,” in ICASSP 1998, Seattle, 1998.

Welling, L., et al. “A Study on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive Training.” ICASSP 1998, Seattle, 1998.

All files available under the following license(s):

Copyright Statement:

This Item is protected by copyright and/or related rights. [...]

Link(s) to Main File(s)

URL

https://groups.uni-paderborn.de/nt/pubs/1998/ICASSP_1998_Haeb_paper.pdf

Access Level

Closed Access

Export

Marked Publications

Open Data LibreCat

Search this title in

Google Scholar