A Study on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive Training

L. Welling, R. Haeb-Umbach, X. Aubert, N. Haberland, in: ICASSP 1998, Seattle, 1998.

Conference Paper | English
Author
; ; ;
Abstract
Although speaker normalization is attempted in very different manners, vocal tract normalization (VTN) and speaker adaptive training (SAT) share many common properties. We show that both lead to more compact representations of the phonetically relevant variations of the training data and that both achieve improved error rate performance only if a complementary normalization or adaptation operation is conducted on the test data. Algorithms for fast test speaker enrollment are presented for both normalization methods: in the framework of SAT, a pre-transformation step is proposed, which alone, i.e. without subsequent unsupervised MLLR adaption, reduces the error rate by almost 10% on the WSJ 5k test sets. For VTN, the use of a Gaussian mixture model makes obsolete a first recognition pass to obtain a preliminary transcription of the test utterance at hardly and loss in performance.
Publishing Year
Proceedings Title
ICASSP 1998, Seattle
LibreCat-ID

Cite this

Welling L, Haeb-Umbach R, Aubert X, Haberland N. A Study on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive Training. In: ICASSP 1998, Seattle. ; 1998.
Welling, L., Haeb-Umbach, R., Aubert, X., & Haberland, N. (1998). A Study on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive Training. In ICASSP 1998, Seattle.
@inproceedings{Welling_Haeb-Umbach_Aubert_Haberland_1998, title={A Study on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive Training}, booktitle={ICASSP 1998, Seattle}, author={Welling, L. and Haeb-Umbach, Reinhold and Aubert, X. and Haberland, N.}, year={1998} }
Welling, L., Reinhold Haeb-Umbach, X. Aubert, and N. Haberland. “A Study on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive Training.” In ICASSP 1998, Seattle, 1998.
L. Welling, R. Haeb-Umbach, X. Aubert, and N. Haberland, “A Study on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive Training,” in ICASSP 1998, Seattle, 1998.
Welling, L., et al. “A Study on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive Training.” ICASSP 1998, Seattle, 1998.

Link(s) to Main File(s)
Access Level
Restricted Closed Access

Export

Marked Publications

Open Data LibreCat

Search this title in

Google Scholar