Continuous speech dictation - From theory to practice

V. Steinbiss, H.J. Ney, U. Essen, B.H. Tran, X.L. Aubert, C. Dugast, R. Kneser, H.G. Meier, M. Oerder, R. Haeb-Umbach, D. Geller, W. Hoellerbauer, H. Bartosik, Speech Communication (1995).

Journal Article | English
Steinbiss, Volker; Ney, Hermann J.; Essen, Ute; Tran, Bach Hiep; Aubert, Xavier L.; Dugast, Christian; Kneser, Reinhard; Meier, Hans Günter; Oerder, Martin; Haeb-Umbach, Reinhold; Geller, Dieter; Hoellerbauer, W.
This paper gives an overview of the Philips research system for phoneme-based, large-vocabulary, continuous-speech recognition. The system has been successfully applied to various tasks in the German and (American) English languages, ranging from small-vocabulary tasks to very-large-vocabulary tasks. Here, we concentrate on continuous-speech recognition for dictation in real applications: the dictation of legal reports and radiology reports in German. We describe this task and report on experimental results. We also describe a commercial PC-based dictation system which includes a PC implementation of our scientific recognition prototype. To allow for a comparison with the performance of other systems, a section with an evaluation on the standard Wall Street Journal task (dictation of American English newspaper text) is supplied. The recognition architecture is based on an integrated statistical approach. We describe the characteristic features of the system as opposed to other systems: 1. the Viterbi criterion is consistently applied both in training and testing; 2. continuous mixture densities are used without tying or smoothing; 3. time-synchronous beam search in connection with a phoneme look-ahead is applied to a tree-organized lexicon.
Publishing Year
1995
Journal Title
Speech Communication

