---
_id: '11842'
abstract:
- lang: eng
  text: 'In this paper we present some experiments that have been performed while
    developing language models for the PHILIPS Broadcast News system. Three main issues
    will be discussed: construction of phrases, adaptation of remote corpora to this
    task, and the combination of the different models. Also, perplexities on the 1997
    evaluation data are reported.'
author:
- first_name: Dietrich
  full_name: Klakow, Dietrich
  last_name: Klakow
- first_name: Xavier L.
  full_name: Aubert, Xavier L.
  last_name: Aubert
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
- first_name: Peter
  full_name: Beyerlein, Peter
  last_name: Beyerlein
- first_name: Meinhard
  full_name: Ullrich, Meinhard
  last_name: Ullrich
- first_name: Andreas
  full_name: Wendemuth, Andreas
  last_name: Wendemuth
- first_name: Patricia
  full_name: Wilcox, Patricia
  last_name: Wilcox
citation:
  ama: 'Klakow D, Aubert XL, Haeb-Umbach R, et al. Language-Model Investigations related
    to Broadcast News. In: <i>DARPA Broadcast News Transcription and Understanding
    Workshop, Landsdowne</i>. ; 1998.'
  apa: Klakow, D., Aubert, X. L., Haeb-Umbach, R., Beyerlein, P., Ullrich, M., Wendemuth,
    A., &#38; Wilcox, P. (1998). Language-Model Investigations related to Broadcast
    News. In <i>DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne</i>.
  bibtex: '@inproceedings{Klakow_Aubert_Haeb-Umbach_Beyerlein_Ullrich_Wendemuth_Wilcox_1998,
    title={Language-Model Investigations related to Broadcast News}, booktitle={DARPA
    Broadcast News Transcription and Understanding Workshop, Landsdowne}, author={Klakow,
    Dietrich and Aubert, Xavier L. and Haeb-Umbach, Reinhold and Beyerlein, Peter
    and Ullrich, Meinhard and Wendemuth, Andreas and Wilcox, Patricia}, year={1998}
    }'
  chicago: Klakow, Dietrich, Xavier L. Aubert, Reinhold Haeb-Umbach, Peter Beyerlein,
    Meinhard Ullrich, Andreas Wendemuth, and Patricia Wilcox. “Language-Model Investigations
    Related to Broadcast News.” In <i>DARPA Broadcast News Transcription and Understanding
    Workshop, Landsdowne</i>, 1998.
  ieee: D. Klakow <i>et al.</i>, “Language-Model Investigations related to Broadcast
    News,” in <i>DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne</i>,
    1998.
  mla: Klakow, Dietrich, et al. “Language-Model Investigations Related to Broadcast
    News.” <i>DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne</i>,
    1998.
  short: 'D. Klakow, X.L. Aubert, R. Haeb-Umbach, P. Beyerlein, M. Ullrich, A. Wendemuth,
    P. Wilcox, in: DARPA Broadcast News Transcription and Understanding Workshop,
    Landsdowne, 1998.'
date_created: 2019-07-12T05:29:19Z
date_updated: 2022-01-06T06:51:11Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/1998/Workshop_Lansdowne_1998_Haeb1_paper.pdf
oa: '1'
publication: DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne
status: public
title: Language-Model Investigations related to Broadcast News
type: conference
user_id: '44006'
year: '1998'
...
---
_id: '11936'
abstract:
- lang: eng
  text: 'Although speaker normalization is attempted in very different manners, vocal
    tract normalization (VTN) and speaker adaptive training (SAT) share many common
    properties. We show that both lead to more compact representations of the phonetically
    relevant variations of the training data and that both achieve improved error
    rate performance only if a complementary normalization or adaptation operation
    is conducted on the test data. Algorithms for fast test speaker enrollment are
    presented for both normalization methods: in the framework of SAT, a pre-transformation
    step is proposed, which alone, i.e. without subsequent unsupervised MLLR adaption,
    reduces the error rate by almost 10% on the WSJ 5k test sets. For VTN, the use
    of a Gaussian mixture model makes obsolete a first recognition pass to obtain
    a preliminary transcription of the test utterance at hardly and loss in performance.'
author:
- first_name: L.
  full_name: Welling, L.
  last_name: Welling
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
- first_name: X.
  full_name: Aubert, X.
  last_name: Aubert
- first_name: N.
  full_name: Haberland, N.
  last_name: Haberland
citation:
  ama: 'Welling L, Haeb-Umbach R, Aubert X, Haberland N. A Study on Speaker Normalization
    Using Vocal Tract Normalization and Speaker Adaptive Training. In: <i>ICASSP 1998,
    Seattle</i>. ; 1998.'
  apa: Welling, L., Haeb-Umbach, R., Aubert, X., &#38; Haberland, N. (1998). A Study
    on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive
    Training. In <i>ICASSP 1998, Seattle</i>.
  bibtex: '@inproceedings{Welling_Haeb-Umbach_Aubert_Haberland_1998, title={A Study
    on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive
    Training}, booktitle={ICASSP 1998, Seattle}, author={Welling, L. and Haeb-Umbach,
    Reinhold and Aubert, X. and Haberland, N.}, year={1998} }'
  chicago: Welling, L., Reinhold Haeb-Umbach, X. Aubert, and N. Haberland. “A Study
    on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive
    Training.” In <i>ICASSP 1998, Seattle</i>, 1998.
  ieee: L. Welling, R. Haeb-Umbach, X. Aubert, and N. Haberland, “A Study on Speaker
    Normalization Using Vocal Tract Normalization and Speaker Adaptive Training,”
    in <i>ICASSP 1998, Seattle</i>, 1998.
  mla: Welling, L., et al. “A Study on Speaker Normalization Using Vocal Tract Normalization
    and Speaker Adaptive Training.” <i>ICASSP 1998, Seattle</i>, 1998.
  short: 'L. Welling, R. Haeb-Umbach, X. Aubert, N. Haberland, in: ICASSP 1998, Seattle,
    1998.'
date_created: 2019-07-12T05:31:07Z
date_updated: 2022-01-06T06:51:12Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/1998/ICASSP_1998_Haeb_paper.pdf
oa: '1'
publication: ICASSP 1998, Seattle
status: public
title: A Study on Speaker Normalization Using Vocal Tract Normalization and Speaker
  Adaptive Training
type: conference
user_id: '44006'
year: '1998'
...
---
_id: '11750'
abstract:
- lang: eng
  text: Addresses the problem of online, writer-independent, unconstrained handwriting
    recognition. Based on hidden Markov models (HMM), which are successfully employed
    in speech recognition tasks, we focus on representations which address scalability,
    recognition performance and compactness. 'Delayed' features are introduced which
    integrate more global, handwriting specific knowledge into the HMM representation.
    These features lead to larger error-rate reduction than 'delta' features which
    are known from speech recognition and even require fewer additional components.
    Scalability is addressed with a size-independent representation. Compactness is
    achieved with linear discriminant analysis. The representations are discussed
    and the results for a mixed-style word recognition task with vocabularies of 200
    (up to 99% correct words) and 20000 words (up to 88.8% correct words) are given.
author:
- first_name: J.G.A.
  full_name: Dolfing, J.G.A.
  last_name: Dolfing
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Dolfing JGA, Haeb-Umbach R. Signal Representations for Hidden Markov Model
    Based On-Line Handwriting Recognition. In: <i>ICASSP, Munich</i>. ; 1997.'
  apa: Dolfing, J. G. A., &#38; Haeb-Umbach, R. (1997). Signal Representations for
    Hidden Markov Model Based On-Line Handwriting Recognition. In <i>ICASSP, Munich</i>.
  bibtex: '@inproceedings{Dolfing_Haeb-Umbach_1997, title={Signal Representations
    for Hidden Markov Model Based On-Line Handwriting Recognition}, booktitle={ICASSP,
    Munich}, author={Dolfing, J.G.A. and Haeb-Umbach, Reinhold}, year={1997} }'
  chicago: Dolfing, J.G.A., and Reinhold Haeb-Umbach. “Signal Representations for
    Hidden Markov Model Based On-Line Handwriting Recognition.” In <i>ICASSP, Munich</i>,
    1997.
  ieee: J. G. A. Dolfing and R. Haeb-Umbach, “Signal Representations for Hidden Markov
    Model Based On-Line Handwriting Recognition,” in <i>ICASSP, Munich</i>, 1997.
  mla: Dolfing, J. G. A., and Reinhold Haeb-Umbach. “Signal Representations for Hidden
    Markov Model Based On-Line Handwriting Recognition.” <i>ICASSP, Munich</i>, 1997.
  short: 'J.G.A. Dolfing, R. Haeb-Umbach, in: ICASSP, Munich, 1997.'
date_created: 2019-07-12T05:27:32Z
date_updated: 2022-01-06T06:51:08Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/1997/ICASSP_1997_Haeb1_paper.pdf
oa: '1'
publication: ICASSP, Munich
status: public
title: Signal Representations for Hidden Markov Model Based On-Line Handwriting Recognition
type: conference
user_id: '44006'
year: '1997'
...
---
_id: '11766'
abstract:
- lang: eng
  text: This paper reports the design of a command-based speech interface for an answering
    machine or a voice mail system. Automatic speech recognition was integrated in
    order to facilitate the remote control and the retrieval of voice messages from
    any telephone in a speech-only dialogue. The design goal was that consumers would
    perceive the speech interface as a benefit compared with the common touch-tone
    interface. In this paper we will first describe the speech technology underlying
    the system. Then it will be shown how, based on this technology, the user interface
    was designed in a top-down approach. We started with the development of a concept
    and tested it by means of a Wizard-of-Oz simulation. After refining the concept
    in parallel design, it was implemented in a high-fidelity prototype. By means
    of qualitative user testing the design was improved in three iteration steps.
    The achievement of the design goal was finally verified with user tests in two
    countries.
author:
- first_name: Stephan
  full_name: Gamm, Stephan
  last_name: Gamm
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
- first_name: Detlev
  full_name: Langmann, Detlev
  last_name: Langmann
citation:
  ama: Gamm S, Haeb-Umbach R, Langmann D. The development of a command-based speech
    interface for a telephone answering machine. <i>Speech Communication</i>. 1997.
  apa: Gamm, S., Haeb-Umbach, R., &#38; Langmann, D. (1997). The development of a
    command-based speech interface for a telephone answering machine. <i>Speech Communication</i>.
  bibtex: '@article{Gamm_Haeb-Umbach_Langmann_1997, title={The development of a command-based
    speech interface for a telephone answering machine}, journal={Speech Communication},
    author={Gamm, Stephan and Haeb-Umbach, Reinhold and Langmann, Detlev}, year={1997}
    }'
  chicago: Gamm, Stephan, Reinhold Haeb-Umbach, and Detlev Langmann. “The Development
    of a Command-Based Speech Interface for a Telephone Answering Machine.” <i>Speech
    Communication</i>, 1997.
  ieee: S. Gamm, R. Haeb-Umbach, and D. Langmann, “The development of a command-based
    speech interface for a telephone answering machine,” <i>Speech Communication</i>,
    1997.
  mla: Gamm, Stephan, et al. “The Development of a Command-Based Speech Interface
    for a Telephone Answering Machine.” <i>Speech Communication</i>, 1997.
  short: S. Gamm, R. Haeb-Umbach, D. Langmann, Speech Communication (1997).
date_created: 2019-07-12T05:27:50Z
date_updated: 2022-01-06T06:51:08Z
department:
- _id: '54'
language:
- iso: eng
publication: Speech Communication
status: public
title: The development of a command-based speech interface for a telephone answering
  machine
type: journal_article
user_id: '44006'
year: '1997'
...
---
_id: '11781'
abstract:
- lang: eng
  text: 'The increased popularity of mobile telephony introduces both challenges and
    opportunitites for automatic speech recognition. ASR offers ways to simplify the
    use of mobile phones, notably in hands- and eyes-busy situations. However, the
    acoustic environment can be severely degraded and the wireless network may add
    additional distortions to the speech signal. This paper gives an overview of the
    sources of degradation and attempts to robust speech recognition for mobile communications.
    Emphasis is placed on approaches which are suitable for implementation in mobile
    terminals. Two example applications are described which illustrate the robustness
    issues and design considerations typical of low-cost noisy speech recognition:
    voice-dialling in a GSM phone and hands-free digit recognition in the car.'
author:
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Haeb-Umbach R. Robust Speech Recognition for Wireless Networks and Mobile
    Telephony. In: <i>Eurospeech</i>. ; 1997.'
  apa: Haeb-Umbach, R. (1997). Robust Speech Recognition for Wireless Networks and
    Mobile Telephony. In <i>Eurospeech</i>.
  bibtex: '@inproceedings{Haeb-Umbach_1997, title={Robust Speech Recognition for Wireless
    Networks and Mobile Telephony}, booktitle={Eurospeech}, author={Haeb-Umbach, Reinhold},
    year={1997} }'
  chicago: Haeb-Umbach, Reinhold. “Robust Speech Recognition for Wireless Networks
    and Mobile Telephony.” In <i>Eurospeech</i>, 1997.
  ieee: R. Haeb-Umbach, “Robust Speech Recognition for Wireless Networks and Mobile
    Telephony,” in <i>Eurospeech</i>, 1997.
  mla: Haeb-Umbach, Reinhold. “Robust Speech Recognition for Wireless Networks and
    Mobile Telephony.” <i>Eurospeech</i>, 1997.
  short: 'R. Haeb-Umbach, in: Eurospeech, 1997.'
date_created: 2019-07-12T05:28:08Z
date_updated: 2022-01-06T06:51:08Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/1997/Eurospeech_1997_Haeb1_paper.pdf
oa: '1'
publication: Eurospeech
status: public
title: Robust Speech Recognition for Wireless Networks and Mobile Telephony
type: conference
user_id: '44006'
year: '1997'
...
---
_id: '11819'
abstract:
- lang: eng
  text: The SpeechDat project aims to produce speech databases for all official languages
    of the European Union and some major dialectal variants and minority languages
    resulting in 28 speech databases. They will be recorded over fixed and mobile
    telephone networks. This will provide a realistic basis for training and assessment
    of both isolated and continuous-speech utterances, employing whole-word or subword
    approaches, and thus can be used for developing voice driven teleservices including
    speaker verification. The specification of the databases has been developed jointly,
    and is essentially the same for each language to facilitate dissemination and
    use. There will be a controlled variation among the speakers concerning sex, age,
    dialect, environment of call, etc. The validation of all databases will be carried
    out centrally. The SpeechDat databases will be transferred to ELRA for distribution.
    The next databases to be recorded will cover East European languages.
author:
- first_name: H.
  full_name: Hoege, H.
  last_name: Hoege
- first_name: H. S.
  full_name: Tropf, H. S.
  last_name: Tropf
- first_name: R.
  full_name: Winsky, R.
  last_name: Winsky
- first_name: H.
  full_name: van den Heuvel, H.
  last_name: van den Heuvel
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
- first_name: K.
  full_name: Choukri, K.
  last_name: Choukri
citation:
  ama: 'Hoege H, Tropf HS, Winsky R, van den Heuvel H, Haeb-Umbach R, Choukri K. European
    Speech Databases for Telephone Applications. In: <i>ICASSP, Munich</i>. ; 1997.'
  apa: Hoege, H., Tropf, H. S., Winsky, R., van den Heuvel, H., Haeb-Umbach, R., &#38;
    Choukri, K. (1997). European Speech Databases for Telephone Applications. In <i>ICASSP,
    Munich</i>.
  bibtex: '@inproceedings{Hoege_Tropf_Winsky_van den Heuvel_Haeb-Umbach_Choukri_1997,
    title={European Speech Databases for Telephone Applications}, booktitle={ICASSP,
    Munich}, author={Hoege, H. and Tropf, H. S. and Winsky, R. and van den Heuvel,
    H. and Haeb-Umbach, Reinhold and Choukri, K.}, year={1997} }'
  chicago: Hoege, H., H. S. Tropf, R. Winsky, H. van den Heuvel, Reinhold Haeb-Umbach,
    and K. Choukri. “European Speech Databases for Telephone Applications.” In <i>ICASSP,
    Munich</i>, 1997.
  ieee: H. Hoege, H. S. Tropf, R. Winsky, H. van den Heuvel, R. Haeb-Umbach, and K.
    Choukri, “European Speech Databases for Telephone Applications,” in <i>ICASSP,
    Munich</i>, 1997.
  mla: Hoege, H., et al. “European Speech Databases for Telephone Applications.” <i>ICASSP,
    Munich</i>, 1997.
  short: 'H. Hoege, H.S. Tropf, R. Winsky, H. van den Heuvel, R. Haeb-Umbach, K. Choukri,
    in: ICASSP, Munich, 1997.'
date_created: 2019-07-12T05:28:52Z
date_updated: 2022-01-06T06:51:09Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/1997/ICASSP_1997_Haeb_paper.pdf
oa: '1'
publication: ICASSP, Munich
status: public
title: European Speech Databases for Telephone Applications
type: conference
user_id: '44006'
year: '1997'
...
---
_id: '11852'
abstract:
- lang: eng
  text: This paper describes speaker-independent speech recognition experiments concerning
    acoustic front end processing on a speech database that was recorded in 3 different
    cars. We investigate different feature analysis approaches (mel-filter bank, mel-cepstrum,
    perceptually linear predictive coding) and present results with noise compensation
    techniques based on spectral subtraction. Although the methods employed lead to
    considerable error rate reduction the error analysis shows that low signal-to-noise
    ratios are still a problem
author:
- first_name: Detlev
  full_name: Langmann, Detlev
  last_name: Langmann
- first_name: Alexander
  full_name: Fischer, Alexander
  last_name: Fischer
- first_name: Friedhelm
  full_name: Wuppermann, Friedhelm
  last_name: Wuppermann
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
- first_name: Thomas
  full_name: Eisele, Thomas
  last_name: Eisele
citation:
  ama: 'Langmann D, Fischer A, Wuppermann F, Haeb-Umbach R, Eisele T. Acoustic Front
    Ends for Speaker-Independent Digit Recognition in Car Environments. In: <i>Eurospeech</i>.
    ; 1997.'
  apa: Langmann, D., Fischer, A., Wuppermann, F., Haeb-Umbach, R., &#38; Eisele, T.
    (1997). Acoustic Front Ends for Speaker-Independent Digit Recognition in Car Environments.
    In <i>Eurospeech</i>.
  bibtex: '@inproceedings{Langmann_Fischer_Wuppermann_Haeb-Umbach_Eisele_1997, title={Acoustic
    Front Ends for Speaker-Independent Digit Recognition in Car Environments}, booktitle={Eurospeech},
    author={Langmann, Detlev and Fischer, Alexander and Wuppermann, Friedhelm and
    Haeb-Umbach, Reinhold and Eisele, Thomas}, year={1997} }'
  chicago: Langmann, Detlev, Alexander Fischer, Friedhelm Wuppermann, Reinhold Haeb-Umbach,
    and Thomas Eisele. “Acoustic Front Ends for Speaker-Independent Digit Recognition
    in Car Environments.” In <i>Eurospeech</i>, 1997.
  ieee: D. Langmann, A. Fischer, F. Wuppermann, R. Haeb-Umbach, and T. Eisele, “Acoustic
    Front Ends for Speaker-Independent Digit Recognition in Car Environments,” in
    <i>Eurospeech</i>, 1997.
  mla: Langmann, Detlev, et al. “Acoustic Front Ends for Speaker-Independent Digit
    Recognition in Car Environments.” <i>Eurospeech</i>, 1997.
  short: 'D. Langmann, A. Fischer, F. Wuppermann, R. Haeb-Umbach, T. Eisele, in: Eurospeech,
    1997.'
date_created: 2019-07-12T05:29:30Z
date_updated: 2022-01-06T06:51:11Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/1997/Eurospeech_1997_Haeb_paper.pdf
oa: '1'
publication: Eurospeech
status: public
title: Acoustic Front Ends for Speaker-Independent Digit Recognition in Car Environments
type: conference
user_id: '44006'
year: '1997'
...
---
_id: '11855'
author:
- first_name: Detlev
  full_name: Langmann, Detlev
  last_name: Langmann
- first_name: Friedhelm
  full_name: Wuppermann, Friedhelm
  last_name: Wuppermann
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
- first_name: A.
  full_name: Fischer, A.
  last_name: Fischer
- first_name: Thomas
  full_name: Eisele, Thomas
  last_name: Eisele
citation:
  ama: 'Langmann D, Wuppermann F, Haeb-Umbach R, Fischer A, Eisele T. Investigation
    of Acoustic Front Ends for Speaker-Independent Speech Recognition in the Car.
    In: <i>Aachener Kolloquium on Signal Theory</i>. ; 1997.'
  apa: Langmann, D., Wuppermann, F., Haeb-Umbach, R., Fischer, A., &#38; Eisele, T.
    (1997). Investigation of Acoustic Front Ends for Speaker-Independent Speech Recognition
    in the Car. In <i>Aachener Kolloquium on Signal Theory</i>.
  bibtex: '@inproceedings{Langmann_Wuppermann_Haeb-Umbach_Fischer_Eisele_1997, title={Investigation
    of Acoustic Front Ends for Speaker-Independent Speech Recognition in the Car},
    booktitle={Aachener Kolloquium on Signal Theory}, author={Langmann, Detlev and
    Wuppermann, Friedhelm and Haeb-Umbach, Reinhold and Fischer, A. and Eisele, Thomas},
    year={1997} }'
  chicago: Langmann, Detlev, Friedhelm Wuppermann, Reinhold Haeb-Umbach, A. Fischer,
    and Thomas Eisele. “Investigation of Acoustic Front Ends for Speaker-Independent
    Speech Recognition in the Car.” In <i>Aachener Kolloquium on Signal Theory</i>,
    1997.
  ieee: D. Langmann, F. Wuppermann, R. Haeb-Umbach, A. Fischer, and T. Eisele, “Investigation
    of Acoustic Front Ends for Speaker-Independent Speech Recognition in the Car,”
    in <i>Aachener Kolloquium on Signal Theory</i>, 1997.
  mla: Langmann, Detlev, et al. “Investigation of Acoustic Front Ends for Speaker-Independent
    Speech Recognition in the Car.” <i>Aachener Kolloquium on Signal Theory</i>, 1997.
  short: 'D. Langmann, F. Wuppermann, R. Haeb-Umbach, A. Fischer, T. Eisele, in: Aachener
    Kolloquium on Signal Theory, 1997.'
date_created: 2019-07-12T05:29:34Z
date_updated: 2022-01-06T06:51:11Z
department:
- _id: '54'
language:
- iso: eng
publication: Aachener Kolloquium on Signal Theory
status: public
title: Investigation of Acoustic Front Ends for Speaker-Independent Speech Recognition
  in the Car
type: conference
user_id: '44006'
year: '1997'
...
---
_id: '11761'
abstract:
- lang: eng
  text: 'Although widely used, there are still open questions concerning which properties
    of linear discriminant analysis (LDA) account for its success in many speech recognition
    systems. In order to gain more insight into the nature of the transformation we
    compare LDA with mel-cepstral feature vectors with respect to the following criteria:
    decorrelation and ordering property; invariance under linear transforms; automatic
    learning of dynamical features; and data dependence of the transformation.'
author:
- first_name: Thomas
  full_name: Eisele, Thomas
  last_name: Eisele
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
- first_name: Detlev
  full_name: Langmann, Detlev
  last_name: Langmann
citation:
  ama: 'Eisele T, Haeb-Umbach R, Langmann D. A Comparative Study of Linear Feature
    Transformation Techniques for Automatic Speech Recognition. In: <i>ICSLP , Philadelphia</i>.
    ; 1996.'
  apa: Eisele, T., Haeb-Umbach, R., &#38; Langmann, D. (1996). A Comparative Study
    of Linear Feature Transformation Techniques for Automatic Speech Recognition.
    In <i>ICSLP , Philadelphia</i>.
  bibtex: '@inproceedings{Eisele_Haeb-Umbach_Langmann_1996, title={A Comparative Study
    of Linear Feature Transformation Techniques for Automatic Speech Recognition},
    booktitle={ICSLP , Philadelphia}, author={Eisele, Thomas and Haeb-Umbach, Reinhold
    and Langmann, Detlev}, year={1996} }'
  chicago: Eisele, Thomas, Reinhold Haeb-Umbach, and Detlev Langmann. “A Comparative
    Study of Linear Feature Transformation Techniques for Automatic Speech Recognition.”
    In <i>ICSLP , Philadelphia</i>, 1996.
  ieee: T. Eisele, R. Haeb-Umbach, and D. Langmann, “A Comparative Study of Linear
    Feature Transformation Techniques for Automatic Speech Recognition,” in <i>ICSLP
    , Philadelphia</i>, 1996.
  mla: Eisele, Thomas, et al. “A Comparative Study of Linear Feature Transformation
    Techniques for Automatic Speech Recognition.” <i>ICSLP , Philadelphia</i>, 1996.
  short: 'T. Eisele, R. Haeb-Umbach, D. Langmann, in: ICSLP , Philadelphia, 1996.'
date_created: 2019-07-12T05:27:45Z
date_updated: 2022-01-06T06:51:08Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/1996/ICSLP_1996_Haeb1_paper.pdf
oa: '1'
publication: ICSLP , Philadelphia
status: public
title: A Comparative Study of Linear Feature Transformation Techniques for Automatic
  Speech Recognition
type: conference
user_id: '44006'
year: '1996'
...
---
_id: '11767'
abstract:
- lang: eng
  text: This paper tells the story of the design of a command-based speech interface
    for a voice mail system. Speech recognition was integrated in the voice mail system
    in order to allow the remote interrogation of messages in a speech-only dialogue.
    Our design goal was that consumers would perceive voice control as a clear benefit
    versus touch-tone control. It is shown how the speech interface was designed in
    a top-down approach. We started with a concept development and tested it by means
    of a Wizard-of-Oz simulation. After refining the concept in parallel design, the
    design was implemented in a high-fidelity prototype. By means of qualitative user
    testing it was improved in three iteration steps. We verified the achievement
    of our design goal with tests in two countries
author:
- first_name: Stephan
  full_name: Gamm, Stephan
  last_name: Gamm
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
- first_name: Detlev
  full_name: Langmann, Detlev
  last_name: Langmann
citation:
  ama: 'Gamm S, Haeb-Umbach R, Langmann D. Findings with the Design of a Command-Based
    Speech Interface for a Voice Mail System. In: <i>IEEE Workshop on Interactive
    Voice Technology for Telecommunications Applications</i>. ; 1996.'
  apa: Gamm, S., Haeb-Umbach, R., &#38; Langmann, D. (1996). Findings with the Design
    of a Command-Based Speech Interface for a Voice Mail System. In <i>IEEE Workshop
    on Interactive Voice Technology for Telecommunications Applications</i>.
  bibtex: '@inproceedings{Gamm_Haeb-Umbach_Langmann_1996, title={Findings with the
    Design of a Command-Based Speech Interface for a Voice Mail System}, booktitle={IEEE
    Workshop on Interactive Voice Technology for Telecommunications Applications},
    author={Gamm, Stephan and Haeb-Umbach, Reinhold and Langmann, Detlev}, year={1996}
    }'
  chicago: Gamm, Stephan, Reinhold Haeb-Umbach, and Detlev Langmann. “Findings with
    the Design of a Command-Based Speech Interface for a Voice Mail System.” In <i>IEEE
    Workshop on Interactive Voice Technology for Telecommunications Applications</i>,
    1996.
  ieee: S. Gamm, R. Haeb-Umbach, and D. Langmann, “Findings with the Design of a Command-Based
    Speech Interface for a Voice Mail System,” in <i>IEEE Workshop on Interactive
    Voice Technology for Telecommunications Applications</i>, 1996.
  mla: Gamm, Stephan, et al. “Findings with the Design of a Command-Based Speech Interface
    for a Voice Mail System.” <i>IEEE Workshop on Interactive Voice Technology for
    Telecommunications Applications</i>, 1996.
  short: 'S. Gamm, R. Haeb-Umbach, D. Langmann, in: IEEE Workshop on Interactive Voice
    Technology for Telecommunications Applications, 1996.'
date_created: 2019-07-12T05:27:52Z
date_updated: 2022-01-06T06:51:08Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/1996/Workshop_1996__Haeb_paper.pdf
oa: '1'
publication: IEEE Workshop on Interactive Voice Technology for Telecommunications
  Applications
status: public
title: Findings with the Design of a Command-Based Speech Interface for a Voice Mail
  System
type: conference
user_id: '44006'
year: '1996'
...
---
_id: '11853'
abstract:
- lang: eng
  text: The paper describes the design, collection and postprocessing of the French
    SpeechDat corpus FRESCO. Being a database of approximately 35000 utterances recorded
    from 1000 callers over the terrestrial telephone network in France, it comprises
    immediately usable and relevant speech for the initial training and assessment
    of speaker independent phoneme model or word model based speech recognizers, as
    they are employed in automated telephone services. FRESCO is one of the 1000 speaker
    telephone speech databases produced as "case studies" within the European project
    SpeechDat(M).
author:
- first_name: Detlev
  full_name: Langmann, Detlev
  last_name: Langmann
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Langmann D, Haeb-Umbach R. FRESCO: The French Telephone Speech Data Collection
    - Part of the European SpeechDat(M) Project. In: <i>ICSLP, Philadelphia</i>. ;
    1996.'
  apa: 'Langmann, D., &#38; Haeb-Umbach, R. (1996). FRESCO: The French Telephone Speech
    Data Collection - Part of the European SpeechDat(M) Project. In <i>ICSLP, Philadelphia</i>.'
  bibtex: '@inproceedings{Langmann_Haeb-Umbach_1996, title={FRESCO: The French Telephone
    Speech Data Collection - Part of the European SpeechDat(M) Project}, booktitle={ICSLP,
    Philadelphia}, author={Langmann, Detlev and Haeb-Umbach, Reinhold}, year={1996}
    }'
  chicago: 'Langmann, Detlev, and Reinhold Haeb-Umbach. “FRESCO: The French Telephone
    Speech Data Collection - Part of the European SpeechDat(M) Project.” In <i>ICSLP,
    Philadelphia</i>, 1996.'
  ieee: 'D. Langmann and R. Haeb-Umbach, “FRESCO: The French Telephone Speech Data
    Collection - Part of the European SpeechDat(M) Project,” in <i>ICSLP, Philadelphia</i>,
    1996.'
  mla: 'Langmann, Detlev, and Reinhold Haeb-Umbach. “FRESCO: The French Telephone
    Speech Data Collection - Part of the European SpeechDat(M) Project.” <i>ICSLP,
    Philadelphia</i>, 1996.'
  short: 'D. Langmann, R. Haeb-Umbach, in: ICSLP, Philadelphia, 1996.'
date_created: 2019-07-12T05:29:31Z
date_updated: 2022-01-06T06:51:11Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/1996/ICSLP_1996_Haeb_paper.pdf
oa: '1'
publication: ICSLP, Philadelphia
status: public
title: 'FRESCO: The French Telephone Speech Data Collection - Part of the European
  SpeechDat(M) Project'
type: conference
user_id: '44006'
year: '1996'
...
---
_id: '11854'
author:
- first_name: Detlev
  full_name: Langmann, Detlev
  last_name: Langmann
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
- first_name: Thomas
  full_name: Eisele, Thomas
  last_name: Eisele
citation:
  ama: 'Langmann D, Haeb-Umbach R, Eisele T. Robust Rejection Modeling for a Small-Vocabulary
    Application. In: <i>ITG Fachtagung Sprachkommunikation, Frankfurt</i>. ; 1996.'
  apa: Langmann, D., Haeb-Umbach, R., &#38; Eisele, T. (1996). Robust Rejection Modeling
    for a Small-Vocabulary Application. In <i>ITG Fachtagung Sprachkommunikation,
    Frankfurt</i>.
  bibtex: '@inproceedings{Langmann_Haeb-Umbach_Eisele_1996, title={Robust Rejection
    Modeling for a Small-Vocabulary Application}, booktitle={ITG Fachtagung Sprachkommunikation,
    Frankfurt}, author={Langmann, Detlev and Haeb-Umbach, Reinhold and Eisele, Thomas},
    year={1996} }'
  chicago: Langmann, Detlev, Reinhold Haeb-Umbach, and Thomas Eisele. “Robust Rejection
    Modeling for a Small-Vocabulary Application.” In <i>ITG Fachtagung Sprachkommunikation,
    Frankfurt</i>, 1996.
  ieee: D. Langmann, R. Haeb-Umbach, and T. Eisele, “Robust Rejection Modeling for
    a Small-Vocabulary Application,” in <i>ITG Fachtagung Sprachkommunikation, Frankfurt</i>,
    1996.
  mla: Langmann, Detlev, et al. “Robust Rejection Modeling for a Small-Vocabulary
    Application.” <i>ITG Fachtagung Sprachkommunikation, Frankfurt</i>, 1996.
  short: 'D. Langmann, R. Haeb-Umbach, T. Eisele, in: ITG Fachtagung Sprachkommunikation,
    Frankfurt, 1996.'
date_created: 2019-07-12T05:29:32Z
date_updated: 2022-01-06T06:51:11Z
department:
- _id: '54'
language:
- iso: eng
publication: ITG Fachtagung Sprachkommunikation, Frankfurt
status: public
title: Robust Rejection Modeling for a Small-Vocabulary Application
type: conference
user_id: '44006'
year: '1996'
...
---
_id: '11757'
abstract:
- lang: eng
  text: Clustering techniques have been integrated at different levels into the training
    procedure of a continuous-density hidden Markov model (HMM) speech recognizer.
    These clustering techniques can be used in two ways. First acoustically similar
    states are tied together. It will help to reduce the number of parameters but
    also allow to train otherwise rarely seen states together with more robust ones
    (state-tying). Secondly densities are clustered across states, this reduces the
    number of densities while at the same time keeping the best performances of our
    recognizer (density-clustering). We have applied these techniques both to word-based
    small-vocabulary and phoneme-based large-vocabulary recognition tasks. On the
    WSJ task, we could achieve a reduction of the word error rate by 7%. On the TI/NIST-connected
    digit task, the number of parameters was reduced by a factor 2-3 while keeping
    the same string error rate.
author:
- first_name: Christian
  full_name: Dugast, Christian
  last_name: Dugast
- first_name: Peter
  full_name: Beyerlein, Peter
  last_name: Beyerlein
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Dugast C, Beyerlein P, Haeb-Umbach R. Application of Clustering Techniques
    to Mixture Density Modelling for Continuous-Speech Recognition. In: <i>ICASSP,
    Detroit</i>. ; 1995.'
  apa: Dugast, C., Beyerlein, P., &#38; Haeb-Umbach, R. (1995). Application of Clustering
    Techniques to Mixture Density Modelling for Continuous-Speech Recognition. In
    <i>ICASSP, Detroit</i>.
  bibtex: '@inproceedings{Dugast_Beyerlein_Haeb-Umbach_1995, title={Application of
    Clustering Techniques to Mixture Density Modelling for Continuous-Speech Recognition},
    booktitle={ICASSP, Detroit}, author={Dugast, Christian and Beyerlein, Peter and
    Haeb-Umbach, Reinhold}, year={1995} }'
  chicago: Dugast, Christian, Peter Beyerlein, and Reinhold Haeb-Umbach. “Application
    of Clustering Techniques to Mixture Density Modelling for Continuous-Speech Recognition.”
    In <i>ICASSP, Detroit</i>, 1995.
  ieee: C. Dugast, P. Beyerlein, and R. Haeb-Umbach, “Application of Clustering Techniques
    to Mixture Density Modelling for Continuous-Speech Recognition,” in <i>ICASSP,
    Detroit</i>, 1995.
  mla: Dugast, Christian, et al. “Application of Clustering Techniques to Mixture
    Density Modelling for Continuous-Speech Recognition.” <i>ICASSP, Detroit</i>,
    1995.
  short: 'C. Dugast, P. Beyerlein, R. Haeb-Umbach, in: ICASSP, Detroit, 1995.'
date_created: 2019-07-12T05:27:40Z
date_updated: 2022-01-06T06:51:08Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/1995/ICASSP_1995_Haeb_paper.pdf
oa: '1'
publication: ICASSP, Detroit
status: public
title: Application of Clustering Techniques to Mixture Density Modelling for Continuous-Speech
  Recognition
type: conference
user_id: '44006'
year: '1995'
...
---
_id: '11764'
abstract:
- lang: eng
  text: Today speech recognition of a small vocabulary can be realized so cost-effectively
    that the technology can penetrate into consumer electronics. But, as first applications
    that failed on the market show, it is by no means obvious how to incorporate voice
    control in a user interface. This paper addresses the issue of how to design a
    voice control so that the user perceives it as a benefit. User interface guidelines
    that are adapted or specific to voice control are presented. Then the process
    of designing a voice control in the user-centred approach is described. By means
    of two examples, the car stereo and telephone answering machine, it is shown how
    this is turned into practice.
author:
- first_name: Stephan
  full_name: Gamm, Stephan
  last_name: Gamm
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: Gamm S, Haeb-Umbach R. User interface design of voice controlled consumer electronics.
    <i>Philips Journal of Research</i>. 1995.
  apa: Gamm, S., &#38; Haeb-Umbach, R. (1995). User interface design of voice controlled
    consumer electronics. <i>Philips Journal of Research</i>.
  bibtex: '@article{Gamm_Haeb-Umbach_1995, title={User interface design of voice controlled
    consumer electronics}, journal={Philips Journal of Research}, author={Gamm, Stephan
    and Haeb-Umbach, Reinhold}, year={1995} }'
  chicago: Gamm, Stephan, and Reinhold Haeb-Umbach. “User Interface Design of Voice
    Controlled Consumer Electronics.” <i>Philips Journal of Research</i>, 1995.
  ieee: S. Gamm and R. Haeb-Umbach, “User interface design of voice controlled consumer
    electronics,” <i>Philips Journal of Research</i>, 1995.
  mla: Gamm, Stephan, and Reinhold Haeb-Umbach. “User Interface Design of Voice Controlled
    Consumer Electronics.” <i>Philips Journal of Research</i>, 1995.
  short: S. Gamm, R. Haeb-Umbach, Philips Journal of Research (1995).
date_created: 2019-07-12T05:27:48Z
date_updated: 2022-01-06T06:51:08Z
department:
- _id: '54'
language:
- iso: eng
publication: Philips Journal of Research
status: public
title: User interface design of voice controlled consumer electronics
type: journal_article
user_id: '44006'
year: '1995'
...
---
_id: '11765'
author:
- first_name: Stephan
  full_name: Gamm, Stephan
  last_name: Gamm
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Gamm S, Haeb-Umbach R. Human Factors of a Voice-Controlled Car Stereo. In:
    <i>Eurospeech, Madrid</i>. ; 1995.'
  apa: Gamm, S., &#38; Haeb-Umbach, R. (1995). Human Factors of a Voice-Controlled
    Car Stereo. In <i>Eurospeech, Madrid</i>.
  bibtex: '@inproceedings{Gamm_Haeb-Umbach_1995, title={Human Factors of a Voice-Controlled
    Car Stereo}, booktitle={Eurospeech, Madrid}, author={Gamm, Stephan and Haeb-Umbach,
    Reinhold}, year={1995} }'
  chicago: Gamm, Stephan, and Reinhold Haeb-Umbach. “Human Factors of a Voice-Controlled
    Car Stereo.” In <i>Eurospeech, Madrid</i>, 1995.
  ieee: S. Gamm and R. Haeb-Umbach, “Human Factors of a Voice-Controlled Car Stereo,”
    in <i>Eurospeech, Madrid</i>, 1995.
  mla: Gamm, Stephan, and Reinhold Haeb-Umbach. “Human Factors of a Voice-Controlled
    Car Stereo.” <i>Eurospeech, Madrid</i>, 1995.
  short: 'S. Gamm, R. Haeb-Umbach, in: Eurospeech, Madrid, 1995.'
date_created: 2019-07-12T05:27:49Z
date_updated: 2022-01-06T06:51:08Z
department:
- _id: '54'
language:
- iso: eng
publication: Eurospeech, Madrid
status: public
title: Human Factors of a Voice-Controlled Car Stereo
type: conference
user_id: '44006'
year: '1995'
...
---
_id: '11768'
author:
- first_name: Stephan
  full_name: Gamm, Stephan
  last_name: Gamm
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
- first_name: Det
  full_name: Langmann, Det
  last_name: Langmann
citation:
  ama: 'Gamm S, Haeb-Umbach R, Langmann D. The Usability Engineering of a Voice-Controlled
    Answering Machine. In: <i>International Symposium on Human Factors in Telecommunications,
    Melbourne</i>. ; 1995.'
  apa: Gamm, S., Haeb-Umbach, R., &#38; Langmann, D. (1995). The Usability Engineering
    of a Voice-Controlled Answering Machine. In <i>International Symposium on Human
    Factors in Telecommunications, Melbourne</i>.
  bibtex: '@inproceedings{Gamm_Haeb-Umbach_Langmann_1995, title={The Usability Engineering
    of a Voice-Controlled Answering Machine}, booktitle={International Symposium on
    Human Factors in Telecommunications, Melbourne}, author={Gamm, Stephan and Haeb-Umbach,
    Reinhold and Langmann, Det}, year={1995} }'
  chicago: Gamm, Stephan, Reinhold Haeb-Umbach, and Det Langmann. “The Usability Engineering
    of a Voice-Controlled Answering Machine.” In <i>International Symposium on Human
    Factors in Telecommunications, Melbourne</i>, 1995.
  ieee: S. Gamm, R. Haeb-Umbach, and D. Langmann, “The Usability Engineering of a
    Voice-Controlled Answering Machine,” in <i>International Symposium on Human Factors
    in Telecommunications, Melbourne</i>, 1995.
  mla: Gamm, Stephan, et al. “The Usability Engineering of a Voice-Controlled Answering
    Machine.” <i>International Symposium on Human Factors in Telecommunications, Melbourne</i>,
    1995.
  short: 'S. Gamm, R. Haeb-Umbach, D. Langmann, in: International Symposium on Human
    Factors in Telecommunications, Melbourne, 1995.'
date_created: 2019-07-12T05:27:53Z
date_updated: 2022-01-06T06:51:08Z
department:
- _id: '54'
language:
- iso: eng
publication: International Symposium on Human Factors in Telecommunications, Melbourne
status: public
title: The Usability Engineering of a Voice-Controlled Answering Machine
type: conference
user_id: '44006'
year: '1995'
...
---
_id: '11786'
abstract:
- lang: eng
  text: 'Recognition accuracy has been the primary objective of most speech recognition
    research, and impressive results have been obtained, e.g. less than 0.3% word
    error rate on a speaker-independent digit recognition task. When it comes to real-world
    applications, robustness and real-time response might be more important issues.
    For the first requirement we review some of the work on robustness and discuss
    one specific technique, spectral normalization, in more detail. The requirement
    of real-time response has to be considered in the light of the limited hardware
    resources in voice control applications, which are due to the tight cost constraints.
    In this paper we discuss in detail one specific means to reduce the processing
    and memory demands: a clustering technique applied at various levels within the
    acoustic modelling.'
author:
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
- first_name: Peter
  full_name: Beyerlein, Peter
  last_name: Beyerlein
- first_name: Dieter
  full_name: Geller, Dieter
  last_name: Geller
citation:
  ama: Haeb-Umbach R, Beyerlein P, Geller D. Speech recognition algorithms for voice
    control interfaces. <i>Philips Journal of Research</i>. 1995.
  apa: Haeb-Umbach, R., Beyerlein, P., &#38; Geller, D. (1995). Speech recognition
    algorithms for voice control interfaces. <i>Philips Journal of Research</i>.
  bibtex: '@article{Haeb-Umbach_Beyerlein_Geller_1995, title={Speech recognition algorithms
    for voice control interfaces}, journal={Philips Journal of Research}, author={Haeb-Umbach,
    Reinhold and Beyerlein, Peter and Geller, Dieter}, year={1995} }'
  chicago: Haeb-Umbach, Reinhold, Peter Beyerlein, and Dieter Geller. “Speech Recognition
    Algorithms for Voice Control Interfaces.” <i>Philips Journal of Research</i>,
    1995.
  ieee: R. Haeb-Umbach, P. Beyerlein, and D. Geller, “Speech recognition algorithms
    for voice control interfaces,” <i>Philips Journal of Research</i>, 1995.
  mla: Haeb-Umbach, Reinhold, et al. “Speech Recognition Algorithms for Voice Control
    Interfaces.” <i>Philips Journal of Research</i>, 1995.
  short: R. Haeb-Umbach, P. Beyerlein, D. Geller, Philips Journal of Research (1995).
date_created: 2019-07-12T05:28:14Z
date_updated: 2022-01-06T06:51:08Z
department:
- _id: '54'
language:
- iso: eng
publication: Philips Journal of Research
status: public
title: Speech recognition algorithms for voice control interfaces
type: journal_article
user_id: '44006'
year: '1995'
...
---
_id: '11787'
abstract:
- lang: eng
  text: We address the problem of automatically finding an acoustic representation
    (i.e. a transcription) of unknown words as a sequence of subword units, given
    a few sample utterances of the unknown words, and an inventory of speaker-independent
    subword units. The problem arises if a user wants to add his own vocabulary to
    a speaker-independent recognition system simply by speaking the words a few times.
    Two methods are investigated which are both based on a maximum-likelihood formulation
    of the problem. The experimental results show that both automatic transcription
    methods provide a good estimate of the acoustic models of unknown words. The recognition
    error rates obtained with such models in a speaker-independent recognition task
    are clearly better than those resulting from separate whole-word models. They
    are comparable with the performance of transcriptions drawn from a dictionary.
author:
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
- first_name: P.
  full_name: Beyerlein, P.
  last_name: Beyerlein
- first_name: E.
  full_name: Thelen, E.
  last_name: Thelen
citation:
  ama: 'Haeb-Umbach R, Beyerlein P, Thelen E. Automatic Transcription of Unknown Words
    in a Speech Recognition System. In: <i>ICASSP, Detroit</i>. ; 1995.'
  apa: Haeb-Umbach, R., Beyerlein, P., &#38; Thelen, E. (1995). Automatic Transcription
    of Unknown Words in a Speech Recognition System. In <i>ICASSP, Detroit</i>.
  bibtex: '@inproceedings{Haeb-Umbach_Beyerlein_Thelen_1995, title={Automatic Transcription
    of Unknown Words in a Speech Recognition System}, booktitle={ICASSP, Detroit},
    author={Haeb-Umbach, Reinhold and Beyerlein, P. and Thelen, E.}, year={1995} }'
  chicago: Haeb-Umbach, Reinhold, P. Beyerlein, and E. Thelen. “Automatic Transcription
    of Unknown Words in a Speech Recognition System.” In <i>ICASSP, Detroit</i>, 1995.
  ieee: R. Haeb-Umbach, P. Beyerlein, and E. Thelen, “Automatic Transcription of Unknown
    Words in a Speech Recognition System,” in <i>ICASSP, Detroit</i>, 1995.
  mla: Haeb-Umbach, Reinhold, et al. “Automatic Transcription of Unknown Words in
    a Speech Recognition System.” <i>ICASSP, Detroit</i>, 1995.
  short: 'R. Haeb-Umbach, P. Beyerlein, E. Thelen, in: ICASSP, Detroit, 1995.'
date_created: 2019-07-12T05:28:15Z
date_updated: 2022-01-06T06:51:08Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/1995/ICASSP_1995_Haeb1_paper.pdf
oa: '1'
publication: ICASSP, Detroit
status: public
title: Automatic Transcription of Unknown Words in a Speech Recognition System
type: conference
user_id: '44006'
year: '1995'
...
---
_id: '11905'
abstract:
- lang: eng
  text: This paper gives an overview of the Philips Research system for continuous-speech
    recognition. The recognition architecture is based on an integrated statistical
    approach. The system has been successfully applied to various tasks in American
    English and German, ranging from small vocabulary tasks to very large vocabulary
    tasks and from recognition only to speech understanding. Here, we concentrate
    on phoneme-based continuous-speech recognition for large vocabulary recognition
    as used for dictation, which covers a significant part of our research work on
    speech recognition. We describe this task and report on experimental results.
    In order to allow a comparison with the performance of other systems, a section
    with an evaluation on the standard North American Business news (NAB2) task (dictation
    of American English newspaper text) is supplied.
author:
- first_name: Volker
  full_name: Steinbiss, Volker
  last_name: Steinbiss
- first_name: Hermann J.
  full_name: Ney, Hermann J.
  last_name: Ney
- first_name: Xavier L.
  full_name: Aubert, Xavier L.
  last_name: Aubert
- first_name: Stefan
  full_name: Besling, Stefan
  last_name: Besling
- first_name: Christian
  full_name: Dugast, Christian
  last_name: Dugast
- first_name: Ute
  full_name: Essen, Ute
  last_name: Essen
- first_name: Dieter
  full_name: Geller, Dieter
  last_name: Geller
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
- first_name: Reinhard
  full_name: Kneser, Reinhard
  last_name: Kneser
- first_name: Hans Günter
  full_name: Meier, Hans Günter
  last_name: Meier
- first_name: Martin
  full_name: Oerder, Martin
  last_name: Oerder
- first_name: Bach Hiep
  full_name: Tran, Bach Hiep
  last_name: Tran
citation:
  ama: Steinbiss V, Ney HJ, Aubert XL, et al. The Philips Research system for continuous-speech
    dictation. <i>Philips Journal of Research</i>. 1995.
  apa: Steinbiss, V., Ney, H. J., Aubert, X. L., Besling, S., Dugast, C., Essen, U.,
    … Tran, B. H. (1995). The Philips Research system for continuous-speech dictation.
    <i>Philips Journal of Research</i>.
  bibtex: '@article{Steinbiss_Ney_Aubert_Besling_Dugast_Essen_Geller_Haeb-Umbach_Kneser_Meier_et
    al._1995, title={The Philips Research system for continuous-speech dictation},
    journal={Philips Journal of Research}, author={Steinbiss, Volker and Ney, Hermann
    J. and Aubert, Xavier L. and Besling, Stefan and Dugast, Christian and Essen,
    Ute and Geller, Dieter and Haeb-Umbach, Reinhold and Kneser, Reinhard and Meier,
    Hans Günter and et al.}, year={1995} }'
  chicago: Steinbiss, Volker, Hermann J. Ney, Xavier L. Aubert, Stefan Besling, Christian
    Dugast, Ute Essen, Dieter Geller, et al. “The Philips Research System for Continuous-Speech
    Dictation.” <i>Philips Journal of Research</i>, 1995.
  ieee: V. Steinbiss <i>et al.</i>, “The Philips Research system for continuous-speech
    dictation,” <i>Philips Journal of Research</i>, 1995.
  mla: Steinbiss, Volker, et al. “The Philips Research System for Continuous-Speech
    Dictation.” <i>Philips Journal of Research</i>, 1995.
  short: V. Steinbiss, H.J. Ney, X.L. Aubert, S. Besling, C. Dugast, U. Essen, D.
    Geller, R. Haeb-Umbach, R. Kneser, H.G. Meier, M. Oerder, B.H. Tran, Philips Journal
    of Research (1995).
date_created: 2019-07-12T05:30:31Z
date_updated: 2022-01-06T06:51:12Z
department:
- _id: '54'
language:
- iso: eng
publication: Philips Journal of Research
status: public
title: The Philips Research system for continuous-speech dictation
type: journal_article
user_id: '44006'
year: '1995'
...
---
_id: '11948'
abstract:
- lang: eng
  text: 'This paper gives an overview of the Philips research system for phoneme-based,
    large-vocabulary, continuousspeech recognition. The system has been successfully
    applied to various tasks in the German and (American) English languages, ranging
    from small vocabulary tasks to very large vocabulary tasks. Here, we concentrate
    on continuousspeech recognition for dictation in real applications, the dictation
    of legal reports and radiology reports in German. We describe this task and report
    on experimental results. We also describe a commercial PC-based dictation system
    which includes a PC implementation of our scientific recognition prototype. In
    order to allow for a comparison with the performance of other systems, a section
    with an evaluation on the standard Wall Street Journal task (dictation of American
    English newspaper text) is supplied. The recognition architecture is based on
    an integrated statistical approach. We describe the characteristic features of
    the system as opposed to other systems: 1. the Viterbi criterion is consistently
    applied both in training and testing; 2. continuous mixture densities are used
    without tying or smoothing; 3. time-synchronous beam search in connection with
    a phoneme look-ahead is applied to a tree-organized lexicon.'
author:
- first_name: Volker
  full_name: Steinbiss, Volker
  last_name: Steinbiss
- first_name: Hermann J.
  full_name: Ney, Hermann J.
  last_name: Ney
- first_name: Ute
  full_name: Essen, Ute
  last_name: Essen
- first_name: Bach Hiep
  full_name: Tran, Bach Hiep
  last_name: Tran
- first_name: Xavier L.
  full_name: Aubert, Xavier L.
  last_name: Aubert
- first_name: Christian
  full_name: Dugast, Christian
  last_name: Dugast
- first_name: Reinhard
  full_name: Kneser, Reinhard
  last_name: Kneser
- first_name: Hans Günter
  full_name: Meier, Hans Günter
  last_name: Meier
- first_name: Martin
  full_name: Oerder, Martin
  last_name: Oerder
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
- first_name: Dieter
  full_name: Geller, Dieter
  last_name: Geller
- first_name: W.
  full_name: Hoellerbauer, W.
  last_name: Hoellerbauer
- first_name: H.
  full_name: Bartosik, H.
  last_name: Bartosik
citation:
  ama: Steinbiss V, Ney HJ, Essen U, et al. Continuous speech dictation - From theory
    to practice. <i>Speech Communication</i>. 1995.
  apa: Steinbiss, V., Ney, H. J., Essen, U., Tran, B. H., Aubert, X. L., Dugast, C.,
    … Bartosik, H. (1995). Continuous speech dictation - From theory to practice.
    <i>Speech Communication</i>.
  bibtex: '@article{Steinbiss_Ney_Essen_Tran_Aubert_Dugast_Kneser_Meier_Oerder_Haeb-Umbach_et
    al._1995, title={Continuous speech dictation - From theory to practice}, journal={Speech
    Communication}, author={Steinbiss, Volker and Ney, Hermann J. and Essen, Ute and
    Tran, Bach Hiep and Aubert, Xavier L. and Dugast, Christian and Kneser, Reinhard
    and Meier, Hans Günter and Oerder, Martin and Haeb-Umbach, Reinhold and et al.},
    year={1995} }'
  chicago: Steinbiss, Volker, Hermann J. Ney, Ute Essen, Bach Hiep Tran, Xavier L.
    Aubert, Christian Dugast, Reinhard Kneser, et al. “Continuous Speech Dictation
    - From Theory to Practice.” <i>Speech Communication</i>, 1995.
  ieee: V. Steinbiss <i>et al.</i>, “Continuous speech dictation - From theory to
    practice,” <i>Speech Communication</i>, 1995.
  mla: Steinbiss, Volker, et al. “Continuous Speech Dictation - From Theory to Practice.”
    <i>Speech Communication</i>, 1995.
  short: V. Steinbiss, H.J. Ney, U. Essen, B.H. Tran, X.L. Aubert, C. Dugast, R. Kneser,
    H.G. Meier, M. Oerder, R. Haeb-Umbach, D. Geller, W. Hoellerbauer, H. Bartosik,
    Speech Communication (1995).
date_created: 2019-07-12T12:22:00Z
date_updated: 2022-01-06T06:51:13Z
department:
- _id: '54'
language:
- iso: eng
publication: Speech Communication
status: public
title: Continuous speech dictation - From theory to practice
type: journal_article
user_id: '44006'
year: '1995'
...