conference paper
Options for Modelling Temporal Statistical Dependencies in an Acoustic Model for ASR
Volker
Leutnant
author
Reinhold
Haeb-Umbach
author 242
54
department
Traditionally, ASR systems are based on hidden Markov models with Gaussian mixtures modelling the state-conditioned feature distribution. The inherent assumption of conditional independence, stating that a feature's likelihood solely depends on the current HMM state, makes the search computationally tractable, nevertheless has also been identified to be a major reason for the lack of robustness of such systems. Linear dynamic models have been proposed to overcome this weakness by employing a hidden dynamic state process underlying the observed features. Though performance of linear dynamic models on continuous speech/phone recognition tasks has been shown to be superior to that of equivalent static models, this approach still cannot compete with the established acoustic models. In this paper we consider the combination of hidden Markov models based on Gaussian mixture densities (GMM-HMMs) and linear dynamic models (LDMs) as the acoustic model for automatic speech recognition systems. In doing so, the individual strengths of both models, i.e. the modelling of long-term temporal dependencies by the GMM-HMM and the direct modelling of statistical dependencies between consecutive feature vectors by the LDM, are exploited. Phone classification experiments conducted on the TIMIT database indicate the prospective use of this approach for the application to continuous speech recognition.
2010
eng
36. Deutsche Jahrestagung fuer Akustik (DAGA 2010)
Leutnant V, Haeb-Umbach R. Options for Modelling Temporal Statistical Dependencies in an Acoustic Model for ASR. In: <i>36. Deutsche Jahrestagung Fuer Akustik (DAGA 2010)</i>. ; 2010.
Leutnant, Volker, and Reinhold Haeb-Umbach. “Options for Modelling Temporal Statistical Dependencies in an Acoustic Model for ASR.” <i>36. Deutsche Jahrestagung Fuer Akustik (DAGA 2010)</i>, 2010.
Leutnant, V., & Haeb-Umbach, R. (2010). Options for Modelling Temporal Statistical Dependencies in an Acoustic Model for ASR. In <i>36. Deutsche Jahrestagung fuer Akustik (DAGA 2010)</i>.
V. Leutnant and R. Haeb-Umbach, “Options for Modelling Temporal Statistical Dependencies in an Acoustic Model for ASR,” in <i>36. Deutsche Jahrestagung fuer Akustik (DAGA 2010)</i>, 2010.
@inproceedings{Leutnant_Haeb-Umbach_2010, title={Options for Modelling Temporal Statistical Dependencies in an Acoustic Model for ASR}, booktitle={36. Deutsche Jahrestagung fuer Akustik (DAGA 2010)}, author={Leutnant, Volker and Haeb-Umbach, Reinhold}, year={2010} }
V. Leutnant, R. Haeb-Umbach, in: 36. Deutsche Jahrestagung Fuer Akustik (DAGA 2010), 2010.
Leutnant, Volker, and Reinhold Haeb-Umbach. “Options for Modelling Temporal Statistical Dependencies in an Acoustic Model for ASR.” In <i>36. Deutsche Jahrestagung Fuer Akustik (DAGA 2010)</i>, 2010.
118572019-07-12T05:29:36Z2022-01-06T06:51:11Z