A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01)
O. Walter, J. Schmalenstroeer, R. Haeb-Umbach, A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01), 2013.
Download (ext.)
Report
| English
Author
Walter, Oliver;
Schmalenstroeer, JoergLibreCat;
Haeb-Umbach, ReinholdLibreCat
Abstract
In this paper we present a novel initialization method for unsupervised learning of acoustic patterns in recordings of continuous speech. The pattern discovery task is solved by dynamic time warping whose performance we improve by a smart starting point selection. This enables a more accurate discovery of patterns compared to conventional approaches. After graph-based clustering the patterns are employed for training hidden Markov models for an unsupervised speech acquisition. By iterating between model training and decoding in an EM-like framework the word accuracy is continuously improved. On the TIDIGITS corpus we achieve a word error rate of about 13 percent by the proposed unsupervised pattern discovery approach, which neither assumes knowledge of the acoustic units nor of the labels of the training data.
Publishing Year
LibreCat-ID
Cite this
Walter O, Schmalenstroeer J, Haeb-Umbach R. A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01).; 2013.
Walter, O., Schmalenstroeer, J., & Haeb-Umbach, R. (2013). A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01).
@book{Walter_Schmalenstroeer_Haeb-Umbach_2013, title={A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01)}, author={Walter, Oliver and Schmalenstroeer, Joerg and Haeb-Umbach, Reinhold}, year={2013} }
Walter, Oliver, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01), 2013.
O. Walter, J. Schmalenstroeer, and R. Haeb-Umbach, A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01). 2013.
Walter, Oliver, et al. A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01). 2013.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Link(s) to Main File(s)
Access Level
Closed Access