A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01)

O. Walter, J. Schmalenstroeer, R. Haeb-Umbach, A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01), 2013.

Report | English
Abstract
In this paper, we present a novel initialization method for unsupervised learning of acoustic patterns in recordings of continuous speech. The pattern discovery task is solved by dynamic time warping, whose performance we improve through smart starting-point selection. This enables more accurate discovery of patterns than conventional approaches. After graph-based clustering, the patterns are used to train hidden Markov models for unsupervised speech acquisition. By iterating between model training and decoding in an EM-like framework, the word accuracy is continuously improved. On the TIDIGITS corpus, the proposed unsupervised pattern discovery approach achieves a word error rate of about 13 percent while assuming knowledge of neither the acoustic units nor the labels of the training data.
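
For readers unfamiliar with dynamic time warping, the following is a minimal sketch of the textbook alignment cost between two feature sequences. It is not the initialization method proposed in the report: the function name `dtw_distance`, the Euclidean local distance, and the three-way step pattern are assumptions chosen for illustration only.

```python
import math


def dtw_distance(a, b):
    """Textbook DTW cost between two sequences of feature vectors.

    Illustrative only: the local distance (Euclidean) and the step
    pattern (match, insertion, deletion) are assumptions, not taken
    from the report.
    """
    n, m = len(a), len(b)
    # cost[i][j] holds the best accumulated cost of aligning a[:i] with b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # advance in a only
                cost[i][j - 1],      # advance in b only
                cost[i - 1][j - 1],  # advance in both
            )
    return cost[n][m]
```

Two sequences that differ only by a repeated frame align at zero cost under this sketch, which is the property that makes DTW suitable for comparing speech segments spoken at different rates.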

Cite this

@book{Walter_Schmalenstroeer_Haeb-Umbach_2013, title={A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01)}, author={Walter, Oliver and Schmalenstroeer, Joerg and Haeb-Umbach, Reinhold}, year={2013} }
