A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01)

Walter, Oliver; Schmalenstroeer, Joerg; Haeb-Umbach, Reinhold

A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01)

O. Walter, J. Schmalenstroeer, R. Haeb-Umbach, A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01), 2013.

Download (ext.)

https://groups.uni-paderborn.de/nt/pubs/2013/WaScHa2013.pdf

Report | English

Author

Walter, Oliver; Schmalenstroeer, Joerg^LibreCat; Haeb-Umbach, Reinhold^LibreCat

Department

Nachrichtentechnik (NT) / Heinz Nixdorf Institut

Abstract

In this paper we present a novel initialization method for unsupervised learning of acoustic patterns in recordings of continuous speech. The pattern discovery task is solved by dynamic time warping whose performance we improve by a smart starting point selection. This enables a more accurate discovery of patterns compared to conventional approaches. After graph-based clustering the patterns are employed for training hidden Markov models for an unsupervised speech acquisition. By iterating between model training and decoding in an EM-like framework the word accuracy is continuously improved. On the TIDIGITS corpus we achieve a word error rate of about 13 percent by the proposed unsupervised pattern discovery approach, which neither assumes knowledge of the acoustic units nor of the labels of the training data.

Publishing Year

2013

LibreCat-ID

11926

Cite this

Walter O, Schmalenstroeer J, Haeb-Umbach R. A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01).; 2013.

Walter, O., Schmalenstroeer, J., & Haeb-Umbach, R. (2013). A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01).

@book{Walter_Schmalenstroeer_Haeb-Umbach_2013, title={A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01)}, author={Walter, Oliver and Schmalenstroeer, Joerg and Haeb-Umbach, Reinhold}, year={2013} }

Walter, Oliver, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01), 2013.

O. Walter, J. Schmalenstroeer, and R. Haeb-Umbach, A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01). 2013.

Walter, Oliver, et al. A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01). 2013.

All files available under the following license(s):

Copyright Statement:

This Item is protected by copyright and/or related rights. [...]

Link(s) to Main File(s)

URL

https://groups.uni-paderborn.de/nt/pubs/2013/WaScHa2013.pdf

Access Level

Closed Access

Export

Marked Publications

Open Data LibreCat

Search this title in

Google Scholar