conference paper
Unsupervised learning of acoustic events using dynamic time warping and hierarchical K-means++ clustering
Joerg
Schmalenstroeer
author 460
Markus
Bartek
author
Reinhold
Haeb-Umbach
author 242
54
department
In this paper we propose to jointly consider Segmental Dynamic Time Warping and distance clustering for the unsupervised learning of acoustic events. As a result, the computational complexity increases only linearly with the dababase size compared to a quadratic increase in a sequential setup, where all pairwise SDTW distances between segments are computed prior to clustering. Further, we discuss options for seed value selection for clustering and show that drawing seeds with a probability proportional to the distance from the already drawn seeds, known as K-means++ clustering, results in a significantly higher probability of finding representatives of each of the underlying classes, compared to the commonly used draws from a uniform distribution. Experiments are performed on an acoustic event classification and an isolated digit recognition task, where on the latter the final word accuracy approaches that of supervised training.
2011
eng
Interspeech 2011
Schmalenstroeer J, Bartek M, Haeb-Umbach R. Unsupervised learning of acoustic events using dynamic time warping and hierarchical K-means++ clustering. In: <i>Interspeech 2011</i>. ; 2011.
Schmalenstroeer, J., Bartek, M., & Haeb-Umbach, R. (2011). Unsupervised learning of acoustic events using dynamic time warping and hierarchical K-means++ clustering. In <i>Interspeech 2011</i>.
J. Schmalenstroeer, M. Bartek, and R. Haeb-Umbach, “Unsupervised learning of acoustic events using dynamic time warping and hierarchical K-means++ clustering,” in <i>Interspeech 2011</i>, 2011.
Schmalenstroeer, Joerg, Markus Bartek, and Reinhold Haeb-Umbach. “Unsupervised Learning of Acoustic Events Using Dynamic Time Warping and Hierarchical K-Means++ Clustering.” In <i>Interspeech 2011</i>, 2011.
@inproceedings{Schmalenstroeer_Bartek_Haeb-Umbach_2011, title={Unsupervised learning of acoustic events using dynamic time warping and hierarchical K-means++ clustering}, booktitle={Interspeech 2011}, author={Schmalenstroeer, Joerg and Bartek, Markus and Haeb-Umbach, Reinhold}, year={2011} }
J. Schmalenstroeer, M. Bartek, R. Haeb-Umbach, in: Interspeech 2011, 2011.
Schmalenstroeer, Joerg, et al. “Unsupervised Learning of Acoustic Events Using Dynamic Time Warping and Hierarchical K-Means++ Clustering.” <i>Interspeech 2011</i>, 2011.
118892019-07-12T05:30:13Z2019-07-12T09:11:27Z