{"abstract":[{"text":"Variational Autoencoders (VAEs) have been shown to provide efficient neural-network-based approximate Bayesian inference for observation models for which exact inference is intractable. Its extension, the so-called Structured VAE (SVAE) allows inference in the presence of both discrete and continuous latent variables. Inspired by this extension, we developed a VAE with Hidden Markov Models (HMMs) as latent models. We applied the resulting HMM-VAE to the task of acoustic unit discovery in a zero resource scenario. Starting from an initial model based on variational inference in an HMM with Gaussian Mixture Model (GMM) emission probabilities, the accuracy of the acoustic unit discovery could be significantly improved by the HMM-VAE. In doing so we were able to demonstrate for an unsupervised learning task what is well-known in the supervised learning case: Neural networks provide superior modeling power compared to GMMs.","lang":"eng"}],"oa":"1","author":[{"first_name":"Janek","id":"34851","last_name":"Ebbers","full_name":"Ebbers, Janek"},{"full_name":"Heymann, Jahn","id":"9168","last_name":"Heymann","first_name":"Jahn"},{"id":"11213","last_name":"Drude","full_name":"Drude, Lukas","first_name":"Lukas"},{"last_name":"Glarner","id":"14169","full_name":"Glarner, Thomas","first_name":"Thomas"},{"id":"242","last_name":"Haeb-Umbach","full_name":"Haeb-Umbach, Reinhold","first_name":"Reinhold"},{"last_name":"Raj","full_name":"Raj, Bhiksha","first_name":"Bhiksha"}],"quality_controlled":"1","year":"2017","status":"public","date_created":"2019-07-12T05:27:42Z","user_id":"34851","related_material":{"link":[{"relation":"supplementary_material","url":"https://groups.uni-paderborn.de/nt/pubs/2017/INTERSPEECH_2017_Ebbers_poster.pdf","description":"Poster"},{"description":"Slides","url":"https://groups.uni-paderborn.de/nt/pubs/2017/INTERSPEECH_2017_Ebbers_slides.pdf","relation":"supplementary_material"}]},"citation":{"ama":"Ebbers J, Heymann J, Drude L, Glarner T, Haeb-Umbach R, Raj B. Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery. In: INTERSPEECH 2017, Stockholm, Schweden. ; 2017.","mla":"Ebbers, Janek, et al. “Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery.” INTERSPEECH 2017, Stockholm, Schweden, 2017.","ieee":"J. Ebbers, J. Heymann, L. Drude, T. Glarner, R. Haeb-Umbach, and B. Raj, “Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery,” 2017.","apa":"Ebbers, J., Heymann, J., Drude, L., Glarner, T., Haeb-Umbach, R., & Raj, B. (2017). Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery. INTERSPEECH 2017, Stockholm, Schweden.","bibtex":"@inproceedings{Ebbers_Heymann_Drude_Glarner_Haeb-Umbach_Raj_2017, title={Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery}, booktitle={INTERSPEECH 2017, Stockholm, Schweden}, author={Ebbers, Janek and Heymann, Jahn and Drude, Lukas and Glarner, Thomas and Haeb-Umbach, Reinhold and Raj, Bhiksha}, year={2017} }","short":"J. Ebbers, J. Heymann, L. Drude, T. Glarner, R. Haeb-Umbach, B. Raj, in: INTERSPEECH 2017, Stockholm, Schweden, 2017.","chicago":"Ebbers, Janek, Jahn Heymann, Lukas Drude, Thomas Glarner, Reinhold Haeb-Umbach, and Bhiksha Raj. “Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery.” In INTERSPEECH 2017, Stockholm, Schweden, 2017."},"_id":"11759","title":"Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery","department":[{"_id":"54"}],"main_file_link":[{"open_access":"1","url":"https://groups.uni-paderborn.de/nt/pubs/2017/INTERSPEECH_2017_Ebbers_paper.pdf"}],"language":[{"iso":"eng"}],"type":"conference","date_updated":"2023-11-22T08:29:06Z","publication":"INTERSPEECH 2017, Stockholm, Schweden"}