{"citation":{"ieee":"O. Walter and R. Haeb-Umbach, “Unsupervised Word Discovery from Speech using Bayesian Hierarchical Models,” in 38th German Conference on Pattern Recognition (GCPR 2016), 2016.","apa":"Walter, O., & Haeb-Umbach, R. (2016). Unsupervised Word Discovery from Speech using Bayesian Hierarchical Models. In 38th German Conference on Pattern Recognition (GCPR 2016).","bibtex":"@inproceedings{Walter_Haeb-Umbach_2016, title={Unsupervised Word Discovery from Speech using Bayesian Hierarchical Models}, booktitle={38th German Conference on Pattern Recognition (GCPR 2016)}, author={Walter, Oliver and Haeb-Umbach, Reinhold}, year={2016} }","mla":"Walter, Oliver, and Reinhold Haeb-Umbach. “Unsupervised Word Discovery from Speech Using Bayesian Hierarchical Models.” 38th German Conference on Pattern Recognition (GCPR 2016), 2016.","short":"O. Walter, R. Haeb-Umbach, in: 38th German Conference on Pattern Recognition (GCPR 2016), 2016.","chicago":"Walter, Oliver, and Reinhold Haeb-Umbach. “Unsupervised Word Discovery from Speech Using Bayesian Hierarchical Models.” In 38th German Conference on Pattern Recognition (GCPR 2016), 2016.","ama":"Walter O, Haeb-Umbach R. Unsupervised Word Discovery from Speech using Bayesian Hierarchical Models. In: 38th German Conference on Pattern Recognition (GCPR 2016). ; 2016."},"author":[{"full_name":"Walter, Oliver","last_name":"Walter","first_name":"Oliver"},{"last_name":"Haeb-Umbach","full_name":"Haeb-Umbach, Reinhold","first_name":"Reinhold","id":"242"}],"oa":"1","year":"2016","main_file_link":[{"open_access":"1","url":"https://groups.uni-paderborn.de/nt/pubs/2016/WaHa16.pdf"}],"language":[{"iso":"eng"}],"_id":"11920","type":"conference","date_updated":"2022-01-06T06:51:12Z","publication":"38th German Conference on Pattern Recognition (GCPR 2016)","department":[{"_id":"54"}],"title":"Unsupervised Word Discovery from Speech using Bayesian Hierarchical Models","user_id":"44006","date_created":"2019-07-12T05:30:49Z","abstract":[{"lang":"eng","text":"In this paper we demonstrate an algorithm to learn words from speech using non-parametric Bayesian hierarchical models in an unsupervised setting. We exploit the assumption of a hierarchical structure of speech, namely the formation of spoken words as a sequence of phonemes. We employ the Nested Hierarchical Pitman-Yor Language Model, which allows an a priori unknown and possibly unlimited number of words. We assume the n-gram probabilities of words, the m-gram probabilities of phoneme sequences in words and the phoneme sequences of the words themselves as latent variables to be learned. We evaluate the algorithm on a cross language task using an existing speech recognizer trained on English speech to decode speech in the Xitsonga language supplied for the 2015 ZeroSpeech challenge. We apply the learning algorithm on the resulting phoneme graphs and achieve the highest token precision and F score compared to present systems."}],"status":"public","related_material":{"link":[{"description":"Presentation","relation":"supplementary_material","url":"https://groups.uni-paderborn.de/nt/pubs/2016/WaHa16_Talk.pdf"}]}}