{"_id":"12901","title":"Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition","department":[{"_id":"54"}],"date_created":"2019-07-30T14:53:58Z","user_id":"44006","citation":{"ieee":"C. Boeddeker, H. Erdogan, T. Yoshioka, and R. Haeb-Umbach, “Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition,” in ICASSP 2018, Calgary, Canada, 2018.","mla":"Boeddeker, Christoph, et al. “Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition.” ICASSP 2018, Calgary, Canada, 2018.","ama":"Boeddeker C, Erdogan H, Yoshioka T, Haeb-Umbach R. Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition. In: ICASSP 2018, Calgary, Canada. ; 2018.","bibtex":"@inproceedings{Boeddeker_Erdogan_Yoshioka_Haeb-Umbach_2018, title={Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition}, booktitle={ICASSP 2018, Calgary, Canada}, author={Boeddeker, Christoph and Erdogan, Hakan and Yoshioka, Takuya and Haeb-Umbach, Reinhold}, year={2018} }","apa":"Boeddeker, C., Erdogan, H., Yoshioka, T., & Haeb-Umbach, R. (2018). Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition. In ICASSP 2018, Calgary, Canada.","short":"C. Boeddeker, H. Erdogan, T. Yoshioka, R. Haeb-Umbach, in: ICASSP 2018, Calgary, Canada, 2018.","chicago":"Boeddeker, Christoph, Hakan Erdogan, Takuya Yoshioka, and Reinhold Haeb-Umbach. “Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition.” In ICASSP 2018, Calgary, Canada, 2018."},"related_material":{"link":[{"relation":"supplementary_material","description":"Poster","url":"https://groups.uni-paderborn.de/nt/pubs/2018/ICASSP_2018_Boeddeker_Slides.pdf"}]},"year":"2018","status":"public","author":[{"last_name":"Boeddeker","id":"40767","full_name":"Boeddeker, Christoph","first_name":"Christoph"},{"first_name":"Hakan","full_name":"Erdogan, Hakan","last_name":"Erdogan"},{"last_name":"Yoshioka","full_name":"Yoshioka, Takuya","first_name":"Takuya"},{"first_name":"Reinhold","full_name":"Haeb-Umbach, Reinhold","last_name":"Haeb-Umbach","id":"242"}],"oa":"1","abstract":[{"lang":"eng","text":"This work examines acoustic beamformers employing neural networks (NNs) for mask prediction as front-end for automatic speech recognition (ASR) systems for practical scenarios like voice-enabled home devices. To test the versatility of the mask predicting network, the system is evaluated with different recording hardware, different microphone array designs, and different acoustic models of the downstream ASR system. Significant gains in recognition accuracy are obtained in all configurations despite the fact that the NN had been trained on mismatched data. Unlike previous work, the NN is trained on a feature level objective, which gives some performance advantage over a mask related criterion. Furthermore, different approaches for realizing online, or adaptive, NN-based beamforming are explored, where the online algorithms still show significant gains compared to the baseline performance."}],"date_updated":"2022-01-06T06:51:24Z","type":"conference","publication":"ICASSP 2018, Calgary, Canada","main_file_link":[{"url":"https://groups.uni-paderborn.de/nt/pubs/2018/ICASSP_2018_Boeddeker_Paper.pdf","open_access":"1"}],"language":[{"iso":"eng"}]}