39 Publications


2023 | Journal Article | LibreCat-ID: 35602 | OA
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31, 576–589. https://doi.org/10.1109/taslp.2022.3228629

2023 | Conference Paper | LibreCat-ID: 48275 | OA
von Neumann, T., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments. CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin.

2021 | Conference Paper | LibreCat-ID: 26770 | OA
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2021). Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. Interspeech 2021. Interspeech. https://doi.org/10.21437/interspeech.2021-1177

2020 | Conference Paper | LibreCat-ID: 20504
Heitkaemper, J., Jakobeit, D., Boeddeker, C., Drude, L., & Haeb-Umbach, R. (2020). Demystifying TasNet: A Dissecting Approach. ICASSP 2020, Virtual, Barcelona, Spain.

2020 | Conference Paper | LibreCat-ID: 20505
Heitkaemper, J., Schmalenstroeer, J., & Haeb-Umbach, R. (2020). Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments. INTERSPEECH 2020, Virtual, Shanghai, China.

2018 | Conference Paper | LibreCat-ID: 17557
Abramov, O., Kopp, S., Nemeth, A., Kern, F., Mertens, U., & Rohlfing, K. (2018). Towards a Computational Model of Child Gesture-Speech Production. KOGWIS2018: Computational Approaches to Cognitive Science.

2018 | Conference Paper | LibreCat-ID: 17179
Abramov, O., Kopp, S., Nemeth, A., Kern, F., Mertens, U., & Rohlfing, K. (2018). Towards a Computational Model of Child Gesture-Speech Production. KOGWIS2018: Computational Approaches to Cognitive Science.

2015 | Conference Paper | LibreCat-ID: 11739 | OA
Chinaev, A., & Haeb-Umbach, R. (2015). On Optimal Smoothing in Minimum Statistics Based Noise Tracking. In Interspeech 2015 (pp. 1785–1789).

2015 | Conference Paper | LibreCat-ID: 11813 | OA
Heymann, J., Haeb-Umbach, R., Golik, P., & Schlueter, R. (2015). Unsupervised adaptation of a denoising autoencoder by Bayesian Feature Enhancement for reverberant ASR under mismatch conditions. In Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on (pp. 5053–5057). https://doi.org/10.1109/ICASSP.2015.7178933

2014 | Conference Paper | LibreCat-ID: 11753 | OA
Drude, L., Chinaev, A., Tran Vu, D. H., & Haeb-Umbach, R. (2014). Towards Online Source Counting in Speech Mixtures Applying a Variational EM for Complex Watson Mixture Models. In 14th International Workshop on Acoustic Signal Enhancement (IWAENC 2014) (pp. 213–217).

Filters and Search Terms

keyword="Speech"
