Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

43 Publications


2023 | Conference Paper | LibreCat-ID: 48390
Berger, S., Vieting, P., Boeddeker, C., Schlüter, R., & Haeb-Umbach, R. (2023). Mixture Encoder for Joint Speech Separation and Recognition. INTERSPEECH 2023. https://doi.org/10.21437/interspeech.2023-1815
LibreCat | DOI
 

2023 | Journal Article | LibreCat-ID: 35602 | OA
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31, 576–589. https://doi.org/10.1109/taslp.2022.3228629
LibreCat | Files available | DOI
 

2023 | Conference Paper | LibreCat-ID: 48281 | OA
von Neumann, T., Boeddeker, C., Kinoshita, K., Delcroix, M., & Haeb-Umbach, R. (2023). On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp49357.2023.10094784
LibreCat | Files available | DOI | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 48275 | OA
von Neumann, T., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments. CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin.
LibreCat | Files available | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 54439 | OA
Boeddeker, C., Cord-Landwehr, T., von Neumann, T., & Haeb-Umbach, R. (2023). Multi-stage diarization refinement for the CHiME-7 DASR scenario. 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023). https://doi.org/10.21437/chime.2023-10
LibreCat | DOI | Download (ext.)
 

2022 | Journal Article | LibreCat-ID: 33669 | OA
Zhang, W., Chang, X., Boeddeker, C., Nakatani, T., Watanabe, S., & Qian, Y. (2022). End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party. IEEE/ACM Transactions on Audio, Speech, and Language Processing. https://doi.org/10.1109/TASLP.2022.3209942
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 33954 | OA
Boeddeker, C., Cord-Landwehr, T., von Neumann, T., & Haeb-Umbach, R. (2022). An Initialization Scheme for Meeting Separation with Spatial Mixture Models. Interspeech 2022. https://doi.org/10.21437/interspeech.2022-10929
LibreCat | DOI | Download (ext.)
 

2022 | Conference Paper | LibreCat-ID: 33958
Kinoshita, K., von Neumann, T., Delcroix, M., Boeddeker, C., & Haeb-Umbach, R. (2022). Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. Proc. Interspeech 2022, 1486–1490. https://doi.org/10.21437/Interspeech.2022-11408
LibreCat | DOI
 

2022 | Conference Paper | LibreCat-ID: 33819 | OA
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2022). SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp43922.2022.9746757
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 33847 | OA
Cord-Landwehr, T., von Neumann, T., Boeddeker, C., & Haeb-Umbach, R. (2022). MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg.
LibreCat | Files available | arXiv
 

2022 | Conference Paper | LibreCat-ID: 33848 | OA
Cord-Landwehr, T., Boeddeker, C., von Neumann, T., Zorila, C., Doddipatla, R., & Haeb-Umbach, R. (2022). Monaural source separation: From anechoic to reverberant environments. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 2022 International Workshop on Acoustic Signal Enhancement (IWAENC).
LibreCat | Files available | arXiv
 

2022 | Misc | LibreCat-ID: 33816 | OA
Gburrek, T., Boeddeker, C., von Neumann, T., Cord-Landwehr, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2022). A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv. https://doi.org/10.48550/ARXIV.2205.00944
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 28256
Zhang, W., Boeddeker, C., Watanabe, S., Nakatani, T., Delcroix, M., Kinoshita, K., Ochiai, T., Kamo, N., Haeb-Umbach, R., & Qian, Y. (2021). End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp39728.2021.9414464
LibreCat | DOI
 

2021 | Conference Paper | LibreCat-ID: 28262
Li, C., Shi, J., Zhang, W., Subramanian, A. S., Chang, X., Kamo, N., Hira, M., Hayashi, T., Boeddeker, C., Chen, Z., & Watanabe, S. (2021). ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration. 2021 IEEE Spoken Language Technology Workshop (SLT). https://doi.org/10.1109/slt48900.2021.9383615
LibreCat | DOI
 

2021 | Conference Paper | LibreCat-ID: 28261
Li, C., Luo, Y., Han, C., Li, J., Yoshioka, T., Zhou, T., Delcroix, M., Kinoshita, K., Boeddeker, C., Qian, Y., Watanabe, S., & Chen, Z. (2021). Dual-Path RNN for Long Recording Speech Separation. 2021 IEEE Spoken Language Technology Workshop (SLT). https://doi.org/10.1109/slt48900.2021.9383514
LibreCat | DOI
 

2021 | Conference Paper | LibreCat-ID: 44843 | OA
Boeddeker, C., Rautenberg, F., & Haeb-Umbach, R. (2021). A Comparison and Combination of Unsupervised Blind Source Separation  Techniques. ITG Conference on Speech Communication. ITG Conference on Speech Communication, Kiel.
LibreCat | Files available | Download (ext.) | arXiv
 

2021 | Conference Paper | LibreCat-ID: 28259 | OA
Boeddeker, C., Zhang, W., Nakatani, T., Kinoshita, K., Ochiai, T., Delcroix, M., Kamo, N., Qian, Y., & Haeb-Umbach, R. (2021). Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp39728.2021.9414661
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 26770 | OA
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2021). Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. Interspeech 2021. Interspeech. https://doi.org/10.21437/interspeech.2021-1177
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 29173 | OA
von Neumann, T., Boeddeker, C., Kinoshita, K., Delcroix, M., & Haeb-Umbach, R. (2021). Speeding Up Permutation Invariant Training for Source Separation. Speech Communication; 14th ITG Conference. Speech Communication; 14th ITG Conference, Kiel.
LibreCat | Files available
 

2020 | Conference Paper | LibreCat-ID: 20700 | OA
Boeddeker, C., Cord-Landwehr, T., Heitkaemper, J., Zorila, C., Hayakawa, D., Li, M., … Haeb-Umbach, R. (2020). Towards a speaker diarization system for the CHiME 2020 dinner party transcription. In Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments.
LibreCat | Files available
 

Filters and Search Terms

(person=40767)

status=public

Search

Filter Publications

Display / Sort

Citation Style: APA

Export / Embed