15 Publications
2022 | Conference Paper | LibreCat-ID: 33958
Kinoshita, K., von Neumann, T., Delcroix, M., Boeddeker, C., & Haeb-Umbach, R. (2022). Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. Proc. Interspeech 2022, 1486–1490. https://doi.org/10.21437/Interspeech.2022-11408
2022 | Conference Paper | LibreCat-ID: 33819
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2022). SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp43922.2022.9746757
2022 | Conference Paper | LibreCat-ID: 33847
Cord-Landwehr, T., von Neumann, T., Boeddeker, C., & Haeb-Umbach, R. (2022). MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg.
2022 | Conference Paper | LibreCat-ID: 33848
Cord-Landwehr, T., Boeddeker, C., von Neumann, T., Zorila, C., Doddipatla, R., & Haeb-Umbach, R. (2022). Monaural source separation: From anechoic to reverberant environments. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC).
2022 | Misc | LibreCat-ID: 33816
Gburrek, T., Boeddeker, C., von Neumann, T., Cord-Landwehr, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2022). A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv. https://doi.org/10.48550/ARXIV.2205.00944
2021 | Conference Paper | LibreCat-ID: 26770
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2021). Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. Interspeech 2021. https://doi.org/10.21437/interspeech.2021-1177
2021 | Conference Paper | LibreCat-ID: 29173
von Neumann, T., Boeddeker, C., Kinoshita, K., Delcroix, M., & Haeb-Umbach, R. (2021). Speeding Up Permutation Invariant Training for Source Separation. Speech Communication; 14th ITG Conference, Kiel.
2020 | Conference Paper | LibreCat-ID: 20762
von Neumann, T., Kinoshita, K., Drude, L., Boeddeker, C., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). End-to-End Training of Time Domain Audio Separation and Recognition. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 7004–7008. https://doi.org/10.1109/ICASSP40776.2020.9053461
2020 | Conference Paper | LibreCat-ID: 20764
von Neumann, T., Boeddeker, C., Drude, L., Kinoshita, K., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. Proc. Interspeech 2020, 3097–3101. https://doi.org/10.21437/Interspeech.2020-2519
2020 | Conference Paper | LibreCat-ID: 20766
Kinoshita, K., von Neumann, T., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation. Proc. Interspeech 2020, 2652–2656. https://doi.org/10.21437/Interspeech.2020-2388