15 Publications
2022 | Conference Paper | LibreCat-ID: 33958
Kinoshita K, von Neumann T, Delcroix M, Boeddeker C, Haeb-Umbach R. Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. In: Proc. Interspeech 2022. ISCA; 2022:1486-1490. doi:10.21437/Interspeech.2022-11408
2022 | Conference Paper | LibreCat-ID: 33819 | Open access
von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. In: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2022. doi:10.1109/icassp43922.2022.9746757
2022 | Conference Paper | LibreCat-ID: 33847 | Open access
Cord-Landwehr T, von Neumann T, Boeddeker C, Haeb-Umbach R. MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 2022.
2022 | Conference Paper | LibreCat-ID: 33848 | Open access
Cord-Landwehr T, Boeddeker C, von Neumann T, Zorila C, Doddipatla R, Haeb-Umbach R. Monaural source separation: From anechoic to reverberant environments. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE; 2022.
2022 | Misc | LibreCat-ID: 33816 | Open access
Gburrek T, Boeddeker C, von Neumann T, Cord-Landwehr T, Schmalenstroeer J, Haeb-Umbach R. A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv; 2022. doi:10.48550/ARXIV.2205.00944
2021 | Conference Paper | LibreCat-ID: 26770 | Open access
von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. In: Interspeech 2021. 2021. doi:10.21437/interspeech.2021-1177
2021 | Conference Paper | LibreCat-ID: 29173 | Open access
von Neumann T, Boeddeker C, Kinoshita K, Delcroix M, Haeb-Umbach R. Speeding Up Permutation Invariant Training for Source Separation. In: Speech Communication; 14th ITG Conference. 2021.
2020 | Conference Paper | LibreCat-ID: 20762 | Open access
von Neumann T, Kinoshita K, Drude L, et al. End-to-End Training of Time Domain Audio Separation and Recognition. In: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2020:7004-7008. doi:10.1109/ICASSP40776.2020.9053461
2020 | Conference Paper | LibreCat-ID: 20764 | Open access
von Neumann T, Boeddeker C, Drude L, et al. Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. In: Proc. Interspeech 2020. 2020:3097-3101. doi:10.21437/Interspeech.2020-2519
2020 | Conference Paper | LibreCat-ID: 20766 | Open access
Kinoshita K, von Neumann T, Delcroix M, Nakatani T, Haeb-Umbach R. Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation. In: Proc. Interspeech 2020. 2020:2652-2656. doi:10.21437/Interspeech.2020-2388