LibreCat – Publication List Manager

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

15 Publications

2022 | Conference Paper | LibreCat-ID: 33958

Kinoshita K, von Neumann T, Delcroix M, Boeddeker C, Haeb-Umbach R. Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. In: Proc. Interspeech 2022. ISCA; 2022:1486-1490. doi:10.21437/Interspeech.2022-11408

LibreCat | DOI

2022 | Conference Paper | LibreCat-ID: 33819 |

von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. In: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2022. doi:10.1109/icassp43922.2022.9746757

LibreCat | Files available | DOI

2022 | Conference Paper | LibreCat-ID: 33847 |

Cord-Landwehr T, von Neumann T, Boeddeker C, Haeb-Umbach R. MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). ; 2022.

LibreCat | Files available | arXiv

2022 | Conference Paper | LibreCat-ID: 33848 |

Cord-Landwehr T, Boeddeker C, von Neumann T, Zorila C, Doddipatla R, Haeb-Umbach R. Monaural source separation: From anechoic to reverberant environments. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE; 2022.

LibreCat | Files available | arXiv

2022 | Misc | LibreCat-ID: 33816 |

Gburrek T, Boeddeker C, von Neumann T, Cord-Landwehr T, Schmalenstroeer J, Haeb-Umbach R. A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv; 2022. doi:10.48550/ARXIV.2205.00944

LibreCat | Files available | DOI

2021 | Conference Paper | LibreCat-ID: 26770 |

von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. In: Interspeech 2021. ; 2021. doi:10.21437/interspeech.2021-1177

LibreCat | Files available | DOI

2021 | Conference Paper | LibreCat-ID: 29173 |

von Neumann T, Boeddeker C, Kinoshita K, Delcroix M, Haeb-Umbach R. Speeding Up Permutation Invariant Training for Source Separation. In: Speech Communication; 14th ITG Conference. ; 2021.

LibreCat | Files available

2020 | Conference Paper | LibreCat-ID: 20762 |

von Neumann T, Kinoshita K, Drude L, et al. End-to-End Training of Time Domain Audio Separation and Recognition. In: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2020:7004-7008. doi:10.1109/ICASSP40776.2020.9053461

LibreCat | Files available | DOI

2020 | Conference Paper | LibreCat-ID: 20764 |

von Neumann T, Boeddeker C, Drude L, et al. Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. In: Proc. Interspeech 2020. ; 2020:3097-3101. doi:10.21437/Interspeech.2020-2519

LibreCat | Files available | DOI

2020 | Conference Paper | LibreCat-ID: 20766 |

Kinoshita K, von Neumann T, Delcroix M, Nakatani T, Haeb-Umbach R. Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation. In: Proc. Interspeech 2020. ; 2020:2652-2656. doi:10.21437/Interspeech.2020-2388

LibreCat | Files available | DOI

Publications at Paderborn University

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

15 Publications

Filters and Search Terms

Search

Filter Publications

Display / Sort

Export / Embed

Publications at Paderborn University

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

15 Publications

Filters and Search Terms

Search

Filter Publications

Display / Sort

Export / Embed

Export Options