LibreCat – Publication List Manager

Thilo Christoph von Neumann

Nachrichtentechnik (NT) / Heinz Nixdorf Institut

tvn@mail.uni-paderborn.de

49870

17 Publications

Mark all

[17]

2024 | Conference Paper | LibreCat-ID: 56004 |

von Neumann T, Boeddeker C, Cord-Landwehr T, Delcroix M, Haeb-Umbach R. Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization. In: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW). IEEE; 2024. doi:10.1109/icasspw62465.2024.10625894

LibreCat | Files available | DOI

[16]

2024 | Conference Paper | LibreCat-ID: 57659 |

Vieting P, Berger S, von Neumann T, Boeddeker C, Schlüter R, Haeb-Umbach R. Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription. In: 2024 IEEE Spoken Language Technology Workshop (SLT). ; 2024.

LibreCat | Download (ext.)

[15]

2023 | Journal Article | LibreCat-ID: 35602 |

von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2023;31:576-589. doi:10.1109/taslp.2022.3228629

LibreCat | Files available | DOI

[14]

2023 | Conference Paper | LibreCat-ID: 48281 |

von Neumann T, Boeddeker C, Kinoshita K, Delcroix M, Haeb-Umbach R. On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2023. doi:10.1109/icassp49357.2023.10094784

LibreCat | Files available | DOI | Download (ext.)

[13]

2023 | Conference Paper | LibreCat-ID: 48275 |

von Neumann T, Boeddeker C, Delcroix M, Haeb-Umbach R. MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. In: Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments. ; 2023.

LibreCat | Files available | Download (ext.)

[12]

2023 | Conference Paper | LibreCat-ID: 54439 |

Boeddeker C, Cord-Landwehr T, von Neumann T, Haeb-Umbach R. Multi-stage diarization refinement for the CHiME-7 DASR scenario. In: 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023). ISCA; 2023. doi:10.21437/chime.2023-10

LibreCat | DOI | Download (ext.)

[11]

2022 | Conference Paper | LibreCat-ID: 33847 |

Cord-Landwehr T, von Neumann T, Boeddeker C, Haeb-Umbach R. MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). ; 2022.

LibreCat | Files available | arXiv

[10]

2022 | Conference Paper | LibreCat-ID: 33848 |

Cord-Landwehr T, Boeddeker C, von Neumann T, Zorila C, Doddipatla R, Haeb-Umbach R. Monaural source separation: From anechoic to reverberant environments. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE; 2022.

LibreCat | Files available | arXiv

[9]

2022 | Conference Paper | LibreCat-ID: 33819 |

von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. In: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2022. doi:10.1109/icassp43922.2022.9746757

LibreCat | Files available | DOI

[8]

2022 | Misc | LibreCat-ID: 33816 |

Gburrek T, Boeddeker C, von Neumann T, Cord-Landwehr T, Schmalenstroeer J, Haeb-Umbach R. A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv; 2022. doi:10.48550/ARXIV.2205.00944

LibreCat | Files available | DOI

[7]

2022 | Conference Paper | LibreCat-ID: 33954 |

Boeddeker C, Cord-Landwehr T, von Neumann T, Haeb-Umbach R. An Initialization Scheme for Meeting Separation with Spatial Mixture Models. In: Interspeech 2022. ISCA; 2022. doi:10.21437/interspeech.2022-10929

LibreCat | DOI | Download (ext.)

[6]

2022 | Conference Paper | LibreCat-ID: 33958

Kinoshita K, von Neumann T, Delcroix M, Boeddeker C, Haeb-Umbach R. Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. In: Proc. Interspeech 2022. ISCA; 2022:1486-1490. doi:10.21437/Interspeech.2022-11408

LibreCat | DOI | Download (ext.)

[5]

2021 | Conference Paper | LibreCat-ID: 26770 |

von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. In: Interspeech 2021. ; 2021. doi:10.21437/interspeech.2021-1177

LibreCat | Files available | DOI

[4]

2021 | Conference Paper | LibreCat-ID: 29173 |

von Neumann T, Boeddeker C, Kinoshita K, Delcroix M, Haeb-Umbach R. Speeding Up Permutation Invariant Training for Source Separation. In: Speech Communication; 14th ITG Conference. ; 2021.

LibreCat | Files available

[3]

2020 | Conference Paper | LibreCat-ID: 20762 |

von Neumann T, Kinoshita K, Drude L, et al. End-to-End Training of Time Domain Audio Separation and Recognition. In: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2020:7004-7008. doi:10.1109/ICASSP40776.2020.9053461

LibreCat | Files available | DOI

[2]

2020 | Conference Paper | LibreCat-ID: 20764 |

von Neumann T, Boeddeker C, Drude L, et al. Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. In: Proc. Interspeech 2020. ; 2020:3097-3101. doi:10.21437/Interspeech.2020-2519

LibreCat | Files available | DOI

[1]

2020 | Conference Paper | LibreCat-ID: 20766 |

Kinoshita K, von Neumann T, Delcroix M, Nakatani T, Haeb-Umbach R. Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation. In: Proc. Interspeech 2020. ; 2020:2652-2656. doi:10.21437/Interspeech.2020-2388

LibreCat | Files available | DOI

17 Publications

Mark all

[17]

2024 | Conference Paper | LibreCat-ID: 56004 |

LibreCat | Files available | DOI

[16]

2024 | Conference Paper | LibreCat-ID: 57659 |

LibreCat | Download (ext.)

[15]

2023 | Journal Article | LibreCat-ID: 35602 |

LibreCat | Files available | DOI

[14]

2023 | Conference Paper | LibreCat-ID: 48281 |

LibreCat | Files available | DOI | Download (ext.)

[13]

2023 | Conference Paper | LibreCat-ID: 48275 |

LibreCat | Files available | Download (ext.)

[12]

2023 | Conference Paper | LibreCat-ID: 54439 |

LibreCat | DOI | Download (ext.)

[11]

2022 | Conference Paper | LibreCat-ID: 33847 |

LibreCat | Files available | arXiv

[10]

2022 | Conference Paper | LibreCat-ID: 33848 |

LibreCat | Files available | arXiv

[9]

2022 | Conference Paper | LibreCat-ID: 33819 |

LibreCat | Files available | DOI

[8]

2022 | Misc | LibreCat-ID: 33816 |

LibreCat | Files available | DOI

[7]

2022 | Conference Paper | LibreCat-ID: 33954 |

LibreCat | DOI | Download (ext.)

[6]

2022 | Conference Paper | LibreCat-ID: 33958

LibreCat | DOI | Download (ext.)

[5]

2021 | Conference Paper | LibreCat-ID: 26770 |

LibreCat | Files available | DOI

[4]

2021 | Conference Paper | LibreCat-ID: 29173 |

von Neumann T, Boeddeker C, Kinoshita K, Delcroix M, Haeb-Umbach R. Speeding Up Permutation Invariant Training for Source Separation. In: Speech Communication; 14th ITG Conference. ; 2021.

LibreCat | Files available

[3]

2020 | Conference Paper | LibreCat-ID: 20762 |

LibreCat | Files available | DOI

[2]

2020 | Conference Paper | LibreCat-ID: 20764 |

LibreCat | Files available | DOI

[1]

2020 | Conference Paper | LibreCat-ID: 20766 |

LibreCat | Files available | DOI

Thilo Christoph von Neumann

17 Publications

Search

Filter Publications

Display / Sort

Export / Embed

17 Publications

Search

Filter Publications

Display / Sort

Export / Embed

Thilo Christoph von Neumann

17 Publications

Search

Filter Publications

Display / Sort

Export / Embed

Export Options

17 Publications

Search

Filter Publications

Display / Sort

Export / Embed

Export Options