LibreCat – Publication List Manager

Thilo Christoph von Neumann

Nachrichtentechnik (NT) / Heinz Nixdorf Institut

tvn@mail.uni-paderborn.de

49870

17 Publications

[17]

2024 | Conference Paper | LibreCat-ID: 56004 |

von Neumann, T., Boeddeker, C., Cord-Landwehr, T., Delcroix, M., & Haeb-Umbach, R. (2024). Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization. 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW). https://doi.org/10.1109/icasspw62465.2024.10625894

LibreCat | Files available | DOI

[16]

2024 | Conference Paper | LibreCat-ID: 57659 |

Vieting, P., Berger, S., von Neumann, T., Boeddeker, C., Schlüter, R., & Haeb-Umbach, R. (2024). Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription. 2024 IEEE Spoken Language Technology Workshop (SLT).

LibreCat | Download (ext.)

[15]

2023 | Journal Article | LibreCat-ID: 35602 |

von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31, 576–589. https://doi.org/10.1109/taslp.2022.3228629

LibreCat | Files available | DOI

[14]

2023 | Conference Paper | LibreCat-ID: 48281 |

von Neumann, T., Boeddeker, C., Kinoshita, K., Delcroix, M., & Haeb-Umbach, R. (2023). On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp49357.2023.10094784

LibreCat | Files available | DOI | Download (ext.)

[13]

2023 | Conference Paper | LibreCat-ID: 48275 |

von Neumann, T., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin.

LibreCat | Files available | Download (ext.)

[12]

2023 | Conference Paper | LibreCat-ID: 54439 |

Boeddeker, C., Cord-Landwehr, T., von Neumann, T., & Haeb-Umbach, R. (2023). Multi-stage diarization refinement for the CHiME-7 DASR scenario. 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023). https://doi.org/10.21437/chime.2023-10

LibreCat | DOI | Download (ext.)

[11]

2022 | Conference Paper | LibreCat-ID: 33847 |

Cord-Landwehr, T., von Neumann, T., Boeddeker, C., & Haeb-Umbach, R. (2022). MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg.

LibreCat | Files available | arXiv

[10]

2022 | Conference Paper | LibreCat-ID: 33848 |

Cord-Landwehr, T., Boeddeker, C., von Neumann, T., Zorila, C., Doddipatla, R., & Haeb-Umbach, R. (2022). Monaural source separation: From anechoic to reverberant environments. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC).

LibreCat | Files available | arXiv

[9]

2022 | Conference Paper | LibreCat-ID: 33819 |

von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2022). SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp43922.2022.9746757

LibreCat | Files available | DOI

[8]

2022 | Misc | LibreCat-ID: 33816 |

Gburrek, T., Boeddeker, C., von Neumann, T., Cord-Landwehr, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2022). A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv. https://doi.org/10.48550/ARXIV.2205.00944

LibreCat | Files available | DOI

[7]

2022 | Conference Paper | LibreCat-ID: 33954 |

Boeddeker, C., Cord-Landwehr, T., von Neumann, T., & Haeb-Umbach, R. (2022). An Initialization Scheme for Meeting Separation with Spatial Mixture Models. Interspeech 2022. https://doi.org/10.21437/interspeech.2022-10929

LibreCat | DOI | Download (ext.)

[6]

2022 | Conference Paper | LibreCat-ID: 33958 |

Kinoshita, K., von Neumann, T., Delcroix, M., Boeddeker, C., & Haeb-Umbach, R. (2022). Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. Proc. Interspeech 2022, 1486–1490. https://doi.org/10.21437/Interspeech.2022-11408

LibreCat | DOI | Download (ext.)

[5]

2021 | Conference Paper | LibreCat-ID: 26770 |

von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2021). Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. Interspeech 2021. https://doi.org/10.21437/interspeech.2021-1177

LibreCat | Files available | DOI

[4]

2021 | Conference Paper | LibreCat-ID: 29173 |

von Neumann, T., Boeddeker, C., Kinoshita, K., Delcroix, M., & Haeb-Umbach, R. (2021). Speeding Up Permutation Invariant Training for Source Separation. Speech Communication; 14th ITG Conference, Kiel.

LibreCat | Files available

[3]

2020 | Conference Paper | LibreCat-ID: 20762 |

von Neumann, T., Kinoshita, K., Drude, L., Boeddeker, C., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). End-to-End Training of Time Domain Audio Separation and Recognition. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 7004–7008. https://doi.org/10.1109/ICASSP40776.2020.9053461

LibreCat | Files available | DOI

[2]

2020 | Conference Paper | LibreCat-ID: 20764 |

von Neumann, T., Boeddeker, C., Drude, L., Kinoshita, K., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. Proc. Interspeech 2020, 3097–3101. https://doi.org/10.21437/Interspeech.2020-2519

LibreCat | Files available | DOI

[1]

2020 | Conference Paper | LibreCat-ID: 20766 |

Kinoshita, K., von Neumann, T., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation. Proc. Interspeech 2020, 2652–2656. https://doi.org/10.21437/Interspeech.2020-2388

LibreCat | Files available | DOI
