LibreCat – Publication List Manager

Thilo Christoph von Neumann

Nachrichtentechnik (NT) / Heinz Nixdorf Institut

tvn@mail.uni-paderborn.de

49870

17 Publications

[17]

2024 | Conference Paper | LibreCat-ID: 56004 |

von Neumann, Thilo, et al. “Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization.” 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW), IEEE, 2024, doi:10.1109/icasspw62465.2024.10625894.

LibreCat | Files available | DOI

[16]

2024 | Conference Paper | LibreCat-ID: 57659 |

Vieting, Peter, et al. “Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription.” 2024 IEEE Spoken Language Technology Workshop (SLT), 2024.

LibreCat | Download (ext.)

[15]

2023 | Journal Article | LibreCat-ID: 35602 |

von Neumann, Thilo, et al. “Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, Institute of Electrical and Electronics Engineers (IEEE), 2023, pp. 576–89, doi:10.1109/taslp.2022.3228629.

LibreCat | Files available | DOI

[14]

2023 | Conference Paper | LibreCat-ID: 48281 |

von Neumann, Thilo, et al. “On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems.” ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023, doi:10.1109/icassp49357.2023.10094784.

LibreCat | Files available | DOI | Download (ext.)

[13]

2023 | Conference Paper | LibreCat-ID: 48275 |

von Neumann, Thilo, et al. “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems.” Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments, 2023.

LibreCat | Files available | Download (ext.)

[12]

2023 | Conference Paper | LibreCat-ID: 54439 |

Boeddeker, Christoph, et al. “Multi-Stage Diarization Refinement for the CHiME-7 DASR Scenario.” 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023), ISCA, 2023, doi:10.21437/chime.2023-10.

LibreCat | DOI | Download (ext.)

[11]

2022 | Conference Paper | LibreCat-ID: 33847 |

Cord-Landwehr, Tobias, et al. “MMS-MSG: A Multi-Purpose Multi-Speaker Mixture Signal Generator.” 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.

LibreCat | Files available | arXiv

[10]

2022 | Conference Paper | LibreCat-ID: 33848 |

Cord-Landwehr, Tobias, et al. “Monaural Source Separation: From Anechoic to Reverberant Environments.” 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), IEEE, 2022.

LibreCat | Files available | arXiv

[9]

2022 | Conference Paper | LibreCat-ID: 33819 |

von Neumann, Thilo, et al. “SA-SDR: A Novel Loss Function for Separation of Meeting Style Data.” ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2022, doi:10.1109/icassp43922.2022.9746757.

LibreCat | Files available | DOI

[8]

2022 | Misc | LibreCat-ID: 33816 |

Gburrek, Tobias, et al. A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv, 2022, doi:10.48550/ARXIV.2205.00944.

LibreCat | Files available | DOI

[7]

2022 | Conference Paper | LibreCat-ID: 33954 |

Boeddeker, Christoph, et al. “An Initialization Scheme for Meeting Separation with Spatial Mixture Models.” Interspeech 2022, ISCA, 2022, doi:10.21437/interspeech.2022-10929.

LibreCat | DOI | Download (ext.)

[6]

2022 | Conference Paper | LibreCat-ID: 33958

Kinoshita, Keisuke, et al. “Utterance-by-Utterance Overlap-Aware Neural Diarization with Graph-PIT.” Proc. Interspeech 2022, ISCA, 2022, pp. 1486–90, doi:10.21437/Interspeech.2022-11408.

LibreCat | DOI | Download (ext.)

[5]

2021 | Conference Paper | LibreCat-ID: 26770 |

von Neumann, Thilo, et al. “Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers.” Interspeech 2021, 2021, doi:10.21437/interspeech.2021-1177.

LibreCat | Files available | DOI

[4]

2021 | Conference Paper | LibreCat-ID: 29173 |

von Neumann, Thilo, et al. “Speeding Up Permutation Invariant Training for Source Separation.” Speech Communication; 14th ITG Conference, 2021.

LibreCat | Files available

[3]

2020 | Conference Paper | LibreCat-ID: 20762 |

von Neumann, Thilo, et al. “End-to-End Training of Time Domain Audio Separation and Recognition.” ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp. 7004–08, doi:10.1109/ICASSP40776.2020.9053461.

LibreCat | Files available | DOI

[2]

2020 | Conference Paper | LibreCat-ID: 20764 |

von Neumann, Thilo, et al. “Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR.” Proc. Interspeech 2020, 2020, pp. 3097–101, doi:10.21437/Interspeech.2020-2519.

LibreCat | Files available | DOI

[1]

2020 | Conference Paper | LibreCat-ID: 20766 |

Kinoshita, Keisuke, et al. “Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and Its Application to Speaker Stream Separation.” Proc. Interspeech 2020, 2020, pp. 2652–56, doi:10.21437/Interspeech.2020-2388.

LibreCat | Files available | DOI
