14 Publications

Mark all

[14]
2023 | Journal Article | LibreCat-ID: 35602 | OA
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31, 576–589. https://doi.org/10.1109/taslp.2022.3228629
LibreCat | Files available | DOI
 
[13]
2023 | Conference Paper | LibreCat-ID: 48281 | OA
von Neumann, T., Boeddeker, C., Kinoshita, K., Delcroix, M., & Haeb-Umbach, R. (2023). On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp49357.2023.10094784
LibreCat | Files available | DOI | Download (ext.)
 
[12]
2023 | Conference Paper | LibreCat-ID: 48275 | OA
von Neumann, T., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments. CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin.
LibreCat | Files available | Download (ext.)
 
[11]
2022 | Conference Paper | LibreCat-ID: 33954 | OA
Boeddeker, C., Cord-Landwehr, T., von Neumann, T., & Haeb-Umbach, R. (2022). An Initialization Scheme for Meeting Separation with Spatial Mixture Models. Interspeech 2022. https://doi.org/10.21437/interspeech.2022-10929
LibreCat | DOI | Download (ext.)
 
[10]
2022 | Conference Paper | LibreCat-ID: 33958
Kinoshita, K., von Neumann, T., Delcroix, M., Boeddeker, C., & Haeb-Umbach, R. (2022). Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. Proc. Interspeech 2022, 1486–1490. https://doi.org/10.21437/Interspeech.2022-11408
LibreCat | DOI
 
[9]
2022 | Conference Paper | LibreCat-ID: 33819 | OA
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2022). SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp43922.2022.9746757
LibreCat | Files available | DOI
 
[8]
2022 | Conference Paper | LibreCat-ID: 33847 | OA
Cord-Landwehr, T., von Neumann, T., Boeddeker, C., & Haeb-Umbach, R. (2022). MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg.
LibreCat | Files available | arXiv
 
[7]
2022 | Conference Paper | LibreCat-ID: 33848 | OA
Cord-Landwehr, T., Boeddeker, C., von Neumann, T., Zorila, C., Doddipatla, R., & Haeb-Umbach, R. (2022). Monaural source separation: From anechoic to reverberant environments. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 2022 International Workshop on Acoustic Signal Enhancement (IWAENC).
LibreCat | Files available | arXiv
 
[6]
2022 | Misc | LibreCat-ID: 33816 | OA
Gburrek, T., Boeddeker, C., von Neumann, T., Cord-Landwehr, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2022). A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv. https://doi.org/10.48550/ARXIV.2205.00944
LibreCat | Files available | DOI
 
[5]
2021 | Conference Paper | LibreCat-ID: 26770 | OA
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2021). Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. Interspeech 2021. Interspeech. https://doi.org/10.21437/interspeech.2021-1177
LibreCat | Files available | DOI
 
[4]
2021 | Conference Paper | LibreCat-ID: 29173 | OA
von Neumann, T., Boeddeker, C., Kinoshita, K., Delcroix, M., & Haeb-Umbach, R. (2021). Speeding Up Permutation Invariant Training for Source Separation. Speech Communication; 14th ITG Conference. Speech Communication; 14th ITG Conference, Kiel.
LibreCat | Files available
 
[3]
2020 | Conference Paper | LibreCat-ID: 20762 | OA
von Neumann, T., Kinoshita, K., Drude, L., Boeddeker, C., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). End-to-End Training of Time Domain Audio Separation and Recognition. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 7004–7008. https://doi.org/10.1109/ICASSP40776.2020.9053461
LibreCat | Files available | DOI
 
[2]
2020 | Conference Paper | LibreCat-ID: 20764 | OA
von Neumann, T., Boeddeker, C., Drude, L., Kinoshita, K., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. Proc. Interspeech 2020, 3097–3101. https://doi.org/10.21437/Interspeech.2020-2519
LibreCat | Files available | DOI
 
[1]
2020 | Conference Paper | LibreCat-ID: 20766 | OA
Kinoshita, K., von Neumann, T., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation. Proc. Interspeech 2020, 2652–2656. https://doi.org/10.21437/Interspeech.2020-2388
LibreCat | Files available | DOI
 

Search

Filter Publications

Display / Sort

Citation Style: APA

Export / Embed

14 Publications

Mark all

[14]
2023 | Journal Article | LibreCat-ID: 35602 | OA
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31, 576–589. https://doi.org/10.1109/taslp.2022.3228629
LibreCat | Files available | DOI
 
[13]
2023 | Conference Paper | LibreCat-ID: 48281 | OA
von Neumann, T., Boeddeker, C., Kinoshita, K., Delcroix, M., & Haeb-Umbach, R. (2023). On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp49357.2023.10094784
LibreCat | Files available | DOI | Download (ext.)
 
[12]
2023 | Conference Paper | LibreCat-ID: 48275 | OA
von Neumann, T., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments. CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin.
LibreCat | Files available | Download (ext.)
 
[11]
2022 | Conference Paper | LibreCat-ID: 33954 | OA
Boeddeker, C., Cord-Landwehr, T., von Neumann, T., & Haeb-Umbach, R. (2022). An Initialization Scheme for Meeting Separation with Spatial Mixture Models. Interspeech 2022. https://doi.org/10.21437/interspeech.2022-10929
LibreCat | DOI | Download (ext.)
 
[10]
2022 | Conference Paper | LibreCat-ID: 33958
Kinoshita, K., von Neumann, T., Delcroix, M., Boeddeker, C., & Haeb-Umbach, R. (2022). Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. Proc. Interspeech 2022, 1486–1490. https://doi.org/10.21437/Interspeech.2022-11408
LibreCat | DOI
 
[9]
2022 | Conference Paper | LibreCat-ID: 33819 | OA
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2022). SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp43922.2022.9746757
LibreCat | Files available | DOI
 
[8]
2022 | Conference Paper | LibreCat-ID: 33847 | OA
Cord-Landwehr, T., von Neumann, T., Boeddeker, C., & Haeb-Umbach, R. (2022). MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg.
LibreCat | Files available | arXiv
 
[7]
2022 | Conference Paper | LibreCat-ID: 33848 | OA
Cord-Landwehr, T., Boeddeker, C., von Neumann, T., Zorila, C., Doddipatla, R., & Haeb-Umbach, R. (2022). Monaural source separation: From anechoic to reverberant environments. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 2022 International Workshop on Acoustic Signal Enhancement (IWAENC).
LibreCat | Files available | arXiv
 
[6]
2022 | Misc | LibreCat-ID: 33816 | OA
Gburrek, T., Boeddeker, C., von Neumann, T., Cord-Landwehr, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2022). A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv. https://doi.org/10.48550/ARXIV.2205.00944
LibreCat | Files available | DOI
 
[5]
2021 | Conference Paper | LibreCat-ID: 26770 | OA
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2021). Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. Interspeech 2021. Interspeech. https://doi.org/10.21437/interspeech.2021-1177
LibreCat | Files available | DOI
 
[4]
2021 | Conference Paper | LibreCat-ID: 29173 | OA
von Neumann, T., Boeddeker, C., Kinoshita, K., Delcroix, M., & Haeb-Umbach, R. (2021). Speeding Up Permutation Invariant Training for Source Separation. Speech Communication; 14th ITG Conference. Speech Communication; 14th ITG Conference, Kiel.
LibreCat | Files available
 
[3]
2020 | Conference Paper | LibreCat-ID: 20762 | OA
von Neumann, T., Kinoshita, K., Drude, L., Boeddeker, C., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). End-to-End Training of Time Domain Audio Separation and Recognition. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 7004–7008. https://doi.org/10.1109/ICASSP40776.2020.9053461
LibreCat | Files available | DOI
 
[2]
2020 | Conference Paper | LibreCat-ID: 20764 | OA
von Neumann, T., Boeddeker, C., Drude, L., Kinoshita, K., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. Proc. Interspeech 2020, 3097–3101. https://doi.org/10.21437/Interspeech.2020-2519
LibreCat | Files available | DOI
 
[1]
2020 | Conference Paper | LibreCat-ID: 20766 | OA
Kinoshita, K., von Neumann, T., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation. Proc. Interspeech 2020, 2652–2656. https://doi.org/10.21437/Interspeech.2020-2388
LibreCat | Files available | DOI
 

Search

Filter Publications

Display / Sort

Citation Style: APA

Export / Embed