49 Publications
2024 | Preprint | LibreCat-ID: 56273
S. Cornell et al., “The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization,” arXiv:2407.16447. 2024.
2024 | Conference Paper | LibreCat-ID: 56004
T. von Neumann, C. Boeddeker, T. Cord-Landwehr, M. Delcroix, and R. Haeb-Umbach, “Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization,” 2024, doi: 10.1109/icasspw62465.2024.10625894.
2024 | Journal Article | LibreCat-ID: 52958
C. Boeddeker, A. S. Subramanian, G. Wichern, R. Haeb-Umbach, and J. Le Roux, “TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 1185–1197, 2024, doi: 10.1109/taslp.2024.3350887.
2024 | Conference Paper | LibreCat-ID: 53659
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios,” presented at the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul, 2024, doi: 10.1109/icassp48485.2024.10445911.
2024 | Preprint | LibreCat-ID: 57085
T. Cord-Landwehr, C. Boeddeker, and R. Haeb-Umbach, “Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models.” 2024.
2024 | Conference Paper | LibreCat-ID: 56272
C. Boeddeker, T. Cord-Landwehr, and R. Haeb-Umbach, “Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment,” 2024, doi: 10.21437/interspeech.2024-1286.
2024 | Conference Paper | LibreCat-ID: 57659
P. Vieting, S. Berger, T. von Neumann, C. Boeddeker, R. Schlüter, and R. Haeb-Umbach, “Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription,” 2024.
2023 | Conference Paper | LibreCat-ID: 48391
R. Aralikatti, C. Boeddeker, G. Wichern, A. Subramanian, and J. Le Roux, “Reverberation as Supervision For Speech Separation,” 2023, doi: 10.1109/icassp49357.2023.10095022.
2023 | Journal Article | LibreCat-ID: 35602
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 576–589, 2023, doi: 10.1109/taslp.2022.3228629.
2023 | Conference Paper | LibreCat-ID: 48281
T. von Neumann, C. Boeddeker, K. Kinoshita, M. Delcroix, and R. Haeb-Umbach, “On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems,” 2023, doi: 10.1109/icassp49357.2023.10094784.
2023 | Conference Paper | LibreCat-ID: 48275
T. von Neumann, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems,” presented at the CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin, 2023.
2023 | Conference Paper | LibreCat-ID: 47128
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization,” presented at the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes, 2023, doi: 10.1109/icassp49357.2023.10095370.
2023 | Conference Paper | LibreCat-ID: 47129
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures,” 2023, doi: 10.21437/interspeech.2023-1379.
2023 | Conference Paper | LibreCat-ID: 54439
C. Boeddeker, T. Cord-Landwehr, T. von Neumann, and R. Haeb-Umbach, “Multi-stage diarization refinement for the CHiME-7 DASR scenario,” 2023, doi: 10.21437/chime.2023-10.
2023 | Conference Paper | LibreCat-ID: 48390
S. Berger, P. Vieting, C. Boeddeker, R. Schlüter, and R. Haeb-Umbach, “Mixture Encoder for Joint Speech Separation and Recognition,” 2023, doi: 10.21437/interspeech.2023-1815.
2022 | Journal Article | LibreCat-ID: 33669
W. Zhang, X. Chang, C. Boeddeker, T. Nakatani, S. Watanabe, and Y. Qian, “End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022, doi: 10.1109/TASLP.2022.3209942.
2022 | Conference Paper | LibreCat-ID: 33847
T. Cord-Landwehr, T. von Neumann, C. Boeddeker, and R. Haeb-Umbach, “MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator,” presented at the 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg, 2022.
2022 | Conference Paper | LibreCat-ID: 33848
T. Cord-Landwehr, C. Boeddeker, T. von Neumann, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “Monaural source separation: From anechoic to reverberant environments,” presented at the 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.
2022 | Conference Paper | LibreCat-ID: 33819
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “SA-SDR: A Novel Loss Function for Separation of Meeting Style Data,” 2022, doi: 10.1109/icassp43922.2022.9746757.
2022 | Misc | LibreCat-ID: 33816
T. Gburrek, C. Boeddeker, T. von Neumann, T. Cord-Landwehr, J. Schmalenstroeer, and R. Haeb-Umbach, A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv, 2022.