Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).
We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.
49 Publications
2024 | Preprint | LibreCat-ID: 56273 |

Cornell S, Park T, Huang S, et al. The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization. arXiv:240716447. Published online 2024.
LibreCat
| Download (ext.)
| arXiv
2024 | Conference Paper | LibreCat-ID: 56004 |

von Neumann T, Boeddeker C, Cord-Landwehr T, Delcroix M, Haeb-Umbach R. Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization. In: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW). IEEE; 2024. doi:10.1109/icasspw62465.2024.10625894
LibreCat
| Files available
| DOI
2024 | Journal Article | LibreCat-ID: 52958 |

Boeddeker C, Subramanian AS, Wichern G, Haeb-Umbach R, Le Roux J. TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2024;32:1185-1197. doi:10.1109/taslp.2024.3350887
LibreCat
| DOI
| Download (ext.)
2024 | Conference Paper | LibreCat-ID: 53659
Cord-Landwehr T, Boeddeker C, Zorilă C, Doddipatla R, Haeb-Umbach R. Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2024. doi:10.1109/icassp48485.2024.10445911
LibreCat
| DOI
2024 | Preprint | LibreCat-ID: 57085 |

Cord-Landwehr T, Boeddeker C, Haeb-Umbach R. Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models. Published online 2024.
LibreCat
| Download (ext.)
2024 | Conference Paper | LibreCat-ID: 56272 |

Boeddeker C, Cord-Landwehr T, Haeb-Umbach R. Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment. In: Interspeech 2024. ISCA; 2024. doi:10.21437/interspeech.2024-1286
LibreCat
| DOI
| Download (ext.)
2024 | Conference Paper | LibreCat-ID: 57659 |

Vieting P, Berger S, von Neumann T, Boeddeker C, Schlüter R, Haeb-Umbach R. Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription. In: 2024 IEEE Spoken Language Technology Workshop (SLT). ; 2024.
LibreCat
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48391
Aralikatti R, Boeddeker C, Wichern G, Subramanian A, Le Roux J. Reverberation as Supervision For Speech Separation. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2023. doi:10.1109/icassp49357.2023.10095022
LibreCat
| DOI
2023 | Journal Article | LibreCat-ID: 35602 |

von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2023;31:576-589. doi:10.1109/taslp.2022.3228629
LibreCat
| Files available
| DOI
2023 | Conference Paper | LibreCat-ID: 48281 |

von Neumann T, Boeddeker C, Kinoshita K, Delcroix M, Haeb-Umbach R. On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2023. doi:10.1109/icassp49357.2023.10094784
LibreCat
| Files available
| DOI
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48275 |

von Neumann T, Boeddeker C, Delcroix M, Haeb-Umbach R. MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. In: Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments. ; 2023.
LibreCat
| Files available
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 47128 |

Cord-Landwehr T, Boeddeker C, Zorilă C, Doddipatla R, Haeb-Umbach R. Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2023. doi:10.1109/icassp49357.2023.10095370
LibreCat
| Files available
| DOI
2023 | Conference Paper | LibreCat-ID: 47129 |

Cord-Landwehr T, Boeddeker C, Zorilă C, Doddipatla R, Haeb-Umbach R. A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures. In: INTERSPEECH 2023. ISCA; 2023. doi:10.21437/interspeech.2023-1379
LibreCat
| Files available
| DOI
2023 | Conference Paper | LibreCat-ID: 54439 |

Boeddeker C, Cord-Landwehr T, von Neumann T, Haeb-Umbach R. Multi-stage diarization refinement for the CHiME-7 DASR scenario. In: 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023). ISCA; 2023. doi:10.21437/chime.2023-10
LibreCat
| DOI
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48390 |

Berger S, Vieting P, Boeddeker C, Schlüter R, Haeb-Umbach R. Mixture Encoder for Joint Speech Separation and Recognition. In: INTERSPEECH 2023. ISCA; 2023. doi:10.21437/interspeech.2023-1815
LibreCat
| DOI
| Download (ext.)
2022 | Journal Article | LibreCat-ID: 33669 |

Zhang W, Chang X, Boeddeker C, Nakatani T, Watanabe S, Qian Y. End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party. IEEE/ACM Transactions on Audio, Speech, and Language Processing. Published online 2022. doi:10.1109/TASLP.2022.3209942
LibreCat
| Files available
| DOI
2022 | Conference Paper | LibreCat-ID: 33847 |

Cord-Landwehr T, von Neumann T, Boeddeker C, Haeb-Umbach R. MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). ; 2022.
LibreCat
| Files available
| arXiv
2022 | Conference Paper | LibreCat-ID: 33848 |

Cord-Landwehr T, Boeddeker C, von Neumann T, Zorila C, Doddipatla R, Haeb-Umbach R. Monaural source separation: From anechoic to reverberant environments. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE; 2022.
LibreCat
| Files available
| arXiv
2022 | Conference Paper | LibreCat-ID: 33819 |

von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. In: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2022. doi:10.1109/icassp43922.2022.9746757
LibreCat
| Files available
| DOI
2022 | Misc | LibreCat-ID: 33816 |

Gburrek T, Boeddeker C, von Neumann T, Cord-Landwehr T, Schmalenstroeer J, Haeb-Umbach R. A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv; 2022. doi:10.48550/ARXIV.2205.00944
LibreCat
| Files available
| DOI