49 Publications
2024 | Preprint | LibreCat-ID: 56273 |

Cornell, S., Park, T., Huang, S., Boeddeker, C., Chang, X., Maciejewski, M., Wiesner, M., Garcia, P., & Watanabe, S. (2024). The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization. In arXiv:2407.16447.
2024 | Conference Paper | LibreCat-ID: 56004 |

von Neumann, T., Boeddeker, C., Cord-Landwehr, T., Delcroix, M., & Haeb-Umbach, R. (2024). Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization. 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW). https://doi.org/10.1109/icasspw62465.2024.10625894
2024 | Journal Article | LibreCat-ID: 52958 |

Boeddeker, C., Subramanian, A. S., Wichern, G., Haeb-Umbach, R., & Le Roux, J. (2024). TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 32, 1185–1197. https://doi.org/10.1109/taslp.2024.3350887
2024 | Conference Paper | LibreCat-ID: 53659 |

Cord-Landwehr, T., Boeddeker, C., Zorilă, C., Doddipatla, R., & Haeb-Umbach, R. (2024). Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul. https://doi.org/10.1109/icassp48485.2024.10445911
2024 | Preprint | LibreCat-ID: 57085 |

Cord-Landwehr, T., Boeddeker, C., & Haeb-Umbach, R. (2024). Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models.
2024 | Conference Paper | LibreCat-ID: 56272 |

Boeddeker, C., Cord-Landwehr, T., & Haeb-Umbach, R. (2024). Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment. Interspeech 2024. https://doi.org/10.21437/interspeech.2024-1286
2024 | Conference Paper | LibreCat-ID: 57659 |

Vieting, P., Berger, S., von Neumann, T., Boeddeker, C., Schlüter, R., & Haeb-Umbach, R. (2024). Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription. 2024 IEEE Spoken Language Technology Workshop (SLT).
2023 | Conference Paper | LibreCat-ID: 48391 |

Aralikatti, R., Boeddeker, C., Wichern, G., Subramanian, A., & Le Roux, J. (2023). Reverberation as Supervision For Speech Separation. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp49357.2023.10095022
2023 | Journal Article | LibreCat-ID: 35602 |

von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31, 576–589. https://doi.org/10.1109/taslp.2022.3228629
2023 | Conference Paper | LibreCat-ID: 48281 |

von Neumann, T., Boeddeker, C., Kinoshita, K., Delcroix, M., & Haeb-Umbach, R. (2023). On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp49357.2023.10094784
2023 | Conference Paper | LibreCat-ID: 48275 |

von Neumann, T., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin.
2023 | Conference Paper | LibreCat-ID: 47128 |

Cord-Landwehr, T., Boeddeker, C., Zorilă, C., Doddipatla, R., & Haeb-Umbach, R. (2023). Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes. https://doi.org/10.1109/icassp49357.2023.10095370
2023 | Conference Paper | LibreCat-ID: 47129 |

Cord-Landwehr, T., Boeddeker, C., Zorilă, C., Doddipatla, R., & Haeb-Umbach, R. (2023). A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures. INTERSPEECH 2023. https://doi.org/10.21437/interspeech.2023-1379
2023 | Conference Paper | LibreCat-ID: 54439 |

Boeddeker, C., Cord-Landwehr, T., von Neumann, T., & Haeb-Umbach, R. (2023). Multi-stage diarization refinement for the CHiME-7 DASR scenario. 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023). https://doi.org/10.21437/chime.2023-10
2023 | Conference Paper | LibreCat-ID: 48390 |

Berger, S., Vieting, P., Boeddeker, C., Schlüter, R., & Haeb-Umbach, R. (2023). Mixture Encoder for Joint Speech Separation and Recognition. INTERSPEECH 2023. https://doi.org/10.21437/interspeech.2023-1815
2022 | Journal Article | LibreCat-ID: 33669 |

Zhang, W., Chang, X., Boeddeker, C., Nakatani, T., Watanabe, S., & Qian, Y. (2022). End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party. IEEE/ACM Transactions on Audio, Speech, and Language Processing. https://doi.org/10.1109/TASLP.2022.3209942
2022 | Conference Paper | LibreCat-ID: 33847 |

Cord-Landwehr, T., von Neumann, T., Boeddeker, C., & Haeb-Umbach, R. (2022). MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg.
2022 | Conference Paper | LibreCat-ID: 33848 |

Cord-Landwehr, T., Boeddeker, C., von Neumann, T., Zorilă, C., Doddipatla, R., & Haeb-Umbach, R. (2022). Monaural source separation: From anechoic to reverberant environments. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC).
2022 | Conference Paper | LibreCat-ID: 33819 |

von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2022). SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp43922.2022.9746757
2022 | Misc | LibreCat-ID: 33816 |

Gburrek, T., Boeddeker, C., von Neumann, T., Cord-Landwehr, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2022). A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv. https://doi.org/10.48550/ARXIV.2205.00944