Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).
We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.
333 Publications
2024 | Conference Paper | LibreCat-ID: 53659
Cord-Landwehr T, Boeddeker C, Zorilă C, Doddipatla R, Haeb-Umbach R. Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2024. doi:10.1109/icassp48485.2024.10445911
LibreCat
| DOI
2024 | Conference Paper | LibreCat-ID: 56272 |

Boeddeker C, Cord-Landwehr T, Haeb-Umbach R. Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment. In: Interspeech 2024. ISCA; 2024. doi:10.21437/interspeech.2024-1286
LibreCat
| DOI
| Download (ext.)
2024 | Conference Paper | LibreCat-ID: 57659 |

Vieting P, Berger S, von Neumann T, Boeddeker C, Schlüter R, Haeb-Umbach R. Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription. In: 2024 IEEE Spoken Language Technology Workshop (SLT). ; 2024.
LibreCat
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48269 |

Gburrek T, Schmalenstroeer J, Haeb-Umbach R. On the Integration of Sampling Rate Synchronization and Acoustic Beamforming. In: European Signal Processing Conference (EUSIPCO). ; 2023.
LibreCat
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48270 |

Schmalenstroeer J, Gburrek T, Haeb-Umbach R. LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices. In: ITG Conference on Speech Communication. ; 2023.
LibreCat
| Files available
2023 | Conference Paper | LibreCat-ID: 48355 |

Rautenberg F, Kuhlmann M, Wiechmann J, Seebauer F, Wagner P, Haeb-Umbach R. On Feature Importance and Interpretability of Speaker Representations. In: ITG Conference on Speech Communication. ; 2023.
LibreCat
| Files available
| Download (ext.)
| arXiv
2023 | Conference Paper | LibreCat-ID: 48410 |

Wiechmann J, Rautenberg F, Wagner P, Haeb-Umbach R. Explaining voice characteristics to novice voice practitioners-How successful is it? In: 20th International Congress of the Phonetic Sciences (ICPhS) . ; 2023.
LibreCat
| Files available
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48391
Aralikatti R, Boeddeker C, Wichern G, Subramanian A, Le Roux J. Reverberation as Supervision For Speech Separation. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2023. doi:10.1109/icassp49357.2023.10095022
LibreCat
| DOI
2023 | Conference Paper | LibreCat-ID: 46069
Seebauer F, Kuhlmann M, Haeb-Umbach R, Wagner P. Re-examining the quality dimensions of synthetic speech. In: 12th Speech Synthesis Workshop (SSW) 2023. ; 2023.
LibreCat
2023 | Journal Article | LibreCat-ID: 35602 |

von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2023;31:576-589. doi:10.1109/taslp.2022.3228629
LibreCat
| Files available
| DOI
2023 | Conference Paper | LibreCat-ID: 49109 |

Gburrek T, Schmalenstroeer J, Haeb-Umbach R. Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks. In: Proc. Asilomar Conference on Signals, Systems, and Computers. ; 2023.
LibreCat
| Files available
2023 | Conference Paper | LibreCat-ID: 44849 |

Rautenberg F, Kuhlmann M, Ebbers J, et al. Speech Disentanglement for Analysis and Modification of Acoustic and Perceptual Speaker Characteristics. In: Fortschritte Der Akustik - DAGA 2023. ; 2023:1409-1412.
LibreCat
| Files available
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 49111
Ebbers J, Haeb-Umbach R, Serizel R. Post-Processing Independent Evaluation of Sound Event Detection Systems. In: Proceedings of the 8th Detection and Classification of Acoustic Scenes and Events 2023 Workshop (DCASE2023). ; 2023:36–40.
LibreCat
| Files available
2023 | Conference Paper | LibreCat-ID: 57098
Seebauer F, Kuhlmann M, Häb-Umbach R, Wagner P. DISCERNING DIMENSIONS OF QUALITY FOR STATE OF THE ART SYNTHETIC SPEECH. In: Proceedings of the 20th International Congress of Phonetic Sciences. ; 2023.
LibreCat
2023 | Conference Paper | LibreCat-ID: 57086
Kuhlmann M, Meise A, Seebauer F, Wagner P, Häb-Umbach R. Investigating Speaker Embedding Disentanglement on Natural Read Speech. In: Speech Communication; 15th ITG Conference. ; 2023:121–125.
LibreCat
2023 | Conference Paper | LibreCat-ID: 48281 |

von Neumann T, Boeddeker C, Kinoshita K, Delcroix M, Haeb-Umbach R. On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2023. doi:10.1109/icassp49357.2023.10094784
LibreCat
| Files available
| DOI
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48275 |

von Neumann T, Boeddeker C, Delcroix M, Haeb-Umbach R. MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. In: Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments. ; 2023.
LibreCat
| Files available
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 47128 |

Cord-Landwehr T, Boeddeker C, Zorilă C, Doddipatla R, Haeb-Umbach R. Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2023. doi:10.1109/icassp49357.2023.10095370
LibreCat
| Files available
| DOI
2023 | Conference Paper | LibreCat-ID: 47129 |

Cord-Landwehr T, Boeddeker C, Zorilă C, Doddipatla R, Haeb-Umbach R. A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures. In: INTERSPEECH 2023. ISCA; 2023. doi:10.21437/interspeech.2023-1379
LibreCat
| Files available
| DOI
2023 | Conference Paper | LibreCat-ID: 54439 |

Boeddeker C, Cord-Landwehr T, von Neumann T, Haeb-Umbach R. Multi-stage diarization refinement for the CHiME-7 DASR scenario. In: 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023). ISCA; 2023. doi:10.21437/chime.2023-10
LibreCat
| DOI
| Download (ext.)