338 Publications
2025 | Conference Paper | LibreCat-ID: 59999
Rautenberg, Frederik, Michael Kuhlmann, Fritz Seebauer, Jana Wiechmann, Petra Wagner, and Reinhold Haeb-Umbach. “Speech Synthesis along Perceptual Voice Quality Dimensions.” In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2025. https://doi.org/10.1109/icassp49660.2025.10888012.
2025 | Conference Paper | LibreCat-ID: 61047
Rautenberg, Frederik, Fritz Seebauer, Jana Wiechmann, Michael Kuhlmann, Petra Wagner, and Reinhold Haeb-Umbach. “Synthesizing Speech with Selected Perceptual Voice Qualities – A Case Study with Creaky Voice.” In Interspeech 2025. ISCA, 2025. https://doi.org/10.21437/Interspeech.2025-1443.
2025 | Conference Paper | LibreCat-ID: 61079
Cord-Landwehr, Tobias, Tobias Gburrek, Marc Deegen, and Reinhold Haeb-Umbach. “Spatio-Spectral Diarization of Meetings by Combining TDOA-Based Segmentation and Speaker Embedding-Based Clustering.” In Proceedings of INTERSPEECH, 2025. https://doi.org/10.21437/Interspeech.2025-1663.
2025 | Conference Paper | LibreCat-ID: 62164
Kuhlmann, Michael, Fritz Seebauer, Petra Wagner, and Reinhold Häb-Umbach. “Towards Frame-Level Quality Predictions of Synthetic Speech.” In Interspeech 2025. ISCA, 2025. https://doi.org/10.21437/interspeech.2025-2190.
2025 | Conference Paper | LibreCat-ID: 62174
Meise, Adrian, Tobias Cord-Landwehr, and Reinhold Haeb-Umbach. “On the Application of Diffusion Models for Simultaneous Denoising and Dereverberation.” In ITG Conference on Speech Communication, 2025.
2025 | Conference Paper | LibreCat-ID: 62163
Werning, Alexander, and Reinhold Häb-Umbach. “A Fully Zero-Shot Approach to Obtaining Specialized and Compact Audio Tagging Models.” In Proceedings of the 16th ITG Conference on Speech Communication, edited by Sebastian Möller, Timo Gerkmann, and Dorothea Kolossa, 76–80. Berlin, 2025.
2025 | Conference Paper | LibreCat-ID: 59900
Werning, Alexander, and Reinhold Häb-Umbach. “Distilling Efficient Audio Models Using Data Pruning with CLAP.” In Proceedings of DAS|DAGA 2025, edited by Deutsche Gesellschaft für Akustik e.V. (DEGA), Berlin. Copenhagen, 2025.
2024 | Preprint | LibreCat-ID: 56273
Cornell, Samuele, Taejin Park, Steve Huang, Christoph Boeddeker, Xuankai Chang, Matthew Maciejewski, Matthew Wiesner, Paola Garcia, and Shinji Watanabe. “The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization.” ArXiv:2407.16447, 2024.
2024 | Conference Paper | LibreCat-ID: 57031
Gburrek, Tobias, Adrian Meise, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation Models.” In 2024 18th International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE, 2024. https://doi.org/10.1109/iwaenc61483.2024.10694103.
2024 | Journal Article | LibreCat-ID: 52958
Boeddeker, Christoph, Aswin Shanmugam Subramanian, Gordon Wichern, Reinhold Haeb-Umbach, and Jonathan Le Roux. “TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings.” IEEE/ACM Transactions on Audio, Speech, and Language Processing 32 (2024): 1185–97. https://doi.org/10.1109/taslp.2024.3350887.
2024 | Report | LibreCat-ID: 57161
Werning, Alexander, and Reinhold Haeb-Umbach. UPB-NT Submission to DCASE24: Dataset Pruning for Targeted Knowledge Distillation, 2024.
2024 | Conference Paper | LibreCat-ID: 57099
Xie, Yuying, Michael Kuhlmann, Frederik Rautenberg, Zheng-Hua Tan, and Reinhold Häb-Umbach. “Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder.” In 2024 32nd European Signal Processing Conference (EUSIPCO), 436–40, 2024.
2024 | Conference Paper | LibreCat-ID: 56004
Neumann, Thilo von, Christoph Boeddeker, Tobias Cord-Landwehr, Marc Delcroix, and Reinhold Haeb-Umbach. “Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization.” In 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW). IEEE, 2024. https://doi.org/10.1109/icasspw62465.2024.10625894.
2024 | Conference Paper | LibreCat-ID: 56272
Boeddeker, Christoph, Tobias Cord-Landwehr, and Reinhold Haeb-Umbach. “Once More Diarization: Improving Meeting Transcription Systems through Segment-Level Speaker Reassignment.” In Interspeech 2024. ISCA, 2024. https://doi.org/10.21437/interspeech.2024-1286.
2024 | Conference Paper | LibreCat-ID: 57659
Vieting, Peter, Simon Berger, Thilo von Neumann, Christoph Boeddeker, Ralf Schlüter, and Reinhold Haeb-Umbach. “Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription.” In 2024 IEEE Spoken Language Technology Workshop (SLT), 2024.
2024 | Conference Paper | LibreCat-ID: 57085
Cord-Landwehr, Tobias, Christoph Boeddeker, and Reinhold Haeb-Umbach. “Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models.” In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024. https://doi.org/10.1109/ICASSP49660.2025.10888445.
2024 | Conference Paper | LibreCat-ID: 53659
Cord-Landwehr, Tobias, Christoph Boeddeker, Cătălin Zorilă, Rama Doddipatla, and Reinhold Haeb-Umbach. “Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios.” In ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2024. https://doi.org/10.1109/icassp48485.2024.10445911.
2024 | Conference Paper | LibreCat-ID: 57160
Werning, Alexander, and Reinhold Haeb-Umbach. “Target-Specific Dataset Pruning for Compression of Audio Tagging Models.” In 32nd European Signal Processing Conference (EUSIPCO 2024), 2024.
2023 | Conference Paper | LibreCat-ID: 48269
Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “On the Integration of Sampling Rate Synchronization and Acoustic Beamforming.” In European Signal Processing Conference (EUSIPCO), 2023.
2023 | Conference Paper | LibreCat-ID: 48270
Schmalenstroeer, Joerg, Tobias Gburrek, and Reinhold Haeb-Umbach. “LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices.” In ITG Conference on Speech Communication, 2023.