LibreCat – Publication List Manager

339 Publications

2026 | Conference Paper | LibreCat-ID: 65606

Meise, A. T., Cord-Landwehr, T., Boeddeker, C., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2026). Loose Coupling of Spectral and Spatial Models for Multi-Channel Diarization and Enhancement of Meetings in Dynamic Environments. ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona. https://doi.org/10.1109/icassp55912.2026.11463540

2025 | Conference Paper | LibreCat-ID: 59999

Rautenberg, F., Kuhlmann, M., Seebauer, F., Wiechmann, J., Wagner, P., & Haeb-Umbach, R. (2025). Speech Synthesis along Perceptual Voice Quality Dimensions. ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Hyderabad, India. https://doi.org/10.1109/icassp49660.2025.10888012

2025 | Conference Paper | LibreCat-ID: 61079

Cord-Landwehr, T., Gburrek, T., Deegen, M., & Haeb-Umbach, R. (2025). Spatio-spectral diarization of meetings by combining TDOA-based segmentation and speaker embedding-based clustering. Proceedings of INTERSPEECH. Interspeech 2025, Rotterdam. https://doi.org/10.21437/Interspeech.2025-1663

2025 | Conference Paper | LibreCat-ID: 62164

Kuhlmann, M., Seebauer, F., Wagner, P., & Häb-Umbach, R. (2025). Towards Frame-level Quality Predictions of Synthetic Speech. Interspeech 2025. https://doi.org/10.21437/interspeech.2025-2190

2025 | Conference Paper | LibreCat-ID: 62163

Werning, A., & Häb-Umbach, R. (2025). A Fully Zero-Shot Approach to Obtaining Specialized and Compact Audio Tagging Models. In S. Möller, T. Gerkmann, & D. Kolossa (Eds.), Proceedings of the 16th ITG Conference on Speech Communication (pp. 76–80).

2025 | Conference Paper | LibreCat-ID: 59900

Werning, A., & Häb-Umbach, R. (2025). Distilling Efficient Audio Models using Data Pruning with CLAP. In Deutsche Gesellschaft für Akustik e.V. (DEGA), Berlin, 2025 (Ed.), Proceedings of DAS|DAGA 2025.

2025 | Conference Paper | LibreCat-ID: 62174

Meise, A. T., Cord-Landwehr, T., & Haeb-Umbach, R. (2025). On the Application of Diffusion Models for Simultaneous Denoising and Dereverberation. ITG Conference on Speech Communication. ITG Conference on Speech Communication, Berlin.

2025 | Conference Paper | LibreCat-ID: 61047

Rautenberg, F., Seebauer, F., Wiechmann, J., Kuhlmann, M., Wagner, P., & Haeb-Umbach, R. (2025). Synthesizing Speech with Selected Perceptual Voice Qualities – A Case Study with Creaky Voice. Interspeech 2025. Interspeech, Rotterdam. https://doi.org/10.21437/Interspeech.2025-1443

2024 | Preprint | LibreCat-ID: 56273

Cornell, S., Park, T., Huang, S., Boeddeker, C., Chang, X., Maciejewski, M., Wiesner, M., Garcia, P., & Watanabe, S. (2024). The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization. In arXiv:2407.16447.

2024 | Journal Article | LibreCat-ID: 52958

Boeddeker, C., Subramanian, A. S., Wichern, G., Haeb-Umbach, R., & Le Roux, J. (2024). TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 32, 1185–1197. https://doi.org/10.1109/taslp.2024.3350887

2024 | Conference Paper | LibreCat-ID: 57160

Werning, A., & Haeb-Umbach, R. (2024). Target-Specific Dataset Pruning for Compression of Audio Tagging Models. 32nd European Signal Processing Conference (EUSIPCO 2024). 32nd European Signal Processing Conference, Lyon.

2024 | Conference Paper | LibreCat-ID: 57031

Gburrek, T., Meise, A. T., Schmalenstroeer, J., & Haeb-Umbach, R. (2024). Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation Models. 2024 18th International Workshop on Acoustic Signal Enhancement (IWAENC). https://doi.org/10.1109/iwaenc61483.2024.10694103

2024 | Report | LibreCat-ID: 57161

Werning, A., & Haeb-Umbach, R. (2024). UPB-NT submission to DCASE24: Dataset pruning for targeted knowledge distillation.

2024 | Conference Paper | LibreCat-ID: 57099

Xie, Y., Kuhlmann, M., Rautenberg, F., Tan, Z.-H., & Häb-Umbach, R. (2024). Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder. 2024 32nd European Signal Processing Conference (EUSIPCO), 436–440.

2024 | Conference Paper | LibreCat-ID: 56004

von Neumann, T., Boeddeker, C., Cord-Landwehr, T., Delcroix, M., & Haeb-Umbach, R. (2024). Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization. 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW). https://doi.org/10.1109/icasspw62465.2024.10625894

2024 | Conference Paper | LibreCat-ID: 56272

Boeddeker, C., Cord-Landwehr, T., & Haeb-Umbach, R. (2024). Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment. Interspeech 2024. https://doi.org/10.21437/interspeech.2024-1286

2024 | Conference Paper | LibreCat-ID: 57659

Vieting, P., Berger, S., von Neumann, T., Boeddeker, C., Schlüter, R., & Haeb-Umbach, R. (2024). Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription. 2024 IEEE Spoken Language Technology Workshop (SLT).

2024 | Conference Paper | LibreCat-ID: 57085

Cord-Landwehr, T., Boeddeker, C., & Haeb-Umbach, R. (2024). Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models. ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Hyderabad, India. https://doi.org/10.1109/ICASSP49660.2025.10888445

2024 | Conference Paper | LibreCat-ID: 53659

Cord-Landwehr, T., Boeddeker, C., Zorilă, C., Doddipatla, R., & Haeb-Umbach, R. (2024). Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul. https://doi.org/10.1109/icassp48485.2024.10445911

2023 | Conference Paper | LibreCat-ID: 48269

Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2023). On the Integration of Sampling Rate Synchronization and Acoustic Beamforming. European Signal Processing Conference (EUSIPCO). European Signal Processing Conference (EUSIPCO), Helsinki.

Publications at Paderborn University