LibreCat – Publication List Manager

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

339 Publications

2026 | Conference Paper | LibreCat-ID: 65606 |

Meise AT, Cord-Landwehr T, Boeddeker C, Delcroix M, Nakatani T, Haeb-Umbach R. Loose Coupling of Spectral and Spatial Models for Multi-Channel Diarization and Enhancement of Meetings in Dynamic Environments. In: ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2026. doi:10.1109/icassp55912.2026.11463540

LibreCat | DOI | Download (ext.) | arXiv

2025 | Conference Paper | LibreCat-ID: 59999

Rautenberg F, Kuhlmann M, Seebauer F, Wiechmann J, Wagner P, Haeb-Umbach R. Speech Synthesis along Perceptual Voice Quality Dimensions. In: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2025. doi:10.1109/icassp49660.2025.10888012

LibreCat | DOI

2025 | Conference Paper | LibreCat-ID: 61079 |

Cord-Landwehr T, Gburrek T, Deegen M, Haeb-Umbach R. Spatio-spectral diarization of meetings by combining TDOA-based segmentation and speaker embedding-based clustering. In: Proceedings of INTERSPEECH. ; 2025. doi:10.21437/Interspeech.2025-1663

LibreCat | Files available | DOI | arXiv

2025 | Conference Paper | LibreCat-ID: 62164

Kuhlmann M, Seebauer F, Wagner P, Häb-Umbach R. Towards Frame-level Quality Predictions of Synthetic Speech. In: Interspeech 2025. ISCA; 2025. doi:10.21437/interspeech.2025-2190

LibreCat | DOI

2025 | Conference Paper | LibreCat-ID: 62163

Werning A, Häb-Umbach R. A Fully Zero-Shot Approach to Obtaining Specialized and Compact Audio Tagging Models. In: Möller S, Gerkmann T, Kolossa D, eds. Proceedings of the 16th ITG Conference on Speech Communication. ; 2025:76-80.

LibreCat

2025 | Conference Paper | LibreCat-ID: 59900

Werning A, Häb-Umbach R. Distilling Efficient Audio Models using Data Pruning with CLAP. In: Deutsche Gesellschaft für Akustik e.V. (DEGA), Berlin, 2025, ed. Proceedings of DAS|DAGA 2025. ; 2025.

LibreCat

2025 | Conference Paper | LibreCat-ID: 62174

Meise AT, Cord-Landwehr T, Haeb-Umbach R. On the Application of Diffusion Models for Simultaneous Denoising and Dereverberation. In: ITG Conference on Speech Communication. ; 2025.

LibreCat

2025 | Conference Paper | LibreCat-ID: 61047

Rautenberg F, Seebauer F, Wiechmann J, Kuhlmann M, Wagner P, Haeb-Umbach R. Synthesizing Speech with Selected Perceptual Voice Qualities – A Case Study with Creaky Voice. In: Interspeech 2025. ISCA; 2025. doi:10.21437/Interspeech.2025-1443

LibreCat | DOI

2024 | Preprint | LibreCat-ID: 56273 |

Cornell S, Park T, Huang S, et al. The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization. arXiv:240716447. Published online 2024.

LibreCat | Download (ext.) | arXiv

2024 | Journal Article | LibreCat-ID: 52958 |

Boeddeker C, Subramanian AS, Wichern G, Haeb-Umbach R, Le Roux J. TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2024;32:1185-1197. doi:10.1109/taslp.2024.3350887

LibreCat | Files available | DOI | Download (ext.)

2024 | Conference Paper | LibreCat-ID: 57160

Werning A, Haeb-Umbach R. Target-Specific Dataset Pruning for Compression of Audio Tagging Models. In: 32nd European Signal Processing Conference (EUSIPCO 2024). ; 2024.

LibreCat | Files available

2024 | Conference Paper | LibreCat-ID: 57031 |

Gburrek T, Meise AT, Schmalenstroeer J, Haeb-Umbach R. Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation Models. In: 2024 18th International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE; 2024. doi:10.1109/iwaenc61483.2024.10694103

LibreCat | Files available | DOI

2024 | Report | LibreCat-ID: 57161

Werning A, Haeb-Umbach R. UPB-NT Submission to DCASE24: Dataset Pruning for Targeted Knowledge Distillation.; 2024.

LibreCat

2024 | Conference Paper | LibreCat-ID: 57099

Xie Y, Kuhlmann M, Rautenberg F, Tan Z-H, Häb-Umbach R. Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder. In: 2024 32nd European Signal Processing Conference (EUSIPCO). ; 2024:436–440.

LibreCat

2024 | Conference Paper | LibreCat-ID: 56004 |

von Neumann T, Boeddeker C, Cord-Landwehr T, Delcroix M, Haeb-Umbach R. Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization. In: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW). IEEE; 2024. doi:10.1109/icasspw62465.2024.10625894

LibreCat | Files available | DOI

2024 | Conference Paper | LibreCat-ID: 56272 |

Boeddeker C, Cord-Landwehr T, Haeb-Umbach R. Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment. In: Interspeech 2024. ISCA; 2024. doi:10.21437/interspeech.2024-1286

LibreCat | DOI | Download (ext.)

2024 | Conference Paper | LibreCat-ID: 57659 |

Vieting P, Berger S, von Neumann T, Boeddeker C, Schlüter R, Haeb-Umbach R. Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription. In: 2024 IEEE Spoken Language Technology Workshop (SLT). ; 2024.

LibreCat | Download (ext.)

2024 | Conference Paper | LibreCat-ID: 57085 |

Cord-Landwehr T, Boeddeker C, Haeb-Umbach R. Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models. In: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2024. doi:10.1109/ICASSP49660.2025.10888445

LibreCat | Files available | DOI | Download (ext.)

2024 | Conference Paper | LibreCat-ID: 53659

Cord-Landwehr T, Boeddeker C, Zorilă C, Doddipatla R, Haeb-Umbach R. Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2024. doi:10.1109/icassp48485.2024.10445911

LibreCat | Files available | DOI

2023 | Conference Paper | LibreCat-ID: 48269 |

Gburrek T, Schmalenstroeer J, Haeb-Umbach R. On the Integration of Sampling Rate Synchronization and Acoustic Beamforming. In: European Signal Processing Conference (EUSIPCO). ; 2023.

LibreCat | Download (ext.)

Publications at Paderborn University

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

339 Publications

Filters and Search Terms

Search

Filter Publications

Display / Sort

Export / Embed

Publications at Paderborn University

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

339 Publications

Filters and Search Terms

Search

Filter Publications

Display / Sort

Export / Embed

Export Options