LibreCat – Publication List Manager

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

339 Publications

2026 | Conference Paper | LibreCat-ID: 65606

A. T. Meise, T. Cord-Landwehr, C. Boeddeker, M. Delcroix, T. Nakatani, and R. Haeb-Umbach, “Loose Coupling of Spectral and Spatial Models for Multi-Channel Diarization and Enhancement of Meetings in Dynamic Environments,” presented at the 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, 2026, doi: 10.1109/icassp55912.2026.11463540.

LibreCat | DOI | Download (ext.) | arXiv

2025 | Conference Paper | LibreCat-ID: 59999

F. Rautenberg, M. Kuhlmann, F. Seebauer, J. Wiechmann, P. Wagner, and R. Haeb-Umbach, “Speech Synthesis along Perceptual Voice Quality Dimensions,” presented at the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Hyderabad, India, 2025, doi: 10.1109/icassp49660.2025.10888012.

LibreCat | DOI

2025 | Conference Paper | LibreCat-ID: 61079

T. Cord-Landwehr, T. Gburrek, M. Deegen, and R. Haeb-Umbach, “Spatio-spectral diarization of meetings by combining TDOA-based segmentation and speaker embedding-based clustering,” presented at Interspeech 2025, Rotterdam, 2025, doi: 10.21437/Interspeech.2025-1663.

LibreCat | Files available | DOI | arXiv

2025 | Conference Paper | LibreCat-ID: 62164

M. Kuhlmann, F. Seebauer, P. Wagner, and R. Häb-Umbach, “Towards Frame-level Quality Predictions of Synthetic Speech,” 2025, doi: 10.21437/interspeech.2025-2190.

LibreCat | DOI

2025 | Conference Paper | LibreCat-ID: 62163

A. Werning and R. Häb-Umbach, “A Fully Zero-Shot Approach to Obtaining Specialized and Compact Audio Tagging Models,” in Proceedings of the 16th ITG Conference on Speech Communication, Berlin, 2025, pp. 76–80.

LibreCat

2025 | Conference Paper | LibreCat-ID: 59900

A. Werning and R. Häb-Umbach, “Distilling Efficient Audio Models using Data Pruning with CLAP,” in Proceedings of DAS|DAGA 2025, Copenhagen, 2025.

LibreCat

2025 | Conference Paper | LibreCat-ID: 62174

A. T. Meise, T. Cord-Landwehr, and R. Haeb-Umbach, “On the Application of Diffusion Models for Simultaneous Denoising and Dereverberation,” presented at the ITG Conference on Speech Communication, Berlin, 2025.

LibreCat

2025 | Conference Paper | LibreCat-ID: 61047

F. Rautenberg, F. Seebauer, J. Wiechmann, M. Kuhlmann, P. Wagner, and R. Haeb-Umbach, “Synthesizing Speech with Selected Perceptual Voice Qualities – A Case Study with Creaky Voice,” presented at Interspeech, Rotterdam, 2025, doi: 10.21437/Interspeech.2025-1443.

LibreCat | DOI

2024 | Preprint | LibreCat-ID: 56273

S. Cornell et al., “The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization,” arXiv:2407.16447, 2024.

LibreCat | Download (ext.) | arXiv

2024 | Journal Article | LibreCat-ID: 52958

C. Boeddeker, A. S. Subramanian, G. Wichern, R. Haeb-Umbach, and J. Le Roux, “TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 1185–1197, 2024, doi: 10.1109/taslp.2024.3350887.

LibreCat | Files available | DOI | Download (ext.)

2024 | Conference Paper | LibreCat-ID: 57160

A. Werning and R. Haeb-Umbach, “Target-Specific Dataset Pruning for Compression of Audio Tagging Models,” presented at the 32nd European Signal Processing Conference, Lyon, 2024.

LibreCat | Files available

2024 | Conference Paper | LibreCat-ID: 57031

T. Gburrek, A. T. Meise, J. Schmalenstroeer, and R. Haeb-Umbach, “Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation Models,” 2024, doi: 10.1109/iwaenc61483.2024.10694103.

LibreCat | Files available | DOI

2024 | Report | LibreCat-ID: 57161

A. Werning and R. Haeb-Umbach, UPB-NT submission to DCASE24: Dataset pruning for targeted knowledge distillation. 2024.

LibreCat

2024 | Conference Paper | LibreCat-ID: 57099

Y. Xie, M. Kuhlmann, F. Rautenberg, Z.-H. Tan, and R. Häb-Umbach, “Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder,” in 2024 32nd European Signal Processing Conference (EUSIPCO), 2024, pp. 436–440.

LibreCat

2024 | Conference Paper | LibreCat-ID: 56004

T. von Neumann, C. Boeddeker, T. Cord-Landwehr, M. Delcroix, and R. Haeb-Umbach, “Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization,” 2024, doi: 10.1109/icasspw62465.2024.10625894.

LibreCat | Files available | DOI

2024 | Conference Paper | LibreCat-ID: 56272

C. Boeddeker, T. Cord-Landwehr, and R. Haeb-Umbach, “Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment,” 2024, doi: 10.21437/interspeech.2024-1286.

LibreCat | DOI | Download (ext.)

2024 | Conference Paper | LibreCat-ID: 57659

P. Vieting, S. Berger, T. von Neumann, C. Boeddeker, R. Schlüter, and R. Haeb-Umbach, “Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription,” 2024.

LibreCat | Download (ext.)

2024 | Conference Paper | LibreCat-ID: 57085

T. Cord-Landwehr, C. Boeddeker, and R. Haeb-Umbach, “Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models,” presented at the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Hyderabad, India, 2024, doi: 10.1109/ICASSP49660.2025.10888445.

LibreCat | Files available | DOI | Download (ext.)

2024 | Conference Paper | LibreCat-ID: 53659

T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios,” presented at the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul, 2024, doi: 10.1109/icassp48485.2024.10445911.

LibreCat | Files available | DOI

2023 | Conference Paper | LibreCat-ID: 48269

T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “On the Integration of Sampling Rate Synchronization and Acoustic Beamforming,” presented at the European Signal Processing Conference (EUSIPCO), Helsinki, 2023.

LibreCat | Download (ext.)

Publications at Paderborn University
