LibreCat – Publication List Manager

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

326 Publications

2026 | Conference Paper | LibreCat-ID: 65606 |

Loose Coupling of Spectral and Spatial Models for Multi-Channel Diarization and Enhancement of Meetings in Dynamic Environments
A.T. Meise, T. Cord-Landwehr, C. Boeddeker, M. Delcroix, T. Nakatani, R. Haeb-Umbach, in: ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2026.

LibreCat | DOI | Download (ext.) | arXiv

2025 | Conference Paper | LibreCat-ID: 59999

Speech Synthesis along Perceptual Voice Quality Dimensions
F. Rautenberg, M. Kuhlmann, F. Seebauer, J. Wiechmann, P. Wagner, R. Haeb-Umbach, in: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2025.

LibreCat | DOI

2025 | Conference Paper | LibreCat-ID: 61079 |

Spatio-spectral diarization of meetings by combining TDOA-based segmentation and speaker embedding-based clustering
T. Cord-Landwehr, T. Gburrek, M. Deegen, R. Haeb-Umbach, in: Proceedings of INTERSPEECH, 2025.

LibreCat | Files available | DOI | arXiv

2025 | Conference Paper | LibreCat-ID: 62164

Towards Frame-level Quality Predictions of Synthetic Speech
M. Kuhlmann, F. Seebauer, P. Wagner, R. Häb-Umbach, in: Interspeech 2025, ISCA, 2025.

LibreCat | DOI

2025 | Conference Paper | LibreCat-ID: 62163

A Fully Zero-Shot Approach to Obtaining Specialized and Compact Audio Tagging Models
A. Werning, R. Häb-Umbach, in: S. Möller, T. Gerkmann, D. Kolossa (Eds.), Proceedings of the 16th ITG Conference on Speech Communication, Berlin, 2025, pp. 76–80.

LibreCat

2025 | Conference Paper | LibreCat-ID: 59900

Distilling Efficient Audio Models using Data Pruning with CLAP
A. Werning, R. Häb-Umbach, in: Deutsche Gesellschaft für Akustik e.V. (DEGA), Berlin, 2025 (Ed.), Proceedings of DAS|DAGA 2025, Copenhagen, 2025.

LibreCat

2025 | Conference Paper | LibreCat-ID: 62174

On the Application of Diffusion Models for Simultaneous Denoising and Dereverberation
A.T. Meise, T. Cord-Landwehr, R. Haeb-Umbach, in: ITG Conference on Speech Communication, 2025.

LibreCat

2025 | Conference Paper | LibreCat-ID: 61047

Synthesizing Speech with Selected Perceptual Voice Qualities – A Case Study with Creaky Voice
F. Rautenberg, F. Seebauer, J. Wiechmann, M. Kuhlmann, P. Wagner, R. Haeb-Umbach, in: Interspeech 2025, ISCA, 2025.

LibreCat | DOI

2024 | Journal Article | LibreCat-ID: 52958 |

TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings
C. Boeddeker, A.S. Subramanian, G. Wichern, R. Haeb-Umbach, J. Le Roux, IEEE/ACM Transactions on Audio, Speech, and Language Processing 32 (2024) 1185–1197.

LibreCat | Files available | DOI | Download (ext.)

2024 | Conference Paper | LibreCat-ID: 57160

Target-Specific Dataset Pruning for Compression of Audio Tagging Models
A. Werning, R. Haeb-Umbach, in: 32nd European Signal Processing Conference (EUSIPCO 2024), 2024.

LibreCat | Files available

2024 | Conference Paper | LibreCat-ID: 57031 |

Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation Models
T. Gburrek, A.T. Meise, J. Schmalenstroeer, R. Haeb-Umbach, in: 2024 18th International Workshop on Acoustic Signal Enhancement (IWAENC), IEEE, 2024.

LibreCat | Files available | DOI

2024 | Report | LibreCat-ID: 57161

UPB-NT submission to DCASE24: Dataset pruning for targeted knowledge distillation
A. Werning, R. Haeb-Umbach, UPB-NT Submission to DCASE24: Dataset Pruning for Targeted Knowledge Distillation, 2024.

LibreCat

2024 | Conference Paper | LibreCat-ID: 57099

Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder
Y. Xie, M. Kuhlmann, F. Rautenberg, Z.-H. Tan, R. Häb-Umbach, in: 2024 32nd European Signal Processing Conference (EUSIPCO), 2024, pp. 436–440.

LibreCat

2024 | Conference Paper | LibreCat-ID: 56004 |

Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization
T. von Neumann, C. Boeddeker, T. Cord-Landwehr, M. Delcroix, R. Haeb-Umbach, in: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW), IEEE, 2024.

LibreCat | Files available | DOI

2024 | Conference Paper | LibreCat-ID: 56272 |

Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment
C. Boeddeker, T. Cord-Landwehr, R. Haeb-Umbach, in: Interspeech 2024, ISCA, 2024.

LibreCat | DOI | Download (ext.)

2024 | Conference Paper | LibreCat-ID: 57659 |

Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription
P. Vieting, S. Berger, T. von Neumann, C. Boeddeker, R. Schlüter, R. Haeb-Umbach, in: 2024 IEEE Spoken Language Technology Workshop (SLT), 2024.

LibreCat | Download (ext.)

2024 | Conference Paper | LibreCat-ID: 57085 |

Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models
T. Cord-Landwehr, C. Boeddeker, R. Haeb-Umbach, in: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024.

LibreCat | Files available | DOI | Download (ext.)

2024 | Conference Paper | LibreCat-ID: 53659

Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, R. Haeb-Umbach, in: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2024.

LibreCat | Files available | DOI

2023 | Conference Paper | LibreCat-ID: 48269 |

On the Integration of Sampling Rate Synchronization and Acoustic Beamforming
T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, in: European Signal Processing Conference (EUSIPCO), 2023.

LibreCat | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 48270 |

LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices
J. Schmalenstroeer, T. Gburrek, R. Haeb-Umbach, in: ITG Conference on Speech Communication, 2023.

LibreCat | Files available

Publications at Paderborn University

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

326 Publications

Filters and Search Terms

Search

Filter Publications

Display / Sort

Export / Embed

Publications at Paderborn University

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

326 Publications

Filters and Search Terms

Search

Filter Publications

Display / Sort

Export / Embed

Export Options