333 Publications


2025 | Conference Paper | LibreCat-ID: 59900
Distilling Efficient Audio Models using Data Pruning with CLAP
A. Werning, R. Haeb-Umbach, in: Deutsche Gesellschaft für Akustik e.V. (DEGA), Berlin, 2025 (Ed.), Proceedings of DAS|DAGA 2025, Copenhagen, 2025.

2025 | Conference Paper | LibreCat-ID: 59999
Speech Synthesis along Perceptual Voice Quality Dimensions
F. Rautenberg, M. Kuhlmann, F. Seebauer, J. Wiechmann, P. Wagner, R. Haeb-Umbach, in: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2025.

2024 | Preprint | LibreCat-ID: 56273 | OA
The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization
S. Cornell, T. Park, S. Huang, C. Boeddeker, X. Chang, M. Maciejewski, M. Wiesner, P. Garcia, S. Watanabe, ArXiv:2407.16447 (2024).

2024 | Conference Paper | LibreCat-ID: 57031 | OA
Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation Models
T. Gburrek, A. Meise, J. Schmalenstroeer, R. Haeb-Umbach, in: 2024 18th International Workshop on Acoustic Signal Enhancement (IWAENC), IEEE, 2024.

2024 | Journal Article | LibreCat-ID: 52958 | OA
TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings
C. Boeddeker, A.S. Subramanian, G. Wichern, R. Haeb-Umbach, J. Le Roux, IEEE/ACM Transactions on Audio, Speech, and Language Processing 32 (2024) 1185–1197.

2024 | Conference Paper | LibreCat-ID: 57085 | OA
Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models
T. Cord-Landwehr, C. Boeddeker, R. Haeb-Umbach, in: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024.

2024 | Report | LibreCat-ID: 57161
UPB-NT submission to DCASE24: Dataset pruning for targeted knowledge distillation
A. Werning, R. Haeb-Umbach, UPB-NT Submission to DCASE24: Dataset Pruning for Targeted Knowledge Distillation, 2024.

2024 | Conference Paper | LibreCat-ID: 57160
Target-Specific Dataset Pruning for Compression of Audio Tagging Models
A. Werning, R. Haeb-Umbach, in: 32nd European Signal Processing Conference (EUSIPCO 2024), 2024.

2024 | Conference Paper | LibreCat-ID: 57099
Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder
Y. Xie, M. Kuhlmann, F. Rautenberg, Z.-H. Tan, R. Haeb-Umbach, in: 2024 32nd European Signal Processing Conference (EUSIPCO), 2024, pp. 436–440.

2024 | Conference Paper | LibreCat-ID: 56004 | OA
Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization
T. von Neumann, C. Boeddeker, T. Cord-Landwehr, M. Delcroix, R. Haeb-Umbach, in: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW), IEEE, 2024.
