331 Publications


2024 | Preprint | LibreCat-ID: 56273 | OA
Cornell, S., Park, T., Huang, S., Boeddeker, C., Chang, X., Maciejewski, M., Wiesner, M., Garcia, P., & Watanabe, S. (2024). The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization. In arXiv:2407.16447.

2024 | Conference Paper | LibreCat-ID: 57031 | OA
Gburrek, T., Meise, A., Schmalenstroeer, J., & Haeb-Umbach, R. (2024). Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation Models. 2024 18th International Workshop on Acoustic Signal Enhancement (IWAENC). https://doi.org/10.1109/iwaenc61483.2024.10694103

2024 | Report | LibreCat-ID: 57161
Werning, A., & Haeb-Umbach, R. (2024). UPB-NT submission to DCASE24: Dataset pruning for targeted knowledge distillation.

2024 | Conference Paper | LibreCat-ID: 57160
Werning, A., & Haeb-Umbach, R. (2024). Target-Specific Dataset Pruning for Compression of Audio Tagging Models. 32nd European Signal Processing Conference (EUSIPCO 2024). 32nd European Signal Processing Conference, Lyon.

2024 | Conference Paper | LibreCat-ID: 57099
Xie, Y., Kuhlmann, M., Rautenberg, F., Tan, Z.-H., & Häb-Umbach, R. (2024). Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder. 2024 32nd European Signal Processing Conference (EUSIPCO), 436–440.

2024 | Conference Paper | LibreCat-ID: 56004 | OA
von Neumann, T., Boeddeker, C., Cord-Landwehr, T., Delcroix, M., & Haeb-Umbach, R. (2024). Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization. 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW). https://doi.org/10.1109/icasspw62465.2024.10625894

2024 | Journal Article | LibreCat-ID: 52958 | OA
Boeddeker, C., Subramanian, A. S., Wichern, G., Haeb-Umbach, R., & Le Roux, J. (2024). TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 32, 1185–1197. https://doi.org/10.1109/taslp.2024.3350887

2024 | Conference Paper | LibreCat-ID: 53659
Cord-Landwehr, T., Boeddeker, C., Zorilă, C., Doddipatla, R., & Haeb-Umbach, R. (2024). Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul. https://doi.org/10.1109/icassp48485.2024.10445911

2024 | Preprint | LibreCat-ID: 57085 | OA
Cord-Landwehr, T., Boeddeker, C., & Haeb-Umbach, R. (2024). Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models.

2024 | Conference Paper | LibreCat-ID: 56272 | OA
Boeddeker, C., Cord-Landwehr, T., & Haeb-Umbach, R. (2024). Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment. Interspeech 2024. https://doi.org/10.21437/interspeech.2024-1286

2024 | Conference Paper | LibreCat-ID: 57659 | OA
Vieting, P., Berger, S., von Neumann, T., Boeddeker, C., Schlüter, R., & Haeb-Umbach, R. (2024). Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription. 2024 IEEE Spoken Language Technology Workshop (SLT).

2023 | Conference Paper | LibreCat-ID: 48269 | OA
Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2023). On the Integration of Sampling Rate Synchronization and Acoustic Beamforming. European Signal Processing Conference (EUSIPCO). European Signal Processing Conference (EUSIPCO), Helsinki.

2023 | Conference Paper | LibreCat-ID: 48270 | OA
Schmalenstroeer, J., Gburrek, T., & Haeb-Umbach, R. (2023). LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices. ITG Conference on Speech Communication. ITG Conference on Speech Communication, Aachen.

2023 | Conference Paper | LibreCat-ID: 48355 | OA
Rautenberg, F., Kuhlmann, M., Wiechmann, J., Seebauer, F., Wagner, P., & Haeb-Umbach, R. (2023). On Feature Importance and Interpretability of Speaker Representations. ITG Conference on Speech Communication. ITG Conference on Speech Communication, Aachen.

2023 | Conference Paper | LibreCat-ID: 48410 | OA
Wiechmann, J., Rautenberg, F., Wagner, P., & Haeb-Umbach, R. (2023). Explaining voice characteristics to novice voice practitioners-How successful is it? 20th International Congress of the Phonetic Sciences (ICPhS).

2023 | Conference Paper | LibreCat-ID: 48391
Aralikatti, R., Boeddeker, C., Wichern, G., Subramanian, A., & Le Roux, J. (2023). Reverberation as Supervision For Speech Separation. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp49357.2023.10095022

2023 | Conference Paper | LibreCat-ID: 46069
Seebauer, F., Kuhlmann, M., Haeb-Umbach, R., & Wagner, P. (2023). Re-examining the quality dimensions of synthetic speech. 12th Speech Synthesis Workshop (SSW) 2023.

2023 | Journal Article | LibreCat-ID: 35602 | OA
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31, 576–589. https://doi.org/10.1109/taslp.2022.3228629

2023 | Conference Paper | LibreCat-ID: 49109 | OA
Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2023). Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks. Proc. Asilomar Conference on Signals, Systems, and Computers. 57th Asilomar Conference on Signals, Systems, and Computers.

2023 | Conference Paper | LibreCat-ID: 44849 | OA
Rautenberg, F., Kuhlmann, M., Ebbers, J., Wiechmann, J., Seebauer, F., Wagner, P., & Haeb-Umbach, R. (2023). Speech Disentanglement for Analysis and Modification of Acoustic and Perceptual Speaker Characteristics. Fortschritte Der Akustik - DAGA 2023, 1409–1412.
