Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

317 Publications


2024 | Journal Article | LibreCat-ID: 52958 | OA
Boeddeker, C., Subramanian, A. S., Wichern, G., Haeb-Umbach, R., & Le Roux, J. (2024). TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 32, 1185–1197. https://doi.org/10.1109/taslp.2024.3350887
LibreCat | DOI | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 48269 | OA
Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2023). On the Integration of Sampling Rate Synchronization and Acoustic Beamforming. European Signal Processing Conference (EUSIPCO). European Signal Processing Conference (EUSIPCO), Helsinki.
LibreCat | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 47128 | OA
Cord-Landwehr, T., Boeddeker, C., Zorilă, C., Doddipatla, R., & Haeb-Umbach, R. (2023). Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes. https://doi.org/10.1109/icassp49357.2023.10095370
LibreCat | Files available | DOI
 

2023 | Conference Paper | LibreCat-ID: 48270 | OA
Schmalenstroeer, J., Gburrek, T., & Haeb-Umbach, R. (2023). LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices. ITG Conference on Speech Communication. ITG Conference on Speech Communication, Aachen.
LibreCat | Files available
 

2023 | Conference Paper | LibreCat-ID: 47129 | OA
Cord-Landwehr, T., Boeddeker, C., Zorilă, C., Doddipatla, R., & Haeb-Umbach, R. (2023). A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures. INTERSPEECH 2023. https://doi.org/10.21437/interspeech.2023-1379
LibreCat | Files available | DOI
 

2023 | Conference Paper | LibreCat-ID: 48355 | OA
Rautenberg, F., Kuhlmann, M., Wiechmann, J., Seebauer, F., Wagner, P., & Haeb-Umbach, R. (2023). On Feature Importance and Interpretability of Speaker Representations. ITG Conference on Speech Communication. ITG Conference on Speech Communication, Aachen.
LibreCat | Files available | Download (ext.) | arXiv
 

2023 | Conference Paper | LibreCat-ID: 48410 | OA
Wiechmann, J., Rautenberg, F., Wagner, P., & Haeb-Umbach, R. (2023). Explaining voice characteristics to novice voice practitioners-How successful is it? 20th International Congress of the Phonetic Sciences (ICPhS) .
LibreCat | Files available | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 48391
Aralikatti, R., Boeddeker, C., Wichern, G., Subramanian, A., & Le Roux, J. (2023). Reverberation as Supervision For Speech Separation. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp49357.2023.10095022
LibreCat | DOI
 

2023 | Conference Paper | LibreCat-ID: 48390
Berger, S., Vieting, P., Boeddeker, C., Schlüter, R., & Haeb-Umbach, R. (2023). Mixture Encoder for Joint Speech Separation and Recognition. INTERSPEECH 2023. https://doi.org/10.21437/interspeech.2023-1815
LibreCat | DOI
 

2023 | Conference Paper | LibreCat-ID: 46069
Seebauer, F., Kuhlmann, M., Haeb-Umbach, R., & Wagner, P. (2023). Re-examining the quality dimensions of synthetic speech. 12th Speech Synthesis Workshop (SSW) 2023.
LibreCat
 

2023 | Journal Article | LibreCat-ID: 35602 | OA
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31, 576–589. https://doi.org/10.1109/taslp.2022.3228629
LibreCat | Files available | DOI
 

2023 | Conference Paper | LibreCat-ID: 48281 | OA
von Neumann, T., Boeddeker, C., Kinoshita, K., Delcroix, M., & Haeb-Umbach, R. (2023). On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp49357.2023.10094784
LibreCat | Files available | DOI | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 48275 | OA
von Neumann, T., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments. CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin.
LibreCat | Files available | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 49109 | OA
Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2023). Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks. Proc. Asilomar Conference on Signals, Systems, and Computers. 57th Asilomar Conference on Signals, Systems, and Computers.
LibreCat | Files available
 

2023 | Conference Paper | LibreCat-ID: 49111
Ebbers, J., Haeb-Umbach, R., & Serizel, R. (2023). Post-Processing Independent Evaluation of Sound Event Detection Systems. Proceedings of the 8th Detection and Classification of Acoustic Scenes and Events 2023 Workshop (DCASE2023), 36–40.
LibreCat | Files available
 

2023 | Conference Paper | LibreCat-ID: 44849 | OA
Rautenberg, F., Kuhlmann, M., Ebbers, J., Wiechmann, J., Seebauer, F., Wagner, P., & Haeb-Umbach, R. (2023). Speech Disentanglement for Analysis and Modification of Acoustic and Perceptual Speaker Characteristics. Fortschritte Der Akustik - DAGA 2023, 1409–1412.
LibreCat | Files available | Download (ext.)
 

2022 | Journal Article | LibreCat-ID: 33669 | OA
Zhang, W., Chang, X., Boeddeker, C., Nakatani, T., Watanabe, S., & Qian, Y. (2022). End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party. IEEE/ACM Transactions on Audio, Speech, and Language Processing. https://doi.org/10.1109/TASLP.2022.3209942
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 33954 | OA
Boeddeker, C., Cord-Landwehr, T., von Neumann, T., & Haeb-Umbach, R. (2022). An Initialization Scheme for Meeting Separation with Spatial Mixture Models. Interspeech 2022. https://doi.org/10.21437/interspeech.2022-10929
LibreCat | DOI | Download (ext.)
 

2022 | Conference Paper | LibreCat-ID: 33471
Heitkämper, J., Schmalenstroeer, J., & Haeb-Umbach, R. (n.d.). Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels. Proceedings of the 30th European Signal Processing Conference (EUSIPCO). 30th European Signal Processing Conference (EUSIPCO), Belgrad.
LibreCat | Files available
 

2022 | Conference Paper | LibreCat-ID: 33806
Afifi, H., Karl, H., Gburrek, T., & Schmalenstroeer, J. (2022). Data-driven Time Synchronization in Wireless Multimedia Networks. 2022 International Wireless Communications and Mobile Computing (IWCMC). https://doi.org/10.1109/iwcmc55113.2022.9824980
LibreCat | DOI
 

Filters and Search Terms

department=54

Search

Filter Publications

Display / Sort

Citation Style: APA

Export / Embed