42 Publications


2024 | Journal Article | LibreCat-ID: 52958 | OA
C. Boeddeker, A. S. Subramanian, G. Wichern, R. Haeb-Umbach, and J. Le Roux, “TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 1185–1197, 2024, doi: 10.1109/taslp.2024.3350887.

2024 | Conference Paper | LibreCat-ID: 53659
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios,” presented at the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul, 2024, doi: 10.1109/icassp48485.2024.10445911.

2023 | Conference Paper | LibreCat-ID: 47128 | OA
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization,” presented at the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes, 2023, doi: 10.1109/icassp49357.2023.10095370.

2023 | Conference Paper | LibreCat-ID: 47129 | OA
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures,” 2023, doi: 10.21437/interspeech.2023-1379.

2023 | Conference Paper | LibreCat-ID: 48391
R. Aralikatti, C. Boeddeker, G. Wichern, A. Subramanian, and J. Le Roux, “Reverberation as Supervision For Speech Separation,” 2023, doi: 10.1109/icassp49357.2023.10095022.

2023 | Conference Paper | LibreCat-ID: 48390
S. Berger, P. Vieting, C. Boeddeker, R. Schlüter, and R. Haeb-Umbach, “Mixture Encoder for Joint Speech Separation and Recognition,” 2023, doi: 10.21437/interspeech.2023-1815.

2023 | Journal Article | LibreCat-ID: 35602 | OA
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 576–589, 2023, doi: 10.1109/taslp.2022.3228629.

2023 | Conference Paper | LibreCat-ID: 48281 | OA
T. von Neumann, C. Boeddeker, K. Kinoshita, M. Delcroix, and R. Haeb-Umbach, “On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems,” 2023, doi: 10.1109/icassp49357.2023.10094784.

2023 | Conference Paper | LibreCat-ID: 48275 | OA
T. von Neumann, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems,” presented at the CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin, 2023.

2022 | Journal Article | LibreCat-ID: 33669 | OA
W. Zhang, X. Chang, C. Boeddeker, T. Nakatani, S. Watanabe, and Y. Qian, “End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022, doi: 10.1109/taslp.2022.3209942.
