Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

43 Publications


2022 | Journal Article | LibreCat-ID: 33669 | OA
W. Zhang, X. Chang, C. Boeddeker, T. Nakatani, S. Watanabe, and Y. Qian, “End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022, doi: 10.1109/TASLP.2022.3209942.
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 33954 | OA
C. Boeddeker, T. Cord-Landwehr, T. von Neumann, and R. Haeb-Umbach, “An Initialization Scheme for Meeting Separation with Spatial Mixture Models,” 2022, doi: 10.21437/interspeech.2022-10929.
LibreCat | DOI | Download (ext.)
 

2022 | Conference Paper | LibreCat-ID: 33958
K. Kinoshita, T. von Neumann, M. Delcroix, C. Boeddeker, and R. Haeb-Umbach, “Utterance-by-utterance overlap-aware neural diarization with Graph-PIT,” in Proc. Interspeech 2022, 2022, pp. 1486–1490, doi: 10.21437/Interspeech.2022-11408.
LibreCat | DOI
 

2022 | Conference Paper | LibreCat-ID: 33819 | OA
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “SA-SDR: A Novel Loss Function for Separation of Meeting Style Data,” 2022, doi: 10.1109/icassp43922.2022.9746757.
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 33847 | OA
T. Cord-Landwehr, T. von Neumann, C. Boeddeker, and R. Haeb-Umbach, “MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator,” presented at the 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg, 2022.
LibreCat | Files available | arXiv
 

2022 | Conference Paper | LibreCat-ID: 33848 | OA
T. Cord-Landwehr, C. Boeddeker, T. von Neumann, C. Zorila, R. Doddipatla, and R. Haeb-Umbach, “Monaural source separation: From anechoic to reverberant environments,” presented at the 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.
LibreCat | Files available | arXiv
 

2022 | Misc | LibreCat-ID: 33816 | OA
T. Gburrek, C. Boeddeker, T. von Neumann, T. Cord-Landwehr, J. Schmalenstroeer, and R. Haeb-Umbach, A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv, 2022.
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 28256
W. Zhang et al., “End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend,” 2021, doi: 10.1109/icassp39728.2021.9414464.
LibreCat | DOI
 

2021 | Conference Paper | LibreCat-ID: 28262
C. Li et al., “ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration,” 2021, doi: 10.1109/slt48900.2021.9383615.
LibreCat | DOI
 

2021 | Conference Paper | LibreCat-ID: 28261
C. Li et al., “Dual-Path RNN for Long Recording Speech Separation,” 2021, doi: 10.1109/slt48900.2021.9383514.
LibreCat | DOI
 

2021 | Conference Paper | LibreCat-ID: 44843 | OA
C. Boeddeker, F. Rautenberg, and R. Haeb-Umbach, “A Comparison and Combination of Unsupervised Blind Source Separation  Techniques,” presented at the ITG Conference on Speech Communication, Kiel, 2021.
LibreCat | Files available | Download (ext.) | arXiv
 

2021 | Conference Paper | LibreCat-ID: 28259 | OA
C. Boeddeker et al., “Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation,” 2021, doi: 10.1109/icassp39728.2021.9414661.
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 26770 | OA
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers,” presented at the Interspeech, 2021, doi: 10.21437/interspeech.2021-1177.
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 29173 | OA
T. von Neumann, C. Boeddeker, K. Kinoshita, M. Delcroix, and R. Haeb-Umbach, “Speeding Up Permutation Invariant Training for Source Separation,” presented at the Speech Communication; 14th ITG Conference, Kiel, 2021.
LibreCat | Files available
 

2020 | Conference Paper | LibreCat-ID: 20700 | OA
C. Boeddeker et al., “Towards a speaker diarization system for the CHiME 2020 dinner party transcription,” in Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments, 2020.
LibreCat | Files available
 

2020 | Journal Article | LibreCat-ID: 17598 | OA
T. Nakatani, C. Boeddeker, K. Kinoshita, R. Ikeshita, M. Delcroix, and R. Haeb-Umbach, “Jointly optimal denoising, dereverberation, and source separation,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, pp. 1–1, 2020, doi: 10.1109/TASLP.2020.3013118.
LibreCat | DOI | Download (ext.)
 

2020 | Conference Paper | LibreCat-ID: 20504
J. Heitkaemper, D. Jakobeit, C. Boeddeker, L. Drude, and R. Haeb-Umbach, “Demystifying TasNet: A Dissecting Approach,” 2020.
LibreCat | Files available
 

2020 | Preprint | LibreCat-ID: 28263
S. Watanabe et al., “CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for  Unsegmented Recordings,” arXiv:2004.09249. 2020.
LibreCat
 

2020 | Conference Paper | LibreCat-ID: 20762 | OA
T. von Neumann et al., “End-to-End Training of Time Domain Audio Separation and Recognition,” in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp. 7004–7008, doi: 10.1109/ICASSP40776.2020.9053461.
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 20764 | OA
T. von Neumann et al., “Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR,” in Proc. Interspeech 2020, 2020, pp. 3097–3101, doi: 10.21437/Interspeech.2020-2519.
LibreCat | Files available | DOI
 

Filters and Search Terms

(person=40767)

status=public

Search

Filter Publications

Display / Sort

Citation Style: IEEE

Export / Embed