49 Publications

[49]
2024 | Preprint | LibreCat-ID: 56273 | OA
S. Cornell et al., “The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization,” arXiv:2407.16447. 2024.
 
[48]
2024 | Conference Paper | LibreCat-ID: 56004 | OA
T. von Neumann, C. Boeddeker, T. Cord-Landwehr, M. Delcroix, and R. Haeb-Umbach, “Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization,” 2024, doi: 10.1109/icasspw62465.2024.10625894.
 
[47]
2024 | Conference Paper | LibreCat-ID: 53659
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios,” presented at the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul, 2024, doi: 10.1109/icassp48485.2024.10445911.
 
[46]
2024 | Conference Paper | LibreCat-ID: 56272 | OA
C. Boeddeker, T. Cord-Landwehr, and R. Haeb-Umbach, “Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment,” 2024, doi: 10.21437/interspeech.2024-1286.
 
[45]
2024 | Conference Paper | LibreCat-ID: 57659 | OA
P. Vieting, S. Berger, T. von Neumann, C. Boeddeker, R. Schlüter, and R. Haeb-Umbach, “Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription,” 2024.
 
[44]
2024 | Journal Article | LibreCat-ID: 52958 | OA
C. Boeddeker, A. S. Subramanian, G. Wichern, R. Haeb-Umbach, and J. Le Roux, “TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 1185–1197, 2024, doi: 10.1109/taslp.2024.3350887.
 
[43]
2024 | Conference Paper | LibreCat-ID: 57085 | OA
T. Cord-Landwehr, C. Boeddeker, and R. Haeb-Umbach, “Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models,” presented at the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Hyderabad, India, 2024, doi: 10.1109/ICASSP49660.2025.10888445.
 
[42]
2023 | Conference Paper | LibreCat-ID: 48391
R. Aralikatti, C. Boeddeker, G. Wichern, A. Subramanian, and J. Le Roux, “Reverberation as Supervision For Speech Separation,” 2023, doi: 10.1109/icassp49357.2023.10095022.
 
[41]
2023 | Journal Article | LibreCat-ID: 35602 | OA
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 576–589, 2023, doi: 10.1109/taslp.2022.3228629.
 
[40]
2023 | Conference Paper | LibreCat-ID: 48281 | OA
T. von Neumann, C. Boeddeker, K. Kinoshita, M. Delcroix, and R. Haeb-Umbach, “On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems,” 2023, doi: 10.1109/icassp49357.2023.10094784.
 
[39]
2023 | Conference Paper | LibreCat-ID: 48275 | OA
T. von Neumann, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems,” presented at the CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin, 2023.
 
[38]
2023 | Conference Paper | LibreCat-ID: 47128 | OA
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization,” presented at the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes, 2023, doi: 10.1109/icassp49357.2023.10095370.
 
[37]
2023 | Conference Paper | LibreCat-ID: 47129 | OA
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures,” 2023, doi: 10.21437/interspeech.2023-1379.
 
[36]
2023 | Conference Paper | LibreCat-ID: 54439 | OA
C. Boeddeker, T. Cord-Landwehr, T. von Neumann, and R. Haeb-Umbach, “Multi-stage diarization refinement for the CHiME-7 DASR scenario,” 2023, doi: 10.21437/chime.2023-10.
 
[35]
2023 | Conference Paper | LibreCat-ID: 48390 | OA
S. Berger, P. Vieting, C. Boeddeker, R. Schlüter, and R. Haeb-Umbach, “Mixture Encoder for Joint Speech Separation and Recognition,” 2023, doi: 10.21437/interspeech.2023-1815.
 
[34]
2022 | Journal Article | LibreCat-ID: 33669 | OA
W. Zhang, X. Chang, C. Boeddeker, T. Nakatani, S. Watanabe, and Y. Qian, “End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022, doi: 10.1109/TASLP.2022.3209942.
 
[33]
2022 | Conference Paper | LibreCat-ID: 33847 | OA
T. Cord-Landwehr, T. von Neumann, C. Boeddeker, and R. Haeb-Umbach, “MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator,” presented at the 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg, 2022.
 
[32]
2022 | Conference Paper | LibreCat-ID: 33848 | OA
T. Cord-Landwehr, C. Boeddeker, T. von Neumann, C. Zorila, R. Doddipatla, and R. Haeb-Umbach, “Monaural source separation: From anechoic to reverberant environments,” presented at the 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.
 
[31]
2022 | Conference Paper | LibreCat-ID: 33819 | OA
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “SA-SDR: A Novel Loss Function for Separation of Meeting Style Data,” 2022, doi: 10.1109/icassp43922.2022.9746757.
 
[30]
2022 | Misc | LibreCat-ID: 33816 | OA
T. Gburrek, C. Boeddeker, T. von Neumann, T. Cord-Landwehr, J. Schmalenstroeer, and R. Haeb-Umbach, A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv, 2022.
 
[29]
2022 | Conference Paper | LibreCat-ID: 33954 | OA
C. Boeddeker, T. Cord-Landwehr, T. von Neumann, and R. Haeb-Umbach, “An Initialization Scheme for Meeting Separation with Spatial Mixture Models,” 2022, doi: 10.21437/interspeech.2022-10929.
 
[28]
2022 | Conference Paper | LibreCat-ID: 33958
K. Kinoshita, T. von Neumann, M. Delcroix, C. Boeddeker, and R. Haeb-Umbach, “Utterance-by-utterance overlap-aware neural diarization with Graph-PIT,” in Proc. Interspeech 2022, 2022, pp. 1486–1490, doi: 10.21437/Interspeech.2022-11408.
 
[27]
2021 | Conference Paper | LibreCat-ID: 28256
W. Zhang et al., “End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend,” 2021, doi: 10.1109/icassp39728.2021.9414464.
 
[26]
2021 | Conference Paper | LibreCat-ID: 28262
C. Li et al., “ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration,” 2021, doi: 10.1109/slt48900.2021.9383615.
 
[25]
2021 | Conference Paper | LibreCat-ID: 28261
C. Li et al., “Dual-Path RNN for Long Recording Speech Separation,” 2021, doi: 10.1109/slt48900.2021.9383514.
 
[24]
2021 | Conference Paper | LibreCat-ID: 44843 | OA
C. Boeddeker, F. Rautenberg, and R. Haeb-Umbach, “A Comparison and Combination of Unsupervised Blind Source Separation Techniques,” presented at the ITG Conference on Speech Communication, Kiel, 2021.
 
[23]
2021 | Conference Paper | LibreCat-ID: 28259 | OA
C. Boeddeker et al., “Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation,” 2021, doi: 10.1109/icassp39728.2021.9414661.
 
[22]
2021 | Conference Paper | LibreCat-ID: 26770 | OA
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers,” presented at the Interspeech, 2021, doi: 10.21437/interspeech.2021-1177.
 
[21]
2021 | Conference Paper | LibreCat-ID: 29173 | OA
T. von Neumann, C. Boeddeker, K. Kinoshita, M. Delcroix, and R. Haeb-Umbach, “Speeding Up Permutation Invariant Training for Source Separation,” presented at the Speech Communication; 14th ITG Conference, Kiel, 2021.
 
[20]
2020 | Conference Paper | LibreCat-ID: 20700 | OA
C. Boeddeker et al., “Towards a speaker diarization system for the CHiME 2020 dinner party transcription,” in Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments, 2020.
 
[19]
2020 | Journal Article | LibreCat-ID: 17598 | OA
T. Nakatani, C. Boeddeker, K. Kinoshita, R. Ikeshita, M. Delcroix, and R. Haeb-Umbach, “Jointly optimal denoising, dereverberation, and source separation,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, pp. 1–1, 2020, doi: 10.1109/TASLP.2020.3013118.
 
[18]
2020 | Conference Paper | LibreCat-ID: 20504
J. Heitkaemper, D. Jakobeit, C. Boeddeker, L. Drude, and R. Haeb-Umbach, “Demystifying TasNet: A Dissecting Approach,” 2020.
 
[17]
2020 | Preprint | LibreCat-ID: 28263
S. Watanabe et al., “CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings,” arXiv:2004.09249. 2020.
 
[16]
2020 | Conference Paper | LibreCat-ID: 20762 | OA
T. von Neumann et al., “End-to-End Training of Time Domain Audio Separation and Recognition,” in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp. 7004–7008, doi: 10.1109/ICASSP40776.2020.9053461.
 
[15]
2020 | Conference Paper | LibreCat-ID: 20764 | OA
T. von Neumann et al., “Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR,” in Proc. Interspeech 2020, 2020, pp. 3097–3101, doi: 10.21437/Interspeech.2020-2519.
 
[14]
2020 | Conference Paper | LibreCat-ID: 20695 | OA
C. Boeddeker, T. Nakatani, K. Kinoshita, and R. Haeb-Umbach, “Jointly Optimal Dereverberation and Beamforming,” 2020, doi: 10.1109/icassp40776.2020.9054393.
 
[13]
2019 | Journal Article | LibreCat-ID: 19446 | OA
L. Drude, J. Heitkaemper, C. Boeddeker, and R. Haeb-Umbach, “SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition,” ArXiv e-prints, 2019.
 
[12]
2019 | Conference Paper | LibreCat-ID: 15816 | OA
C. Zorila, C. Boeddeker, R. Doddipatla, and R. Haeb-Umbach, “An Investigation Into the Effectiveness of Enhancement in ASR Training and Test for CHiME-5 Dinner Party Transcription,” in ASRU 2019, Sentosa, Singapore, 2019.
 
[11]
2019 | Conference Paper | LibreCat-ID: 14826 | OA
N. Kanda, C. Boeddeker, J. Heitkaemper, Y. Fujita, S. Horiguchi, and R. Haeb-Umbach, “Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR,” in INTERSPEECH 2019, Graz, Austria, 2019.
 
[10]
2018 | Conference Paper | LibreCat-ID: 11872 | OA
L. Drude et al., “Integrating neural network based beamforming and weighted prediction error dereverberation,” in INTERSPEECH 2018, Hyderabad, India, 2018.
 
[9]
2018 | Conference Paper | LibreCat-ID: 11873 | OA
L. Drude, J. Heymann, C. Boeddeker, and R. Haeb-Umbach, “NARA-WPE: A Python package for weighted prediction error dereverberation in Numpy and Tensorflow for online and offline processing,” in ITG 2018, Oldenburg, Germany, 2018.
 
[8]
2018 | Conference Paper | LibreCat-ID: 12901 | OA
C. Boeddeker, H. Erdogan, T. Yoshioka, and R. Haeb-Umbach, “Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition,” in ICASSP 2018, Calgary, Canada, 2018.
 
[7]
2018 | Conference Paper | LibreCat-ID: 12899 | OA
C. Boeddeker, J. Heitkaemper, J. Schmalenstroeer, L. Drude, J. Heymann, and R. Haeb-Umbach, “Front-End Processing for the CHiME-5 Dinner Party Scenario,” 2018.
 
[6]
2018 | Conference Paper | LibreCat-ID: 11876 | OA
M. Kitza et al., “The RWTH/UPB System Combination for the CHiME 2018 Workshop,” 2018.
 
[5]
2017 | Report | LibreCat-ID: 11735 | OA
C. Boeddeker, P. Hanebrink, L. Drude, J. Heymann, and R. Haeb-Umbach, On the Computation of Complex-valued Gradients with Application to Statistically Optimum Beamforming. 2017.
 
[4]
2017 | Conference Paper | LibreCat-ID: 11736 | OA
C. Boeddeker, P. Hanebrink, L. Drude, J. Heymann, and R. Haeb-Umbach, “Optimizing Neural-Network Supported Acoustic Beamforming by Algorithmic Differentiation,” in Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2017.
 
[3]
2017 | Conference Paper | LibreCat-ID: 11809 | OA
J. Heymann, L. Drude, C. Boeddeker, P. Hanebrink, and R. Haeb-Umbach, “BEAMNET: End-to-End Training of a Beamformer-Supported Multi-Channel ASR System,” in Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2017.
 
[2]
2017 | Conference Paper | LibreCat-ID: 11895 | OA
J. Schmalenstroeer, J. Heymann, L. Drude, C. Boeddeker, and R. Haeb-Umbach, “Multi-Stage Coherence Drift Based Sampling Rate Synchronization for Acoustic Beamforming,” 2017.
 
[1]
2016 | Conference Paper | LibreCat-ID: 11751 | OA
L. Drude, C. Boeddeker, and R. Haeb-Umbach, “Blind Speech Separation based on Complex Spherical k-Mode Clustering,” in Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2016.
 

