42 Publications

Mark all

[42]
2024 | Journal Article | LibreCat-ID: 52958 | OA
C. Boeddeker, A. S. Subramanian, G. Wichern, R. Haeb-Umbach, and J. Le Roux, “TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 1185–1197, 2024, doi: 10.1109/taslp.2024.3350887.
LibreCat | DOI | Download (ext.)
 
[41]
2024 | Conference Paper | LibreCat-ID: 53659
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios,” presented at the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul, 2024, doi: 10.1109/icassp48485.2024.10445911.
LibreCat | DOI
 
[40]
2023 | Conference Paper | LibreCat-ID: 47128 | OA
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization,” presented at the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes, 2023, doi: 10.1109/icassp49357.2023.10095370.
LibreCat | Files available | DOI
 
[39]
2023 | Conference Paper | LibreCat-ID: 47129 | OA
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures,” 2023, doi: 10.21437/interspeech.2023-1379.
LibreCat | Files available | DOI
 
[38]
2023 | Conference Paper | LibreCat-ID: 48391
R. Aralikatti, C. Boeddeker, G. Wichern, A. Subramanian, and J. Le Roux, “Reverberation as Supervision For Speech Separation,” 2023, doi: 10.1109/icassp49357.2023.10095022.
LibreCat | DOI
 
[37]
2023 | Conference Paper | LibreCat-ID: 48390
S. Berger, P. Vieting, C. Boeddeker, R. Schlüter, and R. Haeb-Umbach, “Mixture Encoder for Joint Speech Separation and Recognition,” 2023, doi: 10.21437/interspeech.2023-1815.
LibreCat | DOI
 
[36]
2023 | Journal Article | LibreCat-ID: 35602 | OA
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 576–589, 2023, doi: 10.1109/taslp.2022.3228629.
LibreCat | Files available | DOI
 
[35]
2023 | Conference Paper | LibreCat-ID: 48281 | OA
T. von Neumann, C. Boeddeker, K. Kinoshita, M. Delcroix, and R. Haeb-Umbach, “On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems,” 2023, doi: 10.1109/icassp49357.2023.10094784.
LibreCat | Files available | DOI | Download (ext.)
 
[34]
2023 | Conference Paper | LibreCat-ID: 48275 | OA
T. von Neumann, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems,” presented at the CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin, 2023.
LibreCat | Files available | Download (ext.)
 
[33]
2022 | Journal Article | LibreCat-ID: 33669 | OA
W. Zhang, X. Chang, C. Boeddeker, T. Nakatani, S. Watanabe, and Y. Qian, “End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022, doi: 10.1109/TASLP.2022.3209942.
LibreCat | Files available | DOI
 
[32]
2022 | Conference Paper | LibreCat-ID: 33954 | OA
C. Boeddeker, T. Cord-Landwehr, T. von Neumann, and R. Haeb-Umbach, “An Initialization Scheme for Meeting Separation with Spatial Mixture Models,” 2022, doi: 10.21437/interspeech.2022-10929.
LibreCat | DOI | Download (ext.)
 
[31]
2022 | Conference Paper | LibreCat-ID: 33958
K. Kinoshita, T. von Neumann, M. Delcroix, C. Boeddeker, and R. Haeb-Umbach, “Utterance-by-utterance overlap-aware neural diarization with Graph-PIT,” in Proc. Interspeech 2022, 2022, pp. 1486–1490, doi: 10.21437/Interspeech.2022-11408.
LibreCat | DOI
 
[30]
2022 | Conference Paper | LibreCat-ID: 33819 | OA
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “SA-SDR: A Novel Loss Function for Separation of Meeting Style Data,” 2022, doi: 10.1109/icassp43922.2022.9746757.
LibreCat | Files available | DOI
 
[29]
2022 | Conference Paper | LibreCat-ID: 33847 | OA
T. Cord-Landwehr, T. von Neumann, C. Boeddeker, and R. Haeb-Umbach, “MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator,” presented at the 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg, 2022.
LibreCat | Files available | arXiv
 
[28]
2022 | Conference Paper | LibreCat-ID: 33848 | OA
T. Cord-Landwehr, C. Boeddeker, T. von Neumann, C. Zorila, R. Doddipatla, and R. Haeb-Umbach, “Monaural source separation: From anechoic to reverberant environments,” presented at the 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.
LibreCat | Files available | arXiv
 
[27]
2022 | Misc | LibreCat-ID: 33816 | OA
T. Gburrek, C. Boeddeker, T. von Neumann, T. Cord-Landwehr, J. Schmalenstroeer, and R. Haeb-Umbach, A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv, 2022.
LibreCat | Files available | DOI
 
[26]
2021 | Conference Paper | LibreCat-ID: 28256
W. Zhang et al., “End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend,” 2021, doi: 10.1109/icassp39728.2021.9414464.
LibreCat | DOI
 
[25]
2021 | Conference Paper | LibreCat-ID: 28262
C. Li et al., “ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration,” 2021, doi: 10.1109/slt48900.2021.9383615.
LibreCat | DOI
 
[24]
2021 | Conference Paper | LibreCat-ID: 28261
C. Li et al., “Dual-Path RNN for Long Recording Speech Separation,” 2021, doi: 10.1109/slt48900.2021.9383514.
LibreCat | DOI
 
[23]
2021 | Conference Paper | LibreCat-ID: 44843 | OA
C. Boeddeker, F. Rautenberg, and R. Haeb-Umbach, “A Comparison and Combination of Unsupervised Blind Source Separation  Techniques,” presented at the ITG Conference on Speech Communication, Kiel, 2021.
LibreCat | Files available | Download (ext.) | arXiv
 
[22]
2021 | Conference Paper | LibreCat-ID: 28259 | OA
C. Boeddeker et al., “Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation,” 2021, doi: 10.1109/icassp39728.2021.9414661.
LibreCat | Files available | DOI
 
[21]
2021 | Conference Paper | LibreCat-ID: 26770 | OA
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers,” presented at the Interspeech, 2021, doi: 10.21437/interspeech.2021-1177.
LibreCat | Files available | DOI
 
[20]
2021 | Conference Paper | LibreCat-ID: 29173 | OA
T. von Neumann, C. Boeddeker, K. Kinoshita, M. Delcroix, and R. Haeb-Umbach, “Speeding Up Permutation Invariant Training for Source Separation,” presented at the Speech Communication; 14th ITG Conference, Kiel, 2021.
LibreCat | Files available
 
[19]
2020 | Conference Paper | LibreCat-ID: 20700 | OA
C. Boeddeker et al., “Towards a speaker diarization system for the CHiME 2020 dinner party transcription,” in Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments, 2020.
LibreCat | Files available
 
[18]
2020 | Journal Article | LibreCat-ID: 17598 | OA
T. Nakatani, C. Boeddeker, K. Kinoshita, R. Ikeshita, M. Delcroix, and R. Haeb-Umbach, “Jointly optimal denoising, dereverberation, and source separation,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, pp. 1–1, 2020, doi: 10.1109/TASLP.2020.3013118.
LibreCat | DOI | Download (ext.)
 
[17]
2020 | Conference Paper | LibreCat-ID: 20504
J. Heitkaemper, D. Jakobeit, C. Boeddeker, L. Drude, and R. Haeb-Umbach, “Demystifying TasNet: A Dissecting Approach,” 2020.
LibreCat | Files available
 
[16]
2020 | Preprint | LibreCat-ID: 28263
S. Watanabe et al., “CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for  Unsegmented Recordings,” arXiv:2004.09249. 2020.
LibreCat
 
[15]
2020 | Conference Paper | LibreCat-ID: 20762 | OA
T. von Neumann et al., “End-to-End Training of Time Domain Audio Separation and Recognition,” in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp. 7004–7008, doi: 10.1109/ICASSP40776.2020.9053461.
LibreCat | Files available | DOI
 
[14]
2020 | Conference Paper | LibreCat-ID: 20764 | OA
T. von Neumann et al., “Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR,” in Proc. Interspeech 2020, 2020, pp. 3097–3101, doi: 10.21437/Interspeech.2020-2519.
LibreCat | Files available | DOI
 
[13]
2019 | Journal Article | LibreCat-ID: 19446 | OA
L. Drude, J. Heitkaemper, C. Boeddeker, and R. Haeb-Umbach, “SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition,” ArXiv e-prints, 2019.
LibreCat | Files available
 
[12]
2019 | Conference Paper | LibreCat-ID: 15816 | OA
C. Zorila, C. Boeddeker, R. Doddipatla, and R. Haeb-Umbach, “An Investigation Into the Effectiveness of Enhancement in ASR Training and Test for Chime-5 Dinner Party Transcription,” in ASRU 2019, Sentosa, Singapore, 2019.
LibreCat | Files available
 
[11]
2019 | Conference Paper | LibreCat-ID: 14826 | OA
N. Kanda, C. Boeddeker, J. Heitkaemper, Y. Fujita, S. Horiguchi, and R. Haeb-Umbach, “Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR,” in INTERSPEECH 2019, Graz, Austria, 2019.
LibreCat | Files available
 
[10]
2018 | Conference Paper | LibreCat-ID: 11872 | OA
L. Drude et al., “Integration neural network based beamforming and weighted prediction error dereverberation,” in INTERSPEECH 2018, Hyderabad, India, 2018.
LibreCat | Files available | Download (ext.)
 
[9]
2018 | Conference Paper | LibreCat-ID: 11873 | OA
L. Drude, J. Heymann, C. Boeddeker, and R. Haeb-Umbach, “NARA-WPE: A Python package for weighted prediction error dereverberation in Numpy and Tensorflow for online and offline processing,” in ITG 2018, Oldenburg, Germany, 2018.
LibreCat | Files available | Download (ext.)
 
[8]
2018 | Conference Paper | LibreCat-ID: 12901 | OA
C. Boeddeker, H. Erdogan, T. Yoshioka, and R. Haeb-Umbach, “Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition,” in ICASSP 2018, Calgary, Canada, 2018.
LibreCat | Files available | Download (ext.)
 
[7]
2018 | Conference Paper | LibreCat-ID: 12899 | OA
C. Boeddeker, J. Heitkaemper, J. Schmalenstroeer, L. Drude, J. Heymann, and R. Haeb-Umbach, “Front-End Processing for the CHiME-5 Dinner Party Scenario,” 2018.
LibreCat | Files available | Download (ext.)
 
[6]
2018 | Conference Paper | LibreCat-ID: 11876 | OA
M. Kitza et al., “The RWTH/UPB System Combination for the CHiME 2018 Workshop,” 2018.
LibreCat | Download (ext.)
 
[5]
2017 | Report | LibreCat-ID: 11735 | OA
C. Boeddeker, P. Hanebrink, L. Drude, J. Heymann, and R. Haeb-Umbach, On the Computation of Complex-valued Gradients with Application to Statistically Optimum Beamforming. 2017.
LibreCat | Download (ext.)
 
[4]
2017 | Conference Paper | LibreCat-ID: 11736 | OA
C. Boeddeker, P. Hanebrink, L. Drude, J. Heymann, and R. Haeb-Umbach, “Optimizing Neural-Network Supported Acoustic Beamforming by Algorithmic Differentiation,” in Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2017.
LibreCat | Download (ext.)
 
[3]
2017 | Conference Paper | LibreCat-ID: 11809 | OA
J. Heymann, L. Drude, C. Boeddeker, P. Hanebrink, and R. Haeb-Umbach, “BEAMNET: End-to-End Training of a Beamformer-Supported Multi-Channel ASR System,” in Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2017.
LibreCat | Files available | Download (ext.)
 
[2]
2017 | Conference Paper | LibreCat-ID: 11895 | OA
J. Schmalenstroeer, J. Heymann, L. Drude, C. Boeddeker, and R. Haeb-Umbach, “Multi-Stage Coherence Drift Based Sampling Rate Synchronization for Acoustic Beamforming,” 2017.
LibreCat | Files available | Download (ext.)
 
[1]
2016 | Conference Paper | LibreCat-ID: 11751 | OA
L. Drude, C. Boeddeker, and R. Haeb-Umbach, “Blind Speech Separation based on Complex Spherical k-Mode Clustering,” in Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2016.
LibreCat | Files available | Download (ext.)
 

Search

Filter Publications

Display / Sort

Citation Style: IEEE

Export / Embed

42 Publications

Mark all

[42]
2024 | Journal Article | LibreCat-ID: 52958 | OA
C. Boeddeker, A. S. Subramanian, G. Wichern, R. Haeb-Umbach, and J. Le Roux, “TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 1185–1197, 2024, doi: 10.1109/taslp.2024.3350887.
LibreCat | DOI | Download (ext.)
 
[41]
2024 | Conference Paper | LibreCat-ID: 53659
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios,” presented at the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul, 2024, doi: 10.1109/icassp48485.2024.10445911.
LibreCat | DOI
 
[40]
2023 | Conference Paper | LibreCat-ID: 47128 | OA
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization,” presented at the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes, 2023, doi: 10.1109/icassp49357.2023.10095370.
LibreCat | Files available | DOI
 
[39]
2023 | Conference Paper | LibreCat-ID: 47129 | OA
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures,” 2023, doi: 10.21437/interspeech.2023-1379.
LibreCat | Files available | DOI
 
[38]
2023 | Conference Paper | LibreCat-ID: 48391
R. Aralikatti, C. Boeddeker, G. Wichern, A. Subramanian, and J. Le Roux, “Reverberation as Supervision For Speech Separation,” 2023, doi: 10.1109/icassp49357.2023.10095022.
LibreCat | DOI
 
[37]
2023 | Conference Paper | LibreCat-ID: 48390
S. Berger, P. Vieting, C. Boeddeker, R. Schlüter, and R. Haeb-Umbach, “Mixture Encoder for Joint Speech Separation and Recognition,” 2023, doi: 10.21437/interspeech.2023-1815.
LibreCat | DOI
 
[36]
2023 | Journal Article | LibreCat-ID: 35602 | OA
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 576–589, 2023, doi: 10.1109/taslp.2022.3228629.
LibreCat | Files available | DOI
 
[35]
2023 | Conference Paper | LibreCat-ID: 48281 | OA
T. von Neumann, C. Boeddeker, K. Kinoshita, M. Delcroix, and R. Haeb-Umbach, “On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems,” 2023, doi: 10.1109/icassp49357.2023.10094784.
LibreCat | Files available | DOI | Download (ext.)
 
[34]
2023 | Conference Paper | LibreCat-ID: 48275 | OA
T. von Neumann, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems,” presented at the CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin, 2023.
LibreCat | Files available | Download (ext.)
 
[33]
2022 | Journal Article | LibreCat-ID: 33669 | OA
W. Zhang, X. Chang, C. Boeddeker, T. Nakatani, S. Watanabe, and Y. Qian, “End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022, doi: 10.1109/TASLP.2022.3209942.
LibreCat | Files available | DOI
 
[32]
2022 | Conference Paper | LibreCat-ID: 33954 | OA
C. Boeddeker, T. Cord-Landwehr, T. von Neumann, and R. Haeb-Umbach, “An Initialization Scheme for Meeting Separation with Spatial Mixture Models,” 2022, doi: 10.21437/interspeech.2022-10929.
LibreCat | DOI | Download (ext.)
 
[31]
2022 | Conference Paper | LibreCat-ID: 33958
K. Kinoshita, T. von Neumann, M. Delcroix, C. Boeddeker, and R. Haeb-Umbach, “Utterance-by-utterance overlap-aware neural diarization with Graph-PIT,” in Proc. Interspeech 2022, 2022, pp. 1486–1490, doi: 10.21437/Interspeech.2022-11408.
LibreCat | DOI
 
[30]
2022 | Conference Paper | LibreCat-ID: 33819 | OA
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “SA-SDR: A Novel Loss Function for Separation of Meeting Style Data,” 2022, doi: 10.1109/icassp43922.2022.9746757.
LibreCat | Files available | DOI
 
[29]
2022 | Conference Paper | LibreCat-ID: 33847 | OA
T. Cord-Landwehr, T. von Neumann, C. Boeddeker, and R. Haeb-Umbach, “MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator,” presented at the 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg, 2022.
LibreCat | Files available | arXiv
 
[28]
2022 | Conference Paper | LibreCat-ID: 33848 | OA
T. Cord-Landwehr, C. Boeddeker, T. von Neumann, C. Zorila, R. Doddipatla, and R. Haeb-Umbach, “Monaural source separation: From anechoic to reverberant environments,” presented at the 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.
LibreCat | Files available | arXiv
 
[27]
2022 | Misc | LibreCat-ID: 33816 | OA
T. Gburrek, C. Boeddeker, T. von Neumann, T. Cord-Landwehr, J. Schmalenstroeer, and R. Haeb-Umbach, A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv, 2022.
LibreCat | Files available | DOI
 
[26]
2021 | Conference Paper | LibreCat-ID: 28256
W. Zhang et al., “End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend,” 2021, doi: 10.1109/icassp39728.2021.9414464.
LibreCat | DOI
 
[25]
2021 | Conference Paper | LibreCat-ID: 28262
C. Li et al., “ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration,” 2021, doi: 10.1109/slt48900.2021.9383615.
LibreCat | DOI
 
[24]
2021 | Conference Paper | LibreCat-ID: 28261
C. Li et al., “Dual-Path RNN for Long Recording Speech Separation,” 2021, doi: 10.1109/slt48900.2021.9383514.
LibreCat | DOI
 
[23]
2021 | Conference Paper | LibreCat-ID: 44843 | OA
C. Boeddeker, F. Rautenberg, and R. Haeb-Umbach, “A Comparison and Combination of Unsupervised Blind Source Separation  Techniques,” presented at the ITG Conference on Speech Communication, Kiel, 2021.
LibreCat | Files available | Download (ext.) | arXiv
 
[22]
2021 | Conference Paper | LibreCat-ID: 28259 | OA
C. Boeddeker et al., “Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation,” 2021, doi: 10.1109/icassp39728.2021.9414661.
LibreCat | Files available | DOI
 
[21]
2021 | Conference Paper | LibreCat-ID: 26770 | OA
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers,” presented at the Interspeech, 2021, doi: 10.21437/interspeech.2021-1177.
LibreCat | Files available | DOI
 
[20]
2021 | Conference Paper | LibreCat-ID: 29173 | OA
T. von Neumann, C. Boeddeker, K. Kinoshita, M. Delcroix, and R. Haeb-Umbach, “Speeding Up Permutation Invariant Training for Source Separation,” presented at the Speech Communication; 14th ITG Conference, Kiel, 2021.
LibreCat | Files available
 
[19]
2020 | Conference Paper | LibreCat-ID: 20700 | OA
C. Boeddeker et al., “Towards a speaker diarization system for the CHiME 2020 dinner party transcription,” in Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments, 2020.
LibreCat | Files available
 
[18]
2020 | Journal Article | LibreCat-ID: 17598 | OA
T. Nakatani, C. Boeddeker, K. Kinoshita, R. Ikeshita, M. Delcroix, and R. Haeb-Umbach, “Jointly optimal denoising, dereverberation, and source separation,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, pp. 1–1, 2020, doi: 10.1109/TASLP.2020.3013118.
LibreCat | DOI | Download (ext.)
 
[17]
2020 | Conference Paper | LibreCat-ID: 20504
J. Heitkaemper, D. Jakobeit, C. Boeddeker, L. Drude, and R. Haeb-Umbach, “Demystifying TasNet: A Dissecting Approach,” 2020.
LibreCat | Files available
 
[16]
2020 | Preprint | LibreCat-ID: 28263
S. Watanabe et al., “CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for  Unsegmented Recordings,” arXiv:2004.09249. 2020.
LibreCat
 
[15]
2020 | Conference Paper | LibreCat-ID: 20762 | OA
T. von Neumann et al., “End-to-End Training of Time Domain Audio Separation and Recognition,” in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp. 7004–7008, doi: 10.1109/ICASSP40776.2020.9053461.
LibreCat | Files available | DOI
 
[14]
2020 | Conference Paper | LibreCat-ID: 20764 | OA
T. von Neumann et al., “Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR,” in Proc. Interspeech 2020, 2020, pp. 3097–3101, doi: 10.21437/Interspeech.2020-2519.
LibreCat | Files available | DOI
 
[13]
2019 | Journal Article | LibreCat-ID: 19446 | OA
L. Drude, J. Heitkaemper, C. Boeddeker, and R. Haeb-Umbach, “SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition,” ArXiv e-prints, 2019.
LibreCat | Files available
 
[12]
2019 | Conference Paper | LibreCat-ID: 15816 | OA
C. Zorila, C. Boeddeker, R. Doddipatla, and R. Haeb-Umbach, “An Investigation Into the Effectiveness of Enhancement in ASR Training and Test for Chime-5 Dinner Party Transcription,” in ASRU 2019, Sentosa, Singapore, 2019.
LibreCat | Files available
 
[11]
2019 | Conference Paper | LibreCat-ID: 14826 | OA
N. Kanda, C. Boeddeker, J. Heitkaemper, Y. Fujita, S. Horiguchi, and R. Haeb-Umbach, “Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR,” in INTERSPEECH 2019, Graz, Austria, 2019.
LibreCat | Files available
 
[10]
2018 | Conference Paper | LibreCat-ID: 11872 | OA
L. Drude et al., “Integration neural network based beamforming and weighted prediction error dereverberation,” in INTERSPEECH 2018, Hyderabad, India, 2018.
LibreCat | Files available | Download (ext.)
 
[9]
2018 | Conference Paper | LibreCat-ID: 11873 | OA
L. Drude, J. Heymann, C. Boeddeker, and R. Haeb-Umbach, “NARA-WPE: A Python package for weighted prediction error dereverberation in Numpy and Tensorflow for online and offline processing,” in ITG 2018, Oldenburg, Germany, 2018.
LibreCat | Files available | Download (ext.)
 
[8]
2018 | Conference Paper | LibreCat-ID: 12901 | OA
C. Boeddeker, H. Erdogan, T. Yoshioka, and R. Haeb-Umbach, “Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition,” in ICASSP 2018, Calgary, Canada, 2018.
LibreCat | Files available | Download (ext.)
 
[7]
2018 | Conference Paper | LibreCat-ID: 12899 | OA
C. Boeddeker, J. Heitkaemper, J. Schmalenstroeer, L. Drude, J. Heymann, and R. Haeb-Umbach, “Front-End Processing for the CHiME-5 Dinner Party Scenario,” 2018.
LibreCat | Files available | Download (ext.)
 
[6]
2018 | Conference Paper | LibreCat-ID: 11876 | OA
M. Kitza et al., “The RWTH/UPB System Combination for the CHiME 2018 Workshop,” 2018.
LibreCat | Download (ext.)
 
[5]
2017 | Report | LibreCat-ID: 11735 | OA
C. Boeddeker, P. Hanebrink, L. Drude, J. Heymann, and R. Haeb-Umbach, On the Computation of Complex-valued Gradients with Application to Statistically Optimum Beamforming. 2017.
LibreCat | Download (ext.)
 
[4]
2017 | Conference Paper | LibreCat-ID: 11736 | OA
C. Boeddeker, P. Hanebrink, L. Drude, J. Heymann, and R. Haeb-Umbach, “Optimizing Neural-Network Supported Acoustic Beamforming by Algorithmic Differentiation,” in Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2017.
LibreCat | Download (ext.)
 
[3]
2017 | Conference Paper | LibreCat-ID: 11809 | OA
J. Heymann, L. Drude, C. Boeddeker, P. Hanebrink, and R. Haeb-Umbach, “BEAMNET: End-to-End Training of a Beamformer-Supported Multi-Channel ASR System,” in Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2017.
LibreCat | Files available | Download (ext.)
 
[2]
2017 | Conference Paper | LibreCat-ID: 11895 | OA
J. Schmalenstroeer, J. Heymann, L. Drude, C. Boeddeker, and R. Haeb-Umbach, “Multi-Stage Coherence Drift Based Sampling Rate Synchronization for Acoustic Beamforming,” 2017.
LibreCat | Files available | Download (ext.)
 
[1]
2016 | Conference Paper | LibreCat-ID: 11751 | OA
L. Drude, C. Boeddeker, and R. Haeb-Umbach, “Blind Speech Separation based on Complex Spherical k-Mode Clustering,” in Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2016.
LibreCat | Files available | Download (ext.)
 

Search

Filter Publications

Display / Sort

Citation Style: IEEE

Export / Embed