26 Publications

[26]
2021 | Conference Paper | LibreCat-ID: 28256
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend
W. Zhang, C. Boeddeker, S. Watanabe, T. Nakatani, M. Delcroix, K. Kinoshita, T. Ochiai, N. Kamo, R. Haeb-Umbach, Y. Qian, in: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021.
LibreCat | DOI
 
[25]
2021 | Conference Paper | LibreCat-ID: 28262
ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration
C. Li, J. Shi, W. Zhang, A.S. Subramanian, X. Chang, N. Kamo, M. Hira, T. Hayashi, C. Boeddeker, Z. Chen, S. Watanabe, in: 2021 IEEE Spoken Language Technology Workshop (SLT), 2021.
LibreCat | DOI
 
[24]
2021 | Conference Paper | LibreCat-ID: 28261
Dual-Path RNN for Long Recording Speech Separation
C. Li, Y. Luo, C. Han, J. Li, T. Yoshioka, T. Zhou, M. Delcroix, K. Kinoshita, C. Boeddeker, Y. Qian, S. Watanabe, Z. Chen, in: 2021 IEEE Spoken Language Technology Workshop (SLT), 2021.
LibreCat | DOI
 
[23]
2021 | Conference Paper | LibreCat-ID: 28259
Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation
C. Boeddeker, W. Zhang, T. Nakatani, K. Kinoshita, T. Ochiai, M. Delcroix, N. Kamo, Y. Qian, R. Haeb-Umbach, in: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021.
LibreCat | Files available | DOI
 
[22]
2021 | Conference Paper | LibreCat-ID: 29311
A Comparison and Combination of Unsupervised Blind Source Separation Techniques
C. Boeddeker, F. Rautenberg, R. Haeb-Umbach, in: Speech Communication; 14th ITG Conference, 2021, pp. 1–5.
LibreCat | Files available
 
[21]
2021 | Conference Paper | LibreCat-ID: 26770 | OA
Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, R. Haeb-Umbach, in: Interspeech 2021, 2021.
LibreCat | Files available | DOI
 
[20]
2021 | Conference Paper | LibreCat-ID: 29173 | OA
Speeding Up Permutation Invariant Training for Source Separation
T. von Neumann, C. Boeddeker, K. Kinoshita, M. Delcroix, R. Haeb-Umbach, in: Speech Communication; 14th ITG Conference, 2021.
LibreCat | Files available
 
[19]
2020 | Journal Article | LibreCat-ID: 17598 | OA
Jointly optimal denoising, dereverberation, and source separation
T. Nakatani, C. Boeddeker, K. Kinoshita, R. Ikeshita, M. Delcroix, R. Haeb-Umbach, IEEE/ACM Transactions on Audio, Speech, and Language Processing (2020) 1–1.
LibreCat | DOI | Download (ext.)
 
[18]
2020 | Conference Paper | LibreCat-ID: 20700 | OA
Towards a speaker diarization system for the CHiME 2020 dinner party transcription
C. Boeddeker, T. Cord-Landwehr, J. Heitkaemper, C. Zorila, D. Hayakawa, M. Li, M. Liu, R. Doddipatla, R. Haeb-Umbach, in: Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments, 2020.
LibreCat | Files available
 
[17]
2020 | Conference Paper | LibreCat-ID: 20504
Demystifying TasNet: A Dissecting Approach
J. Heitkaemper, D. Jakobeit, C. Boeddeker, L. Drude, R. Haeb-Umbach, in: ICASSP 2020 Virtual Barcelona Spain, 2020.
LibreCat | Files available
 
[16]
2020 | Preprint | LibreCat-ID: 28263
CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings
S. Watanabe, M. Mandel, J. Barker, E. Vincent, A. Arora, X. Chang, S. Khudanpur, V. Manohar, D. Povey, D. Raj, D. Snyder, A.S. Subramanian, J. Trmal, B.B. Yair, C. Boeddeker, Z. Ni, Y. Fujita, S. Horiguchi, N. Kanda, T. Yoshioka, N. Ryant, ArXiv:2004.09249 (2020).
LibreCat
 
[15]
2020 | Conference Paper | LibreCat-ID: 20762 | OA
End-to-End Training of Time Domain Audio Separation and Recognition
T. von Neumann, K. Kinoshita, L. Drude, C. Boeddeker, M. Delcroix, T. Nakatani, R. Haeb-Umbach, in: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp. 7004–7008.
LibreCat | Files available | DOI
 
[14]
2020 | Conference Paper | LibreCat-ID: 20764 | OA
Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR
T. von Neumann, C. Boeddeker, L. Drude, K. Kinoshita, M. Delcroix, T. Nakatani, R. Haeb-Umbach, in: Proc. Interspeech 2020, 2020, pp. 3097–3101.
LibreCat | Files available | DOI
 
[13]
2019 | Journal Article | LibreCat-ID: 19446 | OA
SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition
L. Drude, J. Heitkaemper, C. Boeddeker, R. Haeb-Umbach, ArXiv E-Prints (2019).
LibreCat | Files available
 
[12]
2019 | Conference Paper | LibreCat-ID: 15816 | OA
An Investigation Into the Effectiveness of Enhancement in ASR Training and Test for CHiME-5 Dinner Party Transcription
C. Zorila, C. Boeddeker, R. Doddipatla, R. Haeb-Umbach, in: ASRU 2019, Sentosa, Singapore, 2019.
LibreCat | Files available
 
[11]
2019 | Conference Paper | LibreCat-ID: 14826 | OA
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR
N. Kanda, C. Boeddeker, J. Heitkaemper, Y. Fujita, S. Horiguchi, R. Haeb-Umbach, in: INTERSPEECH 2019, Graz, Austria, 2019.
LibreCat | Files available
 
[10]
2018 | Conference Paper | LibreCat-ID: 11872 | OA
Integrating neural network based beamforming and weighted prediction error dereverberation
L. Drude, C. Boeddeker, J. Heymann, K. Kinoshita, M. Delcroix, T. Nakatani, R. Haeb-Umbach, in: INTERSPEECH 2018, Hyderabad, India, 2018.
LibreCat | Files available | Download (ext.)
 
[9]
2018 | Conference Paper | LibreCat-ID: 11873 | OA
NARA-WPE: A Python package for weighted prediction error dereverberation in Numpy and Tensorflow for online and offline processing
L. Drude, J. Heymann, C. Boeddeker, R. Haeb-Umbach, in: ITG 2018, Oldenburg, Germany, 2018.
LibreCat | Files available | Download (ext.)
 
[8]
2018 | Conference Paper | LibreCat-ID: 11876 | OA
The RWTH/UPB System Combination for the CHiME 2018 Workshop
M. Kitza, W. Michel, C. Boeddeker, J. Heitkaemper, T. Menne, R. Schlüter, H. Ney, J. Schmalenstroeer, L. Drude, J. Heymann, R. Haeb-Umbach, in: Proc. CHiME 2018 Workshop on Speech Processing in Everyday Environments, Hyderabad, India, 2018.
LibreCat | Download (ext.)
 
[7]
2018 | Conference Paper | LibreCat-ID: 12899 | OA
Front-End Processing for the CHiME-5 Dinner Party Scenario
C. Boeddeker, J. Heitkaemper, J. Schmalenstroeer, L. Drude, J. Heymann, R. Haeb-Umbach, in: Proc. CHiME 2018 Workshop on Speech Processing in Everyday Environments, Hyderabad, India, 2018.
LibreCat | Files available | Download (ext.)
 
[6]
2018 | Conference Paper | LibreCat-ID: 12901 | OA
Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition
C. Boeddeker, H. Erdogan, T. Yoshioka, R. Haeb-Umbach, in: ICASSP 2018, Calgary, Canada, 2018.
LibreCat | Files available | Download (ext.)
 
[5]
2017 | Report | LibreCat-ID: 11735 | OA
On the Computation of Complex-valued Gradients with Application to Statistically Optimum Beamforming
C. Boeddeker, P. Hanebrink, L. Drude, J. Heymann, R. Haeb-Umbach, On the Computation of Complex-Valued Gradients with Application to Statistically Optimum Beamforming, 2017.
LibreCat | Download (ext.)
 
[4]
2017 | Conference Paper | LibreCat-ID: 11736 | OA
Optimizing Neural-Network Supported Acoustic Beamforming by Algorithmic Differentiation
C. Boeddeker, P. Hanebrink, L. Drude, J. Heymann, R. Haeb-Umbach, in: Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2017.
LibreCat | Download (ext.)
 
[3]
2017 | Conference Paper | LibreCat-ID: 11809 | OA
BEAMNET: End-to-End Training of a Beamformer-Supported Multi-Channel ASR System
J. Heymann, L. Drude, C. Boeddeker, P. Hanebrink, R. Haeb-Umbach, in: Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2017.
LibreCat | Files available | Download (ext.)
 
[2]
2017 | Conference Paper | LibreCat-ID: 11895 | OA
Multi-Stage Coherence Drift Based Sampling Rate Synchronization for Acoustic Beamforming
J. Schmalenstroeer, J. Heymann, L. Drude, C. Boeddeker, R. Haeb-Umbach, in: IEEE 19th International Workshop on Multimedia Signal Processing (MMSP), 2017.
LibreCat | Files available | Download (ext.)
 
[1]
2016 | Conference Paper | LibreCat-ID: 11751 | OA
Blind Speech Separation based on Complex Spherical k-Mode Clustering
L. Drude, C. Boeddeker, R. Haeb-Umbach, in: Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2016.
LibreCat | Files available | Download (ext.)
 

