42 Publications

[42]
2024 | Journal Article | LibreCat-ID: 52958 | OA
Boeddeker C, Subramanian AS, Wichern G, Haeb-Umbach R, Le Roux J. TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2024;32:1185-1197. doi:10.1109/taslp.2024.3350887
LibreCat | DOI | Download (ext.)
 
[41]
2024 | Conference Paper | LibreCat-ID: 53659
Cord-Landwehr T, Boeddeker C, Zorilă C, Doddipatla R, Haeb-Umbach R. Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2024. doi:10.1109/icassp48485.2024.10445911
LibreCat | DOI
 
[40]
2023 | Conference Paper | LibreCat-ID: 47128 | OA
Cord-Landwehr T, Boeddeker C, Zorilă C, Doddipatla R, Haeb-Umbach R. Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2023. doi:10.1109/icassp49357.2023.10095370
LibreCat | Files available | DOI
 
[39]
2023 | Conference Paper | LibreCat-ID: 47129 | OA
Cord-Landwehr T, Boeddeker C, Zorilă C, Doddipatla R, Haeb-Umbach R. A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures. In: INTERSPEECH 2023. ISCA; 2023. doi:10.21437/interspeech.2023-1379
LibreCat | Files available | DOI
 
[38]
2023 | Conference Paper | LibreCat-ID: 48391
Aralikatti R, Boeddeker C, Wichern G, Subramanian A, Le Roux J. Reverberation as Supervision For Speech Separation. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2023. doi:10.1109/icassp49357.2023.10095022
LibreCat | DOI
 
[37]
2023 | Conference Paper | LibreCat-ID: 48390
Berger S, Vieting P, Boeddeker C, Schlüter R, Haeb-Umbach R. Mixture Encoder for Joint Speech Separation and Recognition. In: INTERSPEECH 2023. ISCA; 2023. doi:10.21437/interspeech.2023-1815
LibreCat | DOI
 
[36]
2023 | Journal Article | LibreCat-ID: 35602 | OA
von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2023;31:576-589. doi:10.1109/taslp.2022.3228629
LibreCat | Files available | DOI
 
[35]
2023 | Conference Paper | LibreCat-ID: 48281 | OA
von Neumann T, Boeddeker C, Kinoshita K, Delcroix M, Haeb-Umbach R. On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2023. doi:10.1109/icassp49357.2023.10094784
LibreCat | Files available | DOI | Download (ext.)
 
[34]
2023 | Conference Paper | LibreCat-ID: 48275 | OA
von Neumann T, Boeddeker C, Delcroix M, Haeb-Umbach R. MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. In: Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments. ; 2023.
LibreCat | Files available | Download (ext.)
 
[33]
2022 | Journal Article | LibreCat-ID: 33669 | OA
Zhang W, Chang X, Boeddeker C, Nakatani T, Watanabe S, Qian Y. End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party. IEEE/ACM Transactions on Audio, Speech, and Language Processing. Published online 2022. doi:10.1109/TASLP.2022.3209942
LibreCat | Files available | DOI
 
[32]
2022 | Conference Paper | LibreCat-ID: 33954 | OA
Boeddeker C, Cord-Landwehr T, von Neumann T, Haeb-Umbach R. An Initialization Scheme for Meeting Separation with Spatial Mixture Models. In: Interspeech 2022. ISCA; 2022. doi:10.21437/interspeech.2022-10929
LibreCat | DOI | Download (ext.)
 
[31]
2022 | Conference Paper | LibreCat-ID: 33958
Kinoshita K, von Neumann T, Delcroix M, Boeddeker C, Haeb-Umbach R. Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. In: Proc. Interspeech 2022. ISCA; 2022:1486-1490. doi:10.21437/Interspeech.2022-11408
LibreCat | DOI
 
[30]
2022 | Conference Paper | LibreCat-ID: 33819 | OA
von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. In: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2022. doi:10.1109/icassp43922.2022.9746757
LibreCat | Files available | DOI
 
[29]
2022 | Conference Paper | LibreCat-ID: 33847 | OA
Cord-Landwehr T, von Neumann T, Boeddeker C, Haeb-Umbach R. MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). ; 2022.
LibreCat | Files available | arXiv
 
[28]
2022 | Conference Paper | LibreCat-ID: 33848 | OA
Cord-Landwehr T, Boeddeker C, von Neumann T, Zorilă C, Doddipatla R, Haeb-Umbach R. Monaural source separation: From anechoic to reverberant environments. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE; 2022.
LibreCat | Files available | arXiv
 
[27]
2022 | Misc | LibreCat-ID: 33816 | OA
Gburrek T, Boeddeker C, von Neumann T, Cord-Landwehr T, Schmalenstroeer J, Haeb-Umbach R. A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv; 2022. doi:10.48550/ARXIV.2205.00944
LibreCat | Files available | DOI
 
[26]
2021 | Conference Paper | LibreCat-ID: 28256
Zhang W, Boeddeker C, Watanabe S, et al. End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2021. doi:10.1109/icassp39728.2021.9414464
LibreCat | DOI
 
[25]
2021 | Conference Paper | LibreCat-ID: 28262
Li C, Shi J, Zhang W, et al. ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration. In: 2021 IEEE Spoken Language Technology Workshop (SLT). ; 2021. doi:10.1109/slt48900.2021.9383615
LibreCat | DOI
 
[24]
2021 | Conference Paper | LibreCat-ID: 28261
Li C, Luo Y, Han C, et al. Dual-Path RNN for Long Recording Speech Separation. In: 2021 IEEE Spoken Language Technology Workshop (SLT). ; 2021. doi:10.1109/slt48900.2021.9383514
LibreCat | DOI
 
[23]
2021 | Conference Paper | LibreCat-ID: 44843 | OA
Boeddeker C, Rautenberg F, Haeb-Umbach R. A Comparison and Combination of Unsupervised Blind Source Separation Techniques. In: ITG Conference on Speech Communication. ; 2021.
LibreCat | Files available | Download (ext.) | arXiv
 
[22]
2021 | Conference Paper | LibreCat-ID: 28259 | OA
Boeddeker C, Zhang W, Nakatani T, et al. Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2021. doi:10.1109/icassp39728.2021.9414661
LibreCat | Files available | DOI
 
[21]
2021 | Conference Paper | LibreCat-ID: 26770 | OA
von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. In: Interspeech 2021. ; 2021. doi:10.21437/interspeech.2021-1177
LibreCat | Files available | DOI
 
[20]
2021 | Conference Paper | LibreCat-ID: 29173 | OA
von Neumann T, Boeddeker C, Kinoshita K, Delcroix M, Haeb-Umbach R. Speeding Up Permutation Invariant Training for Source Separation. In: Speech Communication; 14th ITG Conference. ; 2021.
LibreCat | Files available
 
[19]
2020 | Conference Paper | LibreCat-ID: 20700 | OA
Boeddeker C, Cord-Landwehr T, Heitkaemper J, et al. Towards a speaker diarization system for the CHiME 2020 dinner party transcription. In: Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments. ; 2020.
LibreCat | Files available
 
[18]
2020 | Journal Article | LibreCat-ID: 17598 | OA
Nakatani T, Boeddeker C, Kinoshita K, Ikeshita R, Delcroix M, Haeb-Umbach R. Jointly optimal denoising, dereverberation, and source separation. IEEE/ACM Transactions on Audio, Speech, and Language Processing. Published online 2020:1-1. doi:10.1109/TASLP.2020.3013118
LibreCat | DOI | Download (ext.)
 
[17]
2020 | Conference Paper | LibreCat-ID: 20504
Heitkaemper J, Jakobeit D, Boeddeker C, Drude L, Haeb-Umbach R. Demystifying TasNet: A Dissecting Approach. In: ICASSP 2020 Virtual Barcelona Spain. ; 2020.
LibreCat | Files available
 
[16]
2020 | Preprint | LibreCat-ID: 28263
Watanabe S, Mandel M, Barker J, et al. CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings. arXiv:2004.09249. Published online 2020.
LibreCat
 
[15]
2020 | Conference Paper | LibreCat-ID: 20762 | OA
von Neumann T, Kinoshita K, Drude L, et al. End-to-End Training of Time Domain Audio Separation and Recognition. In: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2020:7004-7008. doi:10.1109/ICASSP40776.2020.9053461
LibreCat | Files available | DOI
 
[14]
2020 | Conference Paper | LibreCat-ID: 20764 | OA
von Neumann T, Boeddeker C, Drude L, et al. Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. In: Proc. Interspeech 2020. ; 2020:3097-3101. doi:10.21437/Interspeech.2020-2519
LibreCat | Files available | DOI
 
[13]
2019 | Journal Article | LibreCat-ID: 19446 | OA
Drude L, Heitkaemper J, Boeddeker C, Haeb-Umbach R. SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition. ArXiv e-prints. 2019.
LibreCat | Files available
 
[12]
2019 | Conference Paper | LibreCat-ID: 15816 | OA
Zorilă C, Boeddeker C, Doddipatla R, Haeb-Umbach R. An Investigation Into the Effectiveness of Enhancement in ASR Training and Test for CHiME-5 Dinner Party Transcription. In: ASRU 2019, Sentosa, Singapore. ; 2019.
LibreCat | Files available
 
[11]
2019 | Conference Paper | LibreCat-ID: 14826 | OA
Kanda N, Boeddeker C, Heitkaemper J, Fujita Y, Horiguchi S, Haeb-Umbach R. Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR. In: INTERSPEECH 2019, Graz, Austria. ; 2019.
LibreCat | Files available
 
[10]
2018 | Conference Paper | LibreCat-ID: 11872 | OA
Drude L, Boeddeker C, Heymann J, et al. Integrating neural network based beamforming and weighted prediction error dereverberation. In: INTERSPEECH 2018, Hyderabad, India. ; 2018.
LibreCat | Files available | Download (ext.)
 
[9]
2018 | Conference Paper | LibreCat-ID: 11873 | OA
Drude L, Heymann J, Boeddeker C, Haeb-Umbach R. NARA-WPE: A Python package for weighted prediction error dereverberation in Numpy and Tensorflow for online and offline processing. In: ITG 2018, Oldenburg, Germany. ; 2018.
LibreCat | Files available | Download (ext.)
 
[8]
2018 | Conference Paper | LibreCat-ID: 12901 | OA
Boeddeker C, Erdogan H, Yoshioka T, Haeb-Umbach R. Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition. In: ICASSP 2018, Calgary, Canada. ; 2018.
LibreCat | Files available | Download (ext.)
 
[7]
2018 | Conference Paper | LibreCat-ID: 12899 | OA
Boeddeker C, Heitkaemper J, Schmalenstroeer J, Drude L, Heymann J, Haeb-Umbach R. Front-End Processing for the CHiME-5 Dinner Party Scenario. In: Proc. CHiME 2018 Workshop on Speech Processing in Everyday Environments, Hyderabad, India. ; 2018.
LibreCat | Files available | Download (ext.)
 
[6]
2018 | Conference Paper | LibreCat-ID: 11876 | OA
Kitza M, Michel W, Boeddeker C, et al. The RWTH/UPB System Combination for the CHiME 2018 Workshop. In: Proc. CHiME 2018 Workshop on Speech Processing in Everyday Environments, Hyderabad, India. ; 2018.
LibreCat | Download (ext.)
 
[5]
2017 | Report | LibreCat-ID: 11735 | OA
Boeddeker C, Hanebrink P, Drude L, Heymann J, Haeb-Umbach R. On the Computation of Complex-Valued Gradients with Application to Statistically Optimum Beamforming. 2017.
LibreCat | Download (ext.)
 
[4]
2017 | Conference Paper | LibreCat-ID: 11736 | OA
Boeddeker C, Hanebrink P, Drude L, Heymann J, Haeb-Umbach R. Optimizing Neural-Network Supported Acoustic Beamforming by Algorithmic Differentiation. In: Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP). ; 2017.
LibreCat | Download (ext.)
 
[3]
2017 | Conference Paper | LibreCat-ID: 11809 | OA
Heymann J, Drude L, Boeddeker C, Hanebrink P, Haeb-Umbach R. BEAMNET: End-to-End Training of a Beamformer-Supported Multi-Channel ASR System. In: Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP). ; 2017.
LibreCat | Files available | Download (ext.)
 
[2]
2017 | Conference Paper | LibreCat-ID: 11895 | OA
Schmalenstroeer J, Heymann J, Drude L, Boeddeker C, Haeb-Umbach R. Multi-Stage Coherence Drift Based Sampling Rate Synchronization for Acoustic Beamforming. In: IEEE 19th International Workshop on Multimedia Signal Processing (MMSP). ; 2017.
LibreCat | Files available | Download (ext.)
 
[1]
2016 | Conference Paper | LibreCat-ID: 11751 | OA
Drude L, Boeddeker C, Haeb-Umbach R. Blind Speech Separation based on Complex Spherical k-Mode Clustering. In: Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP). ; 2016.
LibreCat | Files available | Download (ext.)
 
