Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).
We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.
49 Publications
2022 | Conference Paper | LibreCat-ID: 33954 |

Boeddeker C, Cord-Landwehr T, von Neumann T, Haeb-Umbach R. An Initialization Scheme for Meeting Separation with Spatial Mixture Models. In: Interspeech 2022. ISCA; 2022. doi:10.21437/interspeech.2022-10929
LibreCat
| DOI
| Download (ext.)
2022 | Conference Paper | LibreCat-ID: 33958
Kinoshita K, von Neumann T, Delcroix M, Boeddeker C, Haeb-Umbach R. Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. In: Proc. Interspeech 2022. ISCA; 2022:1486-1490. doi:10.21437/Interspeech.2022-11408
LibreCat
| DOI
| Download (ext.)
2021 | Conference Paper | LibreCat-ID: 28256
Zhang W, Boeddeker C, Watanabe S, et al. End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2021. doi:10.1109/icassp39728.2021.9414464
LibreCat
| DOI
2021 | Conference Paper | LibreCat-ID: 28262
Li C, Shi J, Zhang W, et al. ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration. In: 2021 IEEE Spoken Language Technology Workshop (SLT). ; 2021. doi:10.1109/slt48900.2021.9383615
LibreCat
| DOI
2021 | Conference Paper | LibreCat-ID: 28261
Li C, Luo Y, Han C, et al. Dual-Path RNN for Long Recording Speech Separation. In: 2021 IEEE Spoken Language Technology Workshop (SLT). ; 2021. doi:10.1109/slt48900.2021.9383514
LibreCat
| DOI
2021 | Conference Paper | LibreCat-ID: 44843 |

Boeddeker C, Rautenberg F, Haeb-Umbach R. A Comparison and Combination of Unsupervised Blind Source Separation Techniques. In: ITG Conference on Speech Communication. ; 2021.
LibreCat
| Files available
| Download (ext.)
| arXiv
2021 | Conference Paper | LibreCat-ID: 28259 |

Boeddeker C, Zhang W, Nakatani T, et al. Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2021. doi:10.1109/icassp39728.2021.9414661
LibreCat
| Files available
| DOI
2021 | Conference Paper | LibreCat-ID: 26770 |

von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. In: Interspeech 2021. ; 2021. doi:10.21437/interspeech.2021-1177
LibreCat
| Files available
| DOI
2021 | Conference Paper | LibreCat-ID: 29173 |

von Neumann T, Boeddeker C, Kinoshita K, Delcroix M, Haeb-Umbach R. Speeding Up Permutation Invariant Training for Source Separation. In: Speech Communication; 14th ITG Conference. ; 2021.
LibreCat
| Files available
2020 | Conference Paper | LibreCat-ID: 20700 |

Boeddeker C, Cord-Landwehr T, Heitkaemper J, et al. Towards a speaker diarization system for the CHiME 2020 dinner party transcription. In: Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments. ; 2020.
LibreCat
| Files available
2020 | Journal Article | LibreCat-ID: 17598 |

Nakatani T, Boeddeker C, Kinoshita K, Ikeshita R, Delcroix M, Haeb-Umbach R. Jointly optimal denoising, dereverberation, and source separation. IEEE/ACM Transactions on Audio, Speech, and Language Processing. Published online 2020:1-1. doi:10.1109/TASLP.2020.3013118
LibreCat
| DOI
| Download (ext.)
2020 | Conference Paper | LibreCat-ID: 20504
Heitkaemper J, Jakobeit D, Boeddeker C, Drude L, Haeb-Umbach R. Demystifying TasNet: A Dissecting Approach. In: ICASSP 2020 Virtual Barcelona Spain. ; 2020.
LibreCat
| Files available
2020 | Preprint | LibreCat-ID: 28263
Watanabe S, Mandel M, Barker J, et al. CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings. arXiv:200409249. Published online 2020.
LibreCat
2020 | Conference Paper | LibreCat-ID: 20762 |

von Neumann T, Kinoshita K, Drude L, et al. End-to-End Training of Time Domain Audio Separation and Recognition. In: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2020:7004-7008. doi:10.1109/ICASSP40776.2020.9053461
LibreCat
| Files available
| DOI
2020 | Conference Paper | LibreCat-ID: 20764 |

von Neumann T, Boeddeker C, Drude L, et al. Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. In: Proc. Interspeech 2020. ; 2020:3097-3101. doi:10.21437/Interspeech.2020-2519
LibreCat
| Files available
| DOI
2020 | Conference Paper | LibreCat-ID: 20695 |

Boeddeker C, Nakatani T, Kinoshita K, Haeb-Umbach R. Jointly Optimal Dereverberation and Beamforming. In: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2020. doi:10.1109/icassp40776.2020.9054393
LibreCat
| Files available
| DOI
2019 | Journal Article | LibreCat-ID: 19446 |

Drude L, Heitkaemper J, Boeddeker C, Haeb-Umbach R. SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition. ArXiv e-prints. 2019.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 15816 |

Zorila C, Boeddeker C, Doddipatla R, Haeb-Umbach R. An Investigation Into the Effectiveness of Enhancement in ASR Training and Test for Chime-5 Dinner Party Transcription. In: ASRU 2019, Sentosa, Singapore. ; 2019.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 14826 |

Kanda N, Boeddeker C, Heitkaemper J, Fujita Y, Horiguchi S, Haeb-Umbach R. Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR. In: INTERSPEECH 2019, Graz, Austria. ; 2019.
LibreCat
| Files available
2018 | Conference Paper | LibreCat-ID: 11872 |

Drude L, Boeddeker C, Heymann J, et al. Integration neural network based beamforming and weighted prediction error dereverberation. In: INTERSPEECH 2018, Hyderabad, India. ; 2018.
LibreCat
| Files available
| Download (ext.)