49 Publications
2022 | Conference Paper | LibreCat-ID: 33954
C. Boeddeker, T. Cord-Landwehr, T. von Neumann, and R. Haeb-Umbach, “An Initialization Scheme for Meeting Separation with Spatial Mixture Models,” 2022, doi: 10.21437/interspeech.2022-10929.
2022 | Conference Paper | LibreCat-ID: 33958
K. Kinoshita, T. von Neumann, M. Delcroix, C. Boeddeker, and R. Haeb-Umbach, “Utterance-by-utterance overlap-aware neural diarization with Graph-PIT,” in Proc. Interspeech 2022, 2022, pp. 1486–1490, doi: 10.21437/Interspeech.2022-11408.
2021 | Conference Paper | LibreCat-ID: 28256
W. Zhang et al., “End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend,” 2021, doi: 10.1109/icassp39728.2021.9414464.
2021 | Conference Paper | LibreCat-ID: 28262
C. Li et al., “ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration,” 2021, doi: 10.1109/slt48900.2021.9383615.
2021 | Conference Paper | LibreCat-ID: 28261
C. Li et al., “Dual-Path RNN for Long Recording Speech Separation,” 2021, doi: 10.1109/slt48900.2021.9383514.
2021 | Conference Paper | LibreCat-ID: 44843
C. Boeddeker, F. Rautenberg, and R. Haeb-Umbach, “A Comparison and Combination of Unsupervised Blind Source Separation Techniques,” presented at the ITG Conference on Speech Communication, Kiel, 2021.
2021 | Conference Paper | LibreCat-ID: 28259
C. Boeddeker et al., “Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation,” 2021, doi: 10.1109/icassp39728.2021.9414661.
2021 | Conference Paper | LibreCat-ID: 26770
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers,” presented at Interspeech, 2021, doi: 10.21437/interspeech.2021-1177.
2021 | Conference Paper | LibreCat-ID: 29173
T. von Neumann, C. Boeddeker, K. Kinoshita, M. Delcroix, and R. Haeb-Umbach, “Speeding Up Permutation Invariant Training for Source Separation,” presented at the Speech Communication; 14th ITG Conference, Kiel, 2021.
2020 | Conference Paper | LibreCat-ID: 20700
C. Boeddeker et al., “Towards a speaker diarization system for the CHiME 2020 dinner party transcription,” in Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments, 2020.
2020 | Journal Article | LibreCat-ID: 17598
T. Nakatani, C. Boeddeker, K. Kinoshita, R. Ikeshita, M. Delcroix, and R. Haeb-Umbach, “Jointly optimal denoising, dereverberation, and source separation,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, pp. 1–1, 2020, doi: 10.1109/TASLP.2020.3013118.
2020 | Conference Paper | LibreCat-ID: 20504
J. Heitkaemper, D. Jakobeit, C. Boeddeker, L. Drude, and R. Haeb-Umbach, “Demystifying TasNet: A Dissecting Approach,” 2020.
2020 | Preprint | LibreCat-ID: 28263
S. Watanabe et al., “CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings,” arXiv:2004.09249, 2020.
2020 | Conference Paper | LibreCat-ID: 20762
T. von Neumann et al., “End-to-End Training of Time Domain Audio Separation and Recognition,” in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp. 7004–7008, doi: 10.1109/ICASSP40776.2020.9053461.
2020 | Conference Paper | LibreCat-ID: 20764
T. von Neumann et al., “Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR,” in Proc. Interspeech 2020, 2020, pp. 3097–3101, doi: 10.21437/Interspeech.2020-2519.
2020 | Conference Paper | LibreCat-ID: 20695
C. Boeddeker, T. Nakatani, K. Kinoshita, and R. Haeb-Umbach, “Jointly Optimal Dereverberation and Beamforming,” 2020, doi: 10.1109/icassp40776.2020.9054393.
2019 | Journal Article | LibreCat-ID: 19446
L. Drude, J. Heitkaemper, C. Boeddeker, and R. Haeb-Umbach, “SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition,” ArXiv e-prints, 2019.
2019 | Conference Paper | LibreCat-ID: 15816
C. Zorila, C. Boeddeker, R. Doddipatla, and R. Haeb-Umbach, “An Investigation Into the Effectiveness of Enhancement in ASR Training and Test for CHiME-5 Dinner Party Transcription,” in ASRU 2019, Sentosa, Singapore, 2019.
2019 | Conference Paper | LibreCat-ID: 14826
N. Kanda, C. Boeddeker, J. Heitkaemper, Y. Fujita, S. Horiguchi, and R. Haeb-Umbach, “Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR,” in INTERSPEECH 2019, Graz, Austria, 2019.
2018 | Conference Paper | LibreCat-ID: 11872
L. Drude et al., “Integrating neural network based beamforming and weighted prediction error dereverberation,” in INTERSPEECH 2018, Hyderabad, India, 2018.