LibreCat – Publication List Manager

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

43 Publications

2022 | Journal Article | LibreCat-ID: 33669 |

W. Zhang, X. Chang, C. Boeddeker, T. Nakatani, S. Watanabe, and Y. Qian, “End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022, doi: 10.1109/TASLP.2022.3209942.

LibreCat | Files available | DOI

2022 | Conference Paper | LibreCat-ID: 33954 |

C. Boeddeker, T. Cord-Landwehr, T. von Neumann, and R. Haeb-Umbach, “An Initialization Scheme for Meeting Separation with Spatial Mixture Models,” 2022, doi: 10.21437/interspeech.2022-10929.

LibreCat | DOI | Download (ext.)

2022 | Conference Paper | LibreCat-ID: 33958

K. Kinoshita, T. von Neumann, M. Delcroix, C. Boeddeker, and R. Haeb-Umbach, “Utterance-by-utterance overlap-aware neural diarization with Graph-PIT,” in Proc. Interspeech 2022, 2022, pp. 1486–1490, doi: 10.21437/Interspeech.2022-11408.

LibreCat | DOI

2022 | Conference Paper | LibreCat-ID: 33819 |

T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “SA-SDR: A Novel Loss Function for Separation of Meeting Style Data,” 2022, doi: 10.1109/icassp43922.2022.9746757.

LibreCat | Files available | DOI

2022 | Conference Paper | LibreCat-ID: 33847 |

T. Cord-Landwehr, T. von Neumann, C. Boeddeker, and R. Haeb-Umbach, “MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator,” presented at the 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg, 2022.

LibreCat | Files available | arXiv

2022 | Conference Paper | LibreCat-ID: 33848 |

T. Cord-Landwehr, C. Boeddeker, T. von Neumann, C. Zorila, R. Doddipatla, and R. Haeb-Umbach, “Monaural source separation: From anechoic to reverberant environments,” presented at the 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.

LibreCat | Files available | arXiv

2022 | Misc | LibreCat-ID: 33816 |

T. Gburrek, C. Boeddeker, T. von Neumann, T. Cord-Landwehr, J. Schmalenstroeer, and R. Haeb-Umbach, A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv, 2022.

LibreCat | Files available | DOI

2021 | Conference Paper | LibreCat-ID: 28256

W. Zhang et al., “End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend,” 2021, doi: 10.1109/icassp39728.2021.9414464.

LibreCat | DOI

2021 | Conference Paper | LibreCat-ID: 28262

C. Li et al., “ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration,” 2021, doi: 10.1109/slt48900.2021.9383615.

LibreCat | DOI

2021 | Conference Paper | LibreCat-ID: 28261

C. Li et al., “Dual-Path RNN for Long Recording Speech Separation,” 2021, doi: 10.1109/slt48900.2021.9383514.

LibreCat | DOI

2021 | Conference Paper | LibreCat-ID: 44843 |

C. Boeddeker, F. Rautenberg, and R. Haeb-Umbach, “A Comparison and Combination of Unsupervised Blind Source Separation Techniques,” presented at the ITG Conference on Speech Communication, Kiel, 2021.

LibreCat | Files available | Download (ext.) | arXiv

2021 | Conference Paper | LibreCat-ID: 28259 |

C. Boeddeker et al., “Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation,” 2021, doi: 10.1109/icassp39728.2021.9414661.

LibreCat | Files available | DOI

2021 | Conference Paper | LibreCat-ID: 26770 |

T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers,” presented at the Interspeech, 2021, doi: 10.21437/interspeech.2021-1177.

LibreCat | Files available | DOI

2021 | Conference Paper | LibreCat-ID: 29173 |

T. von Neumann, C. Boeddeker, K. Kinoshita, M. Delcroix, and R. Haeb-Umbach, “Speeding Up Permutation Invariant Training for Source Separation,” presented at the Speech Communication; 14th ITG Conference, Kiel, 2021.

LibreCat | Files available

2020 | Conference Paper | LibreCat-ID: 20700 |

C. Boeddeker et al., “Towards a speaker diarization system for the CHiME 2020 dinner party transcription,” in Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments, 2020.

LibreCat | Files available

2020 | Journal Article | LibreCat-ID: 17598 |

T. Nakatani, C. Boeddeker, K. Kinoshita, R. Ikeshita, M. Delcroix, and R. Haeb-Umbach, “Jointly optimal denoising, dereverberation, and source separation,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, pp. 1–1, 2020, doi: 10.1109/TASLP.2020.3013118.

LibreCat | DOI | Download (ext.)

2020 | Conference Paper | LibreCat-ID: 20504

J. Heitkaemper, D. Jakobeit, C. Boeddeker, L. Drude, and R. Haeb-Umbach, “Demystifying TasNet: A Dissecting Approach,” 2020.

LibreCat | Files available

2020 | Preprint | LibreCat-ID: 28263

S. Watanabe et al., “CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings,” arXiv:2004.09249. 2020.

LibreCat

2020 | Conference Paper | LibreCat-ID: 20762 |

T. von Neumann et al., “End-to-End Training of Time Domain Audio Separation and Recognition,” in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp. 7004–7008, doi: 10.1109/ICASSP40776.2020.9053461.

LibreCat | Files available | DOI

2020 | Conference Paper | LibreCat-ID: 20764 |

T. von Neumann et al., “Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR,” in Proc. Interspeech 2020, 2020, pp. 3097–3101, doi: 10.21437/Interspeech.2020-2519.

LibreCat | Files available | DOI

Publications at Paderborn University

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

43 Publications

Filters and Search Terms

Search

Filter Publications

Display / Sort

Export / Embed

Publications at Paderborn University

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

43 Publications

Filters and Search Terms

Search

Filter Publications

Display / Sort

Export / Embed

Export Options