Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

42 Publications


2024 | Journal Article | LibreCat-ID: 52958 | OA
Boeddeker, Christoph, et al. “TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, Institute of Electrical and Electronics Engineers (IEEE), 2024, pp. 1185–97, doi:10.1109/taslp.2024.3350887.
LibreCat | DOI | Download (ext.)
 

2024 | Conference Paper | LibreCat-ID: 53659
Cord-Landwehr, Tobias, et al. “Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios.” ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2024, doi:10.1109/icassp48485.2024.10445911.
LibreCat | DOI
 

2023 | Conference Paper | LibreCat-ID: 47128 | OA
Cord-Landwehr, Tobias, et al. “Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization.” ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023, doi:10.1109/icassp49357.2023.10095370.
LibreCat | Files available | DOI
 

2023 | Conference Paper | LibreCat-ID: 47129 | OA
Cord-Landwehr, Tobias, et al. “A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures.” INTERSPEECH 2023, ISCA, 2023, doi:10.21437/interspeech.2023-1379.
LibreCat | Files available | DOI
 

2023 | Conference Paper | LibreCat-ID: 48391
Aralikatti, Rohith, et al. “Reverberation as Supervision For Speech Separation.” ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023, doi:10.1109/icassp49357.2023.10095022.
LibreCat | DOI
 

2023 | Conference Paper | LibreCat-ID: 48390
Berger, Simon, et al. “Mixture Encoder for Joint Speech Separation and Recognition.” INTERSPEECH 2023, ISCA, 2023, doi:10.21437/interspeech.2023-1815.
LibreCat | DOI
 

2023 | Journal Article | LibreCat-ID: 35602 | OA
von Neumann, Thilo, et al. “Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, Institute of Electrical and Electronics Engineers (IEEE), 2023, pp. 576–89, doi:10.1109/taslp.2022.3228629.
LibreCat | Files available | DOI
 

2023 | Conference Paper | LibreCat-ID: 48281 | OA
von Neumann, Thilo, et al. “On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems.” ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023, doi:10.1109/icassp49357.2023.10094784.
LibreCat | Files available | DOI | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 48275 | OA
von Neumann, Thilo, et al. “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems.” Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments, 2023.
LibreCat | Files available | Download (ext.)
 

2022 | Journal Article | LibreCat-ID: 33669 | OA
Zhang, Wangyou, et al. “End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022, doi:10.1109/TASLP.2022.3209942.
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 33954 | OA
Boeddeker, Christoph, et al. “An Initialization Scheme for Meeting Separation with Spatial Mixture Models.” Interspeech 2022, ISCA, 2022, doi:10.21437/interspeech.2022-10929.
LibreCat | DOI | Download (ext.)
 

2022 | Conference Paper | LibreCat-ID: 33958
Kinoshita, Keisuke, et al. “Utterance-by-Utterance Overlap-Aware Neural Diarization with Graph-PIT.” Proc. Interspeech 2022, ISCA, 2022, pp. 1486–90, doi:10.21437/Interspeech.2022-11408.
LibreCat | DOI
 

2022 | Conference Paper | LibreCat-ID: 33819 | OA
von Neumann, Thilo, et al. “SA-SDR: A Novel Loss Function for Separation of Meeting Style Data.” ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2022, doi:10.1109/icassp43922.2022.9746757.
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 33847 | OA
Cord-Landwehr, Tobias, et al. “MMS-MSG: A Multi-Purpose Multi-Speaker Mixture Signal Generator.” 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.
LibreCat | Files available | arXiv
 

2022 | Conference Paper | LibreCat-ID: 33848 | OA
Cord-Landwehr, Tobias, et al. “Monaural Source Separation: From Anechoic to Reverberant Environments.” 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), IEEE, 2022.
LibreCat | Files available | arXiv
 

2022 | Misc | LibreCat-ID: 33816 | OA
Gburrek, Tobias, et al. A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv, 2022, doi:10.48550/ARXIV.2205.00944.
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 28256
Zhang, Wangyou, et al. “End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend.” ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, doi:10.1109/icassp39728.2021.9414464.
LibreCat | DOI
 

2021 | Conference Paper | LibreCat-ID: 28262
Li, Chenda, et al. “ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration.” 2021 IEEE Spoken Language Technology Workshop (SLT), 2021, doi:10.1109/slt48900.2021.9383615.
LibreCat | DOI
 

2021 | Conference Paper | LibreCat-ID: 28261
Li, Chenda, et al. “Dual-Path RNN for Long Recording Speech Separation.” 2021 IEEE Spoken Language Technology Workshop (SLT), 2021, doi:10.1109/slt48900.2021.9383514.
LibreCat | DOI
 

2021 | Conference Paper | LibreCat-ID: 44843 | OA
Boeddeker, Christoph, et al. “A Comparison and Combination of Unsupervised Blind Source Separation  Techniques.” ITG Conference on Speech Communication, 2021.
LibreCat | Files available | Download (ext.) | arXiv
 

2021 | Conference Paper | LibreCat-ID: 28259 | OA
Boeddeker, Christoph, et al. “Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation.” ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, doi:10.1109/icassp39728.2021.9414661.
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 26770 | OA
von Neumann, Thilo, et al. “Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers.” Interspeech 2021, 2021, doi:10.21437/interspeech.2021-1177.
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 29173 | OA
von Neumann, Thilo, et al. “Speeding Up Permutation Invariant Training for Source Separation.” Speech Communication; 14th ITG Conference, 2021.
LibreCat | Files available
 

2020 | Conference Paper | LibreCat-ID: 20700 | OA
Boeddeker, Christoph, et al. “Towards a Speaker Diarization System for the CHiME 2020 Dinner Party Transcription.” Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments, 2020.
LibreCat | Files available
 

2020 | Journal Article | LibreCat-ID: 17598 | OA
Nakatani, Tomohiro, et al. “Jointly Optimal Denoising, Dereverberation, and Source Separation.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020, pp. 1–1, doi:10.1109/TASLP.2020.3013118.
LibreCat | DOI | Download (ext.)
 

2020 | Conference Paper | LibreCat-ID: 20504
Heitkaemper, Jens, et al. “Demystifying TasNet: A Dissecting Approach.” ICASSP 2020 Virtual Barcelona Spain, 2020.
LibreCat | Files available
 

2020 | Preprint | LibreCat-ID: 28263
Watanabe, Shinji, et al. “CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for  Unsegmented Recordings.” ArXiv:2004.09249, 2020.
LibreCat
 

2020 | Conference Paper | LibreCat-ID: 20762 | OA
von Neumann, Thilo, et al. “End-to-End Training of Time Domain Audio Separation and Recognition.” ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp. 7004–08, doi:10.1109/ICASSP40776.2020.9053461.
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 20764 | OA
von Neumann, Thilo, et al. “Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR.” Proc. Interspeech 2020, 2020, pp. 3097–101, doi:10.21437/Interspeech.2020-2519.
LibreCat | Files available | DOI
 

2019 | Journal Article | LibreCat-ID: 19446 | OA
Drude, Lukas, et al. “SMS-WSJ: Database, Performance Measures, and Baseline Recipe for Multi-Channel Source Separation and Recognition.” ArXiv E-Prints, 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 15816 | OA
Zorila, Catalin, et al. “An Investigation Into the Effectiveness of Enhancement in ASR Training and Test for Chime-5 Dinner Party Transcription.” ASRU 2019, Sentosa, Singapore, 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 14826 | OA
Kanda, Naoyuki, et al. “Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR.” INTERSPEECH 2019, Graz, Austria, 2019.
LibreCat | Files available
 

2018 | Conference Paper | LibreCat-ID: 11872 | OA
Drude, Lukas, et al. “Integration Neural Network Based Beamforming and Weighted Prediction Error Dereverberation.” INTERSPEECH 2018, Hyderabad, India, 2018.
LibreCat | Files available | Download (ext.)
 

2018 | Conference Paper | LibreCat-ID: 11873 | OA
Drude, Lukas, et al. “NARA-WPE: A Python Package for Weighted Prediction Error Dereverberation in Numpy and Tensorflow for Online and Offline Processing.” ITG 2018, Oldenburg, Germany, 2018.
LibreCat | Files available | Download (ext.)
 

2018 | Conference Paper | LibreCat-ID: 12901 | OA
Boeddeker, Christoph, et al. “Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition.” ICASSP 2018, Calgary, Canada, 2018.
LibreCat | Files available | Download (ext.)
 

2018 | Conference Paper | LibreCat-ID: 12899 | OA
Boeddeker, Christoph, et al. “Front-End Processing for the CHiME-5 Dinner Party Scenario.” Proc. CHiME 2018 Workshop on Speech Processing in Everyday Environments, Hyderabad, India, 2018.
LibreCat | Files available | Download (ext.)
 

2018 | Conference Paper | LibreCat-ID: 11876 | OA
Kitza, Markus, et al. “The RWTH/UPB System Combination for the CHiME 2018 Workshop.” Proc. CHiME 2018 Workshop on Speech Processing in Everyday Environments, Hyderabad, India, 2018.
LibreCat | Download (ext.)
 

2017 | Report | LibreCat-ID: 11735 | OA
Boeddeker, Christoph, et al. On the Computation of Complex-Valued Gradients with Application to Statistically Optimum Beamforming. 2017.
LibreCat | Download (ext.)
 

2017 | Conference Paper | LibreCat-ID: 11736 | OA
Boeddeker, Christoph, et al. “Optimizing Neural-Network Supported Acoustic Beamforming by Algorithmic Differentiation.” Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2017.
LibreCat | Download (ext.)
 

2017 | Conference Paper | LibreCat-ID: 11809 | OA
Heymann, Jahn, et al. “BEAMNET: End-to-End Training of a Beamformer-Supported Multi-Channel ASR System.” Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2017.
LibreCat | Files available | Download (ext.)
 

2017 | Conference Paper | LibreCat-ID: 11895 | OA
Schmalenstroeer, Joerg, et al. “Multi-Stage Coherence Drift Based Sampling Rate Synchronization for Acoustic Beamforming.” IEEE 19th International Workshop on Multimedia Signal Processing (MMSP), 2017.
LibreCat | Files available | Download (ext.)
 

2016 | Conference Paper | LibreCat-ID: 11751 | OA
Drude, Lukas, et al. “Blind Speech Separation Based on Complex Spherical K-Mode Clustering.” Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2016.
LibreCat | Files available | Download (ext.)
 

Filters and Search Terms

(person=40767)

status=public

Search

Filter Publications

Display / Sort

Citation Style: MLA

Export / Embed