LibreCat – Publication List Manager

42 Publications

2024 | Journal Article | LibreCat-ID: 52958

Boeddeker, Christoph, et al. “TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, Institute of Electrical and Electronics Engineers (IEEE), 2024, pp. 1185–97, doi:10.1109/taslp.2024.3350887.

2024 | Conference Paper | LibreCat-ID: 53659

Cord-Landwehr, Tobias, et al. “Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios.” ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2024, doi:10.1109/icassp48485.2024.10445911.

2023 | Conference Paper | LibreCat-ID: 47128

Cord-Landwehr, Tobias, et al. “Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization.” ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023, doi:10.1109/icassp49357.2023.10095370.

2023 | Conference Paper | LibreCat-ID: 47129

Cord-Landwehr, Tobias, et al. “A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures.” INTERSPEECH 2023, ISCA, 2023, doi:10.21437/interspeech.2023-1379.

2023 | Conference Paper | LibreCat-ID: 48391

Aralikatti, Rohith, et al. “Reverberation as Supervision For Speech Separation.” ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023, doi:10.1109/icassp49357.2023.10095022.

2023 | Conference Paper | LibreCat-ID: 48390

Berger, Simon, et al. “Mixture Encoder for Joint Speech Separation and Recognition.” INTERSPEECH 2023, ISCA, 2023, doi:10.21437/interspeech.2023-1815.

2023 | Journal Article | LibreCat-ID: 35602

von Neumann, Thilo, et al. “Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, Institute of Electrical and Electronics Engineers (IEEE), 2023, pp. 576–89, doi:10.1109/taslp.2022.3228629.

2023 | Conference Paper | LibreCat-ID: 48281

von Neumann, Thilo, et al. “On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems.” ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023, doi:10.1109/icassp49357.2023.10094784.

2023 | Conference Paper | LibreCat-ID: 48275

von Neumann, Thilo, et al. “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems.” Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments, 2023.

2022 | Journal Article | LibreCat-ID: 33669

Zhang, Wangyou, et al. “End-to-End Dereverberation, Beamforming, and Speech Recognition in a Cocktail Party.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022, doi:10.1109/TASLP.2022.3209942.

2022 | Conference Paper | LibreCat-ID: 33954

Boeddeker, Christoph, et al. “An Initialization Scheme for Meeting Separation with Spatial Mixture Models.” Interspeech 2022, ISCA, 2022, doi:10.21437/interspeech.2022-10929.

2022 | Conference Paper | LibreCat-ID: 33958

Kinoshita, Keisuke, et al. “Utterance-by-Utterance Overlap-Aware Neural Diarization with Graph-PIT.” Proc. Interspeech 2022, ISCA, 2022, pp. 1486–90, doi:10.21437/Interspeech.2022-11408.

2022 | Conference Paper | LibreCat-ID: 33819

von Neumann, Thilo, et al. “SA-SDR: A Novel Loss Function for Separation of Meeting Style Data.” ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2022, doi:10.1109/icassp43922.2022.9746757.

2022 | Conference Paper | LibreCat-ID: 33847

Cord-Landwehr, Tobias, et al. “MMS-MSG: A Multi-Purpose Multi-Speaker Mixture Signal Generator.” 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.

2022 | Conference Paper | LibreCat-ID: 33848

Cord-Landwehr, Tobias, et al. “Monaural Source Separation: From Anechoic to Reverberant Environments.” 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), IEEE, 2022.

2022 | Misc | LibreCat-ID: 33816

Gburrek, Tobias, et al. A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv, 2022, doi:10.48550/ARXIV.2205.00944.

2021 | Conference Paper | LibreCat-ID: 28256

Zhang, Wangyou, et al. “End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend.” ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, doi:10.1109/icassp39728.2021.9414464.

2021 | Conference Paper | LibreCat-ID: 28262

Li, Chenda, et al. “ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration.” 2021 IEEE Spoken Language Technology Workshop (SLT), 2021, doi:10.1109/slt48900.2021.9383615.

2021 | Conference Paper | LibreCat-ID: 28261

Li, Chenda, et al. “Dual-Path RNN for Long Recording Speech Separation.” 2021 IEEE Spoken Language Technology Workshop (SLT), 2021, doi:10.1109/slt48900.2021.9383514.

2021 | Conference Paper | LibreCat-ID: 44843

Boeddeker, Christoph, et al. “A Comparison and Combination of Unsupervised Blind Source Separation Techniques.” ITG Conference on Speech Communication, 2021.

2021 | Conference Paper | LibreCat-ID: 28259

Boeddeker, Christoph, et al. “Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation.” ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, doi:10.1109/icassp39728.2021.9414661.

2021 | Conference Paper | LibreCat-ID: 26770

von Neumann, Thilo, et al. “Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers.” Interspeech 2021, 2021, doi:10.21437/interspeech.2021-1177.

2021 | Conference Paper | LibreCat-ID: 29173

von Neumann, Thilo, et al. “Speeding Up Permutation Invariant Training for Source Separation.” Speech Communication; 14th ITG Conference, 2021.

2020 | Conference Paper | LibreCat-ID: 20700

Boeddeker, Christoph, et al. “Towards a Speaker Diarization System for the CHiME 2020 Dinner Party Transcription.” Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments, 2020.

2020 | Journal Article | LibreCat-ID: 17598

Nakatani, Tomohiro, et al. “Jointly Optimal Denoising, Dereverberation, and Source Separation.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020, pp. 1–1, doi:10.1109/TASLP.2020.3013118.

2020 | Conference Paper | LibreCat-ID: 20504

Heitkaemper, Jens, et al. “Demystifying TasNet: A Dissecting Approach.” ICASSP 2020 Virtual Barcelona Spain, 2020.

2020 | Preprint | LibreCat-ID: 28263

Watanabe, Shinji, et al. “CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings.” ArXiv:2004.09249, 2020.

2020 | Conference Paper | LibreCat-ID: 20762

von Neumann, Thilo, et al. “End-to-End Training of Time Domain Audio Separation and Recognition.” ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp. 7004–08, doi:10.1109/ICASSP40776.2020.9053461.

2020 | Conference Paper | LibreCat-ID: 20764

von Neumann, Thilo, et al. “Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR.” Proc. Interspeech 2020, 2020, pp. 3097–101, doi:10.21437/Interspeech.2020-2519.

2019 | Journal Article | LibreCat-ID: 19446

Drude, Lukas, et al. “SMS-WSJ: Database, Performance Measures, and Baseline Recipe for Multi-Channel Source Separation and Recognition.” ArXiv E-Prints, 2019.

2019 | Conference Paper | LibreCat-ID: 15816

Zorila, Catalin, et al. “An Investigation Into the Effectiveness of Enhancement in ASR Training and Test for CHiME-5 Dinner Party Transcription.” ASRU 2019, Sentosa, Singapore, 2019.

2019 | Conference Paper | LibreCat-ID: 14826

Kanda, Naoyuki, et al. “Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR.” INTERSPEECH 2019, Graz, Austria, 2019.

2018 | Conference Paper | LibreCat-ID: 11872

Drude, Lukas, et al. “Integrating Neural Network Based Beamforming and Weighted Prediction Error Dereverberation.” INTERSPEECH 2018, Hyderabad, India, 2018.

2018 | Conference Paper | LibreCat-ID: 11873

Drude, Lukas, et al. “NARA-WPE: A Python Package for Weighted Prediction Error Dereverberation in Numpy and Tensorflow for Online and Offline Processing.” ITG 2018, Oldenburg, Germany, 2018.

2018 | Conference Paper | LibreCat-ID: 12901

Boeddeker, Christoph, et al. “Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition.” ICASSP 2018, Calgary, Canada, 2018.

2018 | Conference Paper | LibreCat-ID: 12899

Boeddeker, Christoph, et al. “Front-End Processing for the CHiME-5 Dinner Party Scenario.” Proc. CHiME 2018 Workshop on Speech Processing in Everyday Environments, Hyderabad, India, 2018.

2018 | Conference Paper | LibreCat-ID: 11876

Kitza, Markus, et al. “The RWTH/UPB System Combination for the CHiME 2018 Workshop.” Proc. CHiME 2018 Workshop on Speech Processing in Everyday Environments, Hyderabad, India, 2018.

2017 | Report | LibreCat-ID: 11735

Boeddeker, Christoph, et al. On the Computation of Complex-Valued Gradients with Application to Statistically Optimum Beamforming. 2017.

2017 | Conference Paper | LibreCat-ID: 11736

Boeddeker, Christoph, et al. “Optimizing Neural-Network Supported Acoustic Beamforming by Algorithmic Differentiation.” Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2017.

2017 | Conference Paper | LibreCat-ID: 11809

Heymann, Jahn, et al. “BEAMNET: End-to-End Training of a Beamformer-Supported Multi-Channel ASR System.” Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2017.

2017 | Conference Paper | LibreCat-ID: 11895

Schmalenstroeer, Joerg, et al. “Multi-Stage Coherence Drift Based Sampling Rate Synchronization for Acoustic Beamforming.” IEEE 19th International Workshop on Multimedia Signal Processing (MMSP), 2017.

2016 | Conference Paper | LibreCat-ID: 11751

Drude, Lukas, et al. “Blind Speech Separation Based on Complex Spherical K-Mode Clustering.” Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2016.

Publications at Paderborn University
