LibreCat – Publication List Manager

333 Publications

2025 | Conference Paper | LibreCat-ID: 59900

Werning, Alexander, and Reinhold Häb-Umbach. “Distilling Efficient Audio Models Using Data Pruning with CLAP.” Proceedings of DAS|DAGA 2025, edited by Deutsche Gesellschaft für Akustik e.V. (DEGA), Berlin, 2025, doi:10.71568/DASDAGA2025.149.

LibreCat | DOI

2025 | Conference Paper | LibreCat-ID: 59999

Rautenberg, Frederik, et al. “Speech Synthesis along Perceptual Voice Quality Dimensions.” ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2025, doi:10.1109/icassp49660.2025.10888012.

LibreCat | DOI

2024 | Preprint | LibreCat-ID: 56273

Cornell, Samuele, et al. “The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization.” ArXiv:2407.16447, 2024.

LibreCat | Download (ext.) | arXiv

2024 | Conference Paper | LibreCat-ID: 57031

Gburrek, Tobias, et al. “Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation Models.” 2024 18th International Workshop on Acoustic Signal Enhancement (IWAENC), IEEE, 2024, doi:10.1109/iwaenc61483.2024.10694103.

LibreCat | Files available | DOI

2024 | Journal Article | LibreCat-ID: 52958

Boeddeker, Christoph, et al. “TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, Institute of Electrical and Electronics Engineers (IEEE), 2024, pp. 1185–97, doi:10.1109/taslp.2024.3350887.

LibreCat | Files available | DOI | Download (ext.)

2024 | Conference Paper | LibreCat-ID: 57085

Cord-Landwehr, Tobias, et al. “Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models.” ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024, doi:10.1109/ICASSP49660.2025.10888445.

LibreCat | DOI | Download (ext.)

2024 | Report | LibreCat-ID: 57161

Werning, Alexander, and Reinhold Haeb-Umbach. UPB-NT Submission to DCASE24: Dataset Pruning for Targeted Knowledge Distillation. 2024.

LibreCat

2024 | Conference Paper | LibreCat-ID: 57160

Werning, Alexander, and Reinhold Haeb-Umbach. “Target-Specific Dataset Pruning for Compression of Audio Tagging Models.” 32nd European Signal Processing Conference (EUSIPCO 2024), 2024.

LibreCat | Files available

2024 | Conference Paper | LibreCat-ID: 57099

Xie, Yuying, et al. “Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder.” 2024 32nd European Signal Processing Conference (EUSIPCO), 2024, pp. 436–40.

LibreCat

2024 | Conference Paper | LibreCat-ID: 56004

von Neumann, Thilo, et al. “Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization.” 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW), IEEE, 2024, doi:10.1109/icasspw62465.2024.10625894.

LibreCat | Files available | DOI

2024 | Conference Paper | LibreCat-ID: 53659

Cord-Landwehr, Tobias, et al. “Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios.” ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2024, doi:10.1109/icassp48485.2024.10445911.

LibreCat | DOI

2024 | Conference Paper | LibreCat-ID: 56272

Boeddeker, Christoph, et al. “Once More Diarization: Improving Meeting Transcription Systems through Segment-Level Speaker Reassignment.” Interspeech 2024, ISCA, 2024, doi:10.21437/interspeech.2024-1286.

LibreCat | DOI | Download (ext.)

2024 | Conference Paper | LibreCat-ID: 57659

Vieting, Peter, et al. “Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription.” 2024 IEEE Spoken Language Technology Workshop (SLT), 2024.

LibreCat | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 48269

Gburrek, Tobias, et al. “On the Integration of Sampling Rate Synchronization and Acoustic Beamforming.” European Signal Processing Conference (EUSIPCO), 2023.

LibreCat | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 48270

Schmalenstroeer, Joerg, et al. “LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices.” ITG Conference on Speech Communication, 2023.

LibreCat | Files available

2023 | Conference Paper | LibreCat-ID: 48355

Rautenberg, Frederik, et al. “On Feature Importance and Interpretability of Speaker Representations.” ITG Conference on Speech Communication, 2023.

LibreCat | Files available | Download (ext.) | arXiv

2023 | Conference Paper | LibreCat-ID: 48410

Wiechmann, Jana, et al. “Explaining Voice Characteristics to Novice Voice Practitioners-How Successful Is It?” 20th International Congress of the Phonetic Sciences (ICPhS), 2023.

LibreCat | Files available | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 48391

Aralikatti, Rohith, et al. “Reverberation as Supervision For Speech Separation.” ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023, doi:10.1109/icassp49357.2023.10095022.

LibreCat | DOI

2023 | Conference Paper | LibreCat-ID: 46069

Seebauer, Fritz, et al. “Re-Examining the Quality Dimensions of Synthetic Speech.” 12th Speech Synthesis Workshop (SSW) 2023, 2023.

LibreCat

2023 | Journal Article | LibreCat-ID: 35602

von Neumann, Thilo, et al. “Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, Institute of Electrical and Electronics Engineers (IEEE), 2023, pp. 576–89, doi:10.1109/taslp.2022.3228629.

LibreCat | Files available | DOI

2023 | Conference Paper | LibreCat-ID: 49109

Gburrek, Tobias, et al. “Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks.” Proc. Asilomar Conference on Signals, Systems, and Computers, 2023.

LibreCat | Files available

2023 | Conference Paper | LibreCat-ID: 44849

Rautenberg, Frederik, et al. “Speech Disentanglement for Analysis and Modification of Acoustic and Perceptual Speaker Characteristics.” Fortschritte Der Akustik - DAGA 2023, 2023, pp. 1409–12.

LibreCat | Files available | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 49111

Ebbers, Janek, et al. “Post-Processing Independent Evaluation of Sound Event Detection Systems.” Proceedings of the 8th Detection and Classification of Acoustic Scenes and Events 2023 Workshop (DCASE2023), 2023, pp. 36–40.

LibreCat | Files available

2023 | Conference Paper | LibreCat-ID: 57098

Seebauer, Fritz, et al. “Discerning Dimensions of Quality for State of the Art Synthetic Speech.” Proceedings of the 20th International Congress of Phonetic Sciences, 2023.

LibreCat

2023 | Conference Paper | LibreCat-ID: 57086

Kuhlmann, Michael, et al. “Investigating Speaker Embedding Disentanglement on Natural Read Speech.” Speech Communication; 15th ITG Conference, 2023, pp. 121–25.

LibreCat

2023 | Conference Paper | LibreCat-ID: 48281

von Neumann, Thilo, et al. “On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems.” ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023, doi:10.1109/icassp49357.2023.10094784.

LibreCat | Files available | DOI | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 48275

von Neumann, Thilo, et al. “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems.” Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments, 2023.

LibreCat | Files available | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 47128

Cord-Landwehr, Tobias, et al. “Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization.” ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023, doi:10.1109/icassp49357.2023.10095370.

LibreCat | Files available | DOI

2023 | Conference Paper | LibreCat-ID: 47129

Cord-Landwehr, Tobias, et al. “A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures.” INTERSPEECH 2023, ISCA, 2023, doi:10.21437/interspeech.2023-1379.

LibreCat | Files available | DOI

2023 | Conference Paper | LibreCat-ID: 54439

Boeddeker, Christoph, et al. “Multi-Stage Diarization Refinement for the CHiME-7 DASR Scenario.” 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023), ISCA, 2023, doi:10.21437/chime.2023-10.

LibreCat | DOI | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 48390

Berger, Simon, et al. “Mixture Encoder for Joint Speech Separation and Recognition.” INTERSPEECH 2023, ISCA, 2023, doi:10.21437/interspeech.2023-1815.

LibreCat | DOI | Download (ext.)

2022 | Journal Article | LibreCat-ID: 33669

Zhang, Wangyou, et al. “End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022, doi:10.1109/TASLP.2022.3209942.

LibreCat | Files available | DOI

2022 | Conference Paper | LibreCat-ID: 33471

Heitkämper, Jens, et al. “Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels.” Proceedings of the 30th European Signal Processing Conference (EUSIPCO), 2022.

LibreCat | Files available

2022 | Conference Paper | LibreCat-ID: 33806

Afifi, Haitham, et al. “Data-Driven Time Synchronization in Wireless Multimedia Networks.” 2022 International Wireless Communications and Mobile Computing (IWCMC), IEEE, 2022, doi:10.1109/iwcmc55113.2022.9824980.

LibreCat | DOI

2022 | Conference Paper | LibreCat-ID: 33847

Cord-Landwehr, Tobias, et al. “MMS-MSG: A Multi-Purpose Multi-Speaker Mixture Signal Generator.” 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.

LibreCat | Files available | arXiv

2022 | Conference Paper | LibreCat-ID: 33807

Gburrek, Tobias, et al. “On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes.” ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2022, doi:10.1109/icassp43922.2022.9746284.

LibreCat | Files available | DOI

2022 | Journal Article | LibreCat-ID: 33451

Grimm, Christopher, et al. “Warping of Radar Data Into Camera Image for Cross-Modal Supervision in Automotive Applications.” IEEE Transactions on Vehicular Technology, vol. 71, no. 9, 2022, pp. 9435–49, doi:10.1109/TVT.2022.3182411.

LibreCat | Files available | DOI

2022 | Conference Paper | LibreCat-ID: 33696

Wiechmann, Jana, et al. “Technically Enabled Explaining of Voice Characteristics.” 18. Phonetik Und Phonologie Im Deutschsprachigen Raum (P&P), 2022.

LibreCat | Files available

2022 | Conference Paper | LibreCat-ID: 33857

Kuhlmann, Michael, et al. “Investigation into Target Speaking Rate Adaptation for Voice Conversion.” Interspeech 2022, ISCA, 2022, doi:10.21437/interspeech.2022-10740.

LibreCat | Files available | DOI | Download (ext.)

2022 | Conference Paper | LibreCat-ID: 33808

Gburrek, Tobias, et al. “Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription.” 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), IEEE, 2022, doi:10.1109/IWAENC53105.2022.9914772.

LibreCat | Files available | DOI

2022 | Conference Paper | LibreCat-ID: 34072

Ebbers, Janek, et al. “Threshold Independent Evaluation of Sound Event Detection Scores.” Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.

LibreCat | Files available

2022 | Report | LibreCat-ID: 49113

Ebbers, Janek, and Reinhold Haeb-Umbach. Pre-Training And Self-Training For Sound Event Detection In Domestic Environments. 2022.

LibreCat | Files available

2022 | Conference Paper | LibreCat-ID: 33848

Cord-Landwehr, Tobias, et al. “Monaural Source Separation: From Anechoic to Reverberant Environments.” 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), IEEE, 2022.

LibreCat | Files available | arXiv

2022 | Conference Paper | LibreCat-ID: 33819

von Neumann, Thilo, et al. “SA-SDR: A Novel Loss Function for Separation of Meeting Style Data.” ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2022, doi:10.1109/icassp43922.2022.9746757.

LibreCat | Files available | DOI

2022 | Misc | LibreCat-ID: 33816

Gburrek, Tobias, et al. A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv, 2022, doi:10.48550/ARXIV.2205.00944.

LibreCat | Files available | DOI

2022 | Conference Paper | LibreCat-ID: 33954

Boeddeker, Christoph, et al. “An Initialization Scheme for Meeting Separation with Spatial Mixture Models.” Interspeech 2022, ISCA, 2022, doi:10.21437/interspeech.2022-10929.

LibreCat | DOI | Download (ext.)

2022 | Conference Paper | LibreCat-ID: 33958

Kinoshita, Keisuke, et al. “Utterance-by-Utterance Overlap-Aware Neural Diarization with Graph-PIT.” Proc. Interspeech 2022, ISCA, 2022, pp. 1486–90, doi:10.21437/Interspeech.2022-11408.

LibreCat | DOI | Download (ext.)

2021 | Journal Article | LibreCat-ID: 21065

Haeb-Umbach, Reinhold, et al. “Far-Field Automatic Speech Recognition.” Proceedings of the IEEE, vol. 109, no. 2, 2021, pp. 124–48, doi:10.1109/JPROC.2020.3018668.

LibreCat | Files available | DOI

2021 | Conference Paper | LibreCat-ID: 28256

Zhang, Wangyou, et al. “End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend.” ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, doi:10.1109/icassp39728.2021.9414464.

LibreCat | DOI

2021 | Conference Paper | LibreCat-ID: 28262

Li, Chenda, et al. “ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration.” 2021 IEEE Spoken Language Technology Workshop (SLT), 2021, doi:10.1109/slt48900.2021.9383615.

LibreCat | DOI

Publications at Paderborn University
