LibreCat – Publication List Manager

333 Publications

2025 | Conference Paper | LibreCat-ID: 59900

Werning, Alexander, and Reinhold Häb-Umbach. “Distilling Efficient Audio Models Using Data Pruning with CLAP.” In Proceedings of DAS|DAGA 2025, edited by Deutsche Gesellschaft für Akustik e.V. (DEGA), Berlin, 2025. Copenhagen, 2025. https://doi.org/10.71568/DASDAGA2025.149.

2025 | Conference Paper | LibreCat-ID: 59999

Rautenberg, Frederik, Michael Kuhlmann, Fritz Seebauer, Jana Wiechmann, Petra Wagner, and Reinhold Haeb-Umbach. “Speech Synthesis along Perceptual Voice Quality Dimensions.” In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2025. https://doi.org/10.1109/icassp49660.2025.10888012.

2024 | Preprint | LibreCat-ID: 56273

Cornell, Samuele, Taejin Park, Steve Huang, Christoph Boeddeker, Xuankai Chang, Matthew Maciejewski, Matthew Wiesner, Paola Garcia, and Shinji Watanabe. “The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization.” ArXiv:2407.16447, 2024.

2024 | Conference Paper | LibreCat-ID: 57031

Gburrek, Tobias, Adrian Meise, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation Models.” In 2024 18th International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE, 2024. https://doi.org/10.1109/iwaenc61483.2024.10694103.

2024 | Journal Article | LibreCat-ID: 52958

Boeddeker, Christoph, Aswin Shanmugam Subramanian, Gordon Wichern, Reinhold Haeb-Umbach, and Jonathan Le Roux. “TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings.” IEEE/ACM Transactions on Audio, Speech, and Language Processing 32 (2024): 1185–97. https://doi.org/10.1109/taslp.2024.3350887.

2024 | Conference Paper | LibreCat-ID: 57085

Cord-Landwehr, Tobias, Christoph Boeddeker, and Reinhold Haeb-Umbach. “Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models.” In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024. https://doi.org/10.1109/ICASSP49660.2025.10888445.

2024 | Report | LibreCat-ID: 57161

Werning, Alexander, and Reinhold Haeb-Umbach. UPB-NT Submission to DCASE24: Dataset Pruning for Targeted Knowledge Distillation, 2024.

2024 | Conference Paper | LibreCat-ID: 57160

Werning, Alexander, and Reinhold Haeb-Umbach. “Target-Specific Dataset Pruning for Compression of Audio Tagging Models.” In 32nd European Signal Processing Conference (EUSIPCO 2024), 2024.

2024 | Conference Paper | LibreCat-ID: 57099

Xie, Yuying, Michael Kuhlmann, Frederik Rautenberg, Zheng-Hua Tan, and Reinhold Häb-Umbach. “Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder.” In 2024 32nd European Signal Processing Conference (EUSIPCO), 436–40, 2024.

2024 | Conference Paper | LibreCat-ID: 56004

Neumann, Thilo von, Christoph Boeddeker, Tobias Cord-Landwehr, Marc Delcroix, and Reinhold Haeb-Umbach. “Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization.” In 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW). IEEE, 2024. https://doi.org/10.1109/icasspw62465.2024.10625894.

2024 | Conference Paper | LibreCat-ID: 53659

Cord-Landwehr, Tobias, Christoph Boeddeker, Cătălin Zorilă, Rama Doddipatla, and Reinhold Haeb-Umbach. “Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios.” In ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2024. https://doi.org/10.1109/icassp48485.2024.10445911.

2024 | Conference Paper | LibreCat-ID: 56272

Boeddeker, Christoph, Tobias Cord-Landwehr, and Reinhold Haeb-Umbach. “Once More Diarization: Improving Meeting Transcription Systems through Segment-Level Speaker Reassignment.” In Interspeech 2024. ISCA, 2024. https://doi.org/10.21437/interspeech.2024-1286.

2024 | Conference Paper | LibreCat-ID: 57659

Vieting, Peter, Simon Berger, Thilo von Neumann, Christoph Boeddeker, Ralf Schlüter, and Reinhold Haeb-Umbach. “Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription.” In 2024 IEEE Spoken Language Technology Workshop (SLT), 2024.

2023 | Conference Paper | LibreCat-ID: 48269

Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “On the Integration of Sampling Rate Synchronization and Acoustic Beamforming.” In European Signal Processing Conference (EUSIPCO), 2023.

2023 | Conference Paper | LibreCat-ID: 48270

Schmalenstroeer, Joerg, Tobias Gburrek, and Reinhold Haeb-Umbach. “LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices.” In ITG Conference on Speech Communication, 2023.

2023 | Conference Paper | LibreCat-ID: 48355

Rautenberg, Frederik, Michael Kuhlmann, Jana Wiechmann, Fritz Seebauer, Petra Wagner, and Reinhold Haeb-Umbach. “On Feature Importance and Interpretability of Speaker Representations.” In ITG Conference on Speech Communication, 2023.

2023 | Conference Paper | LibreCat-ID: 48410

Wiechmann, Jana, Frederik Rautenberg, Petra Wagner, and Reinhold Haeb-Umbach. “Explaining Voice Characteristics to Novice Voice Practitioners-How Successful Is It?” In 20th International Congress of the Phonetic Sciences (ICPhS), 2023.

2023 | Conference Paper | LibreCat-ID: 48391

Aralikatti, Rohith, Christoph Boeddeker, Gordon Wichern, Aswin Subramanian, and Jonathan Le Roux. “Reverberation as Supervision For Speech Separation.” In ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023. https://doi.org/10.1109/icassp49357.2023.10095022.

2023 | Conference Paper | LibreCat-ID: 46069

Seebauer, Fritz, Michael Kuhlmann, Reinhold Haeb-Umbach, and Petra Wagner. “Re-Examining the Quality Dimensions of Synthetic Speech.” In 12th Speech Synthesis Workshop (SSW) 2023, 2023.

2023 | Journal Article | LibreCat-ID: 35602

Neumann, Thilo von, Keisuke Kinoshita, Christoph Boeddeker, Marc Delcroix, and Reinhold Haeb-Umbach. “Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria.” IEEE/ACM Transactions on Audio, Speech, and Language Processing 31 (2023): 576–89. https://doi.org/10.1109/taslp.2022.3228629.

2023 | Conference Paper | LibreCat-ID: 49109

Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks.” In Proc. Asilomar Conference on Signals, Systems, and Computers, 2023.

2023 | Conference Paper | LibreCat-ID: 44849

Rautenberg, Frederik, Michael Kuhlmann, Janek Ebbers, Jana Wiechmann, Fritz Seebauer, Petra Wagner, and Reinhold Haeb-Umbach. “Speech Disentanglement for Analysis and Modification of Acoustic and Perceptual Speaker Characteristics.” In Fortschritte Der Akustik - DAGA 2023, 1409–12, 2023.

2023 | Conference Paper | LibreCat-ID: 49111

Ebbers, Janek, Reinhold Haeb-Umbach, and Romain Serizel. “Post-Processing Independent Evaluation of Sound Event Detection Systems.” In Proceedings of the 8th Detection and Classification of Acoustic Scenes and Events 2023 Workshop (DCASE2023), 36–40. Tampere, Finland, 2023.

2023 | Conference Paper | LibreCat-ID: 57098

Seebauer, Fritz, Michael Kuhlmann, Reinhold Häb-Umbach, and Petra Wagner. “Discerning Dimensions of Quality for State of the Art Synthetic Speech.” In Proceedings of the 20th International Congress of Phonetic Sciences, 2023.

2023 | Conference Paper | LibreCat-ID: 57086

Kuhlmann, Michael, Adrian Meise, Fritz Seebauer, Petra Wagner, and Reinhold Häb-Umbach. “Investigating Speaker Embedding Disentanglement on Natural Read Speech.” In Speech Communication; 15th ITG Conference, 121–25, 2023.

2023 | Conference Paper | LibreCat-ID: 48281

Neumann, Thilo von, Christoph Boeddeker, Keisuke Kinoshita, Marc Delcroix, and Reinhold Haeb-Umbach. “On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems.” In ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023. https://doi.org/10.1109/icassp49357.2023.10094784.

2023 | Conference Paper | LibreCat-ID: 48275

Neumann, Thilo von, Christoph Boeddeker, Marc Delcroix, and Reinhold Haeb-Umbach. “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems.” In Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments, 2023.

2023 | Conference Paper | LibreCat-ID: 47128

Cord-Landwehr, Tobias, Christoph Boeddeker, Cătălin Zorilă, Rama Doddipatla, and Reinhold Haeb-Umbach. “Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization.” In ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023. https://doi.org/10.1109/icassp49357.2023.10095370.

2023 | Conference Paper | LibreCat-ID: 47129

Cord-Landwehr, Tobias, Christoph Boeddeker, Cătălin Zorilă, Rama Doddipatla, and Reinhold Haeb-Umbach. “A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures.” In INTERSPEECH 2023. ISCA, 2023. https://doi.org/10.21437/interspeech.2023-1379.

2023 | Conference Paper | LibreCat-ID: 54439

Boeddeker, Christoph, Tobias Cord-Landwehr, Thilo von Neumann, and Reinhold Haeb-Umbach. “Multi-Stage Diarization Refinement for the CHiME-7 DASR Scenario.” In 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023). ISCA, 2023. https://doi.org/10.21437/chime.2023-10.

2023 | Conference Paper | LibreCat-ID: 48390

Berger, Simon, Peter Vieting, Christoph Boeddeker, Ralf Schlüter, and Reinhold Haeb-Umbach. “Mixture Encoder for Joint Speech Separation and Recognition.” In INTERSPEECH 2023. ISCA, 2023. https://doi.org/10.21437/interspeech.2023-1815.

2022 | Journal Article | LibreCat-ID: 33669

Zhang, Wangyou, Xuankai Chang, Christoph Boeddeker, Tomohiro Nakatani, Shinji Watanabe, and Yanmin Qian. “End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022. https://doi.org/10.1109/TASLP.2022.3209942.

2022 | Conference Paper | LibreCat-ID: 33471

Heitkämper, Jens, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels.” In Proceedings of the 30th European Signal Processing Conference (EUSIPCO). Belgrade, 2022.

2022 | Conference Paper | LibreCat-ID: 33806

Afifi, Haitham, Holger Karl, Tobias Gburrek, and Joerg Schmalenstroeer. “Data-Driven Time Synchronization in Wireless Multimedia Networks.” In 2022 International Wireless Communications and Mobile Computing (IWCMC). IEEE, 2022. https://doi.org/10.1109/iwcmc55113.2022.9824980.

2022 | Conference Paper | LibreCat-ID: 33847

Cord-Landwehr, Tobias, Thilo von Neumann, Christoph Boeddeker, and Reinhold Haeb-Umbach. “MMS-MSG: A Multi-Purpose Multi-Speaker Mixture Signal Generator.” In 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.

2022 | Conference Paper | LibreCat-ID: 33807

Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes.” In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022. https://doi.org/10.1109/icassp43922.2022.9746284.

2022 | Journal Article | LibreCat-ID: 33451

Grimm, Christopher, Tai Fei, Ernst Warsitz, Ridha Farhoud, Tobias Breddermann, and Reinhold Haeb-Umbach. “Warping of Radar Data Into Camera Image for Cross-Modal Supervision in Automotive Applications.” IEEE Transactions on Vehicular Technology 71, no. 9 (2022): 9435–49. https://doi.org/10.1109/TVT.2022.3182411.

2022 | Conference Paper | LibreCat-ID: 33696

Wiechmann, Jana, Thomas Glarner, Frederik Rautenberg, Petra Wagner, and Reinhold Haeb-Umbach. “Technically Enabled Explaining of Voice Characteristics.” In 18. Phonetik Und Phonologie Im Deutschsprachigen Raum (P&P), 2022.

2022 | Conference Paper | LibreCat-ID: 33857

Kuhlmann, Michael, Fritz Seebauer, Janek Ebbers, Petra Wagner, and Reinhold Haeb-Umbach. “Investigation into Target Speaking Rate Adaptation for Voice Conversion.” In Interspeech 2022. ISCA, 2022. https://doi.org/10.21437/interspeech.2022-10740.

2022 | Conference Paper | LibreCat-ID: 33808

Gburrek, Tobias, Joerg Schmalenstroeer, Jens Heitkaemper, and Reinhold Haeb-Umbach. “Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription.” In 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE, 2022. https://doi.org/10.1109/IWAENC53105.2022.9914772.

2022 | Conference Paper | LibreCat-ID: 34072

Ebbers, Janek, Reinhold Haeb-Umbach, and Romain Serizel. “Threshold Independent Evaluation of Sound Event Detection Scores.” In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.

2022 | Report | LibreCat-ID: 49113

Ebbers, Janek, and Reinhold Haeb-Umbach. Pre-Training And Self-Training For Sound Event Detection In Domestic Environments, 2022.

2022 | Conference Paper | LibreCat-ID: 33848

Cord-Landwehr, Tobias, Christoph Boeddeker, Thilo von Neumann, Catalin Zorila, Rama Doddipatla, and Reinhold Haeb-Umbach. “Monaural Source Separation: From Anechoic to Reverberant Environments.” In 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). Bamberg: IEEE, 2022.

2022 | Conference Paper | LibreCat-ID: 33819

Neumann, Thilo von, Keisuke Kinoshita, Christoph Boeddeker, Marc Delcroix, and Reinhold Haeb-Umbach. “SA-SDR: A Novel Loss Function for Separation of Meeting Style Data.” In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022. https://doi.org/10.1109/icassp43922.2022.9746757.

2022 | Misc | LibreCat-ID: 33816

Gburrek, Tobias, Christoph Boeddeker, Thilo von Neumann, Tobias Cord-Landwehr, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv, 2022. https://doi.org/10.48550/ARXIV.2205.00944.

2022 | Conference Paper | LibreCat-ID: 33954

Boeddeker, Christoph, Tobias Cord-Landwehr, Thilo von Neumann, and Reinhold Haeb-Umbach. “An Initialization Scheme for Meeting Separation with Spatial Mixture Models.” In Interspeech 2022. ISCA, 2022. https://doi.org/10.21437/interspeech.2022-10929.

2022 | Conference Paper | LibreCat-ID: 33958

Kinoshita, Keisuke, Thilo von Neumann, Marc Delcroix, Christoph Boeddeker, and Reinhold Haeb-Umbach. “Utterance-by-Utterance Overlap-Aware Neural Diarization with Graph-PIT.” In Proc. Interspeech 2022, 1486–90. ISCA, 2022. https://doi.org/10.21437/Interspeech.2022-11408.

2021 | Journal Article | LibreCat-ID: 21065

Haeb-Umbach, Reinhold, Jahn Heymann, Lukas Drude, Shinji Watanabe, Marc Delcroix, and Tomohiro Nakatani. “Far-Field Automatic Speech Recognition.” Proceedings of the IEEE 109, no. 2 (2021): 124–48. https://doi.org/10.1109/JPROC.2020.3018668.

2021 | Conference Paper | LibreCat-ID: 28256

Zhang, Wangyou, Christoph Boeddeker, Shinji Watanabe, Tomohiro Nakatani, Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai, Naoyuki Kamo, Reinhold Haeb-Umbach, and Yanmin Qian. “End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend.” In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021. https://doi.org/10.1109/icassp39728.2021.9414464.

2021 | Conference Paper | LibreCat-ID: 28262

Li, Chenda, Jing Shi, Wangyou Zhang, Aswin Shanmugam Subramanian, Xuankai Chang, Naoyuki Kamo, Moto Hira, et al. “ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration.” In 2021 IEEE Spoken Language Technology Workshop (SLT), 2021. https://doi.org/10.1109/slt48900.2021.9383615.

Publications at Paderborn University
