LibreCat – Publication List Manager

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

333 Publications

2025 | Conference Paper | LibreCat-ID: 59900

Werning A, Häb-Umbach R. Distilling Efficient Audio Models using Data Pruning with CLAP. In: Deutsche Gesellschaft für Akustik e.V. (DEGA), Berlin, 2025, ed. Proceedings of DAS|DAGA 2025. ; 2025. doi:10.71568/DASDAGA2025.149

LibreCat | DOI

2025 | Conference Paper | LibreCat-ID: 59999

Rautenberg F, Kuhlmann M, Seebauer F, Wiechmann J, Wagner P, Haeb-Umbach R. Speech Synthesis along Perceptual Voice Quality Dimensions. In: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2025. doi:10.1109/icassp49660.2025.10888012

LibreCat | DOI

2024 | Preprint | LibreCat-ID: 56273 |

Cornell S, Park T, Huang S, et al. The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization. arXiv:240716447. Published online 2024.

LibreCat | Download (ext.) | arXiv

2024 | Conference Paper | LibreCat-ID: 57031 |

Gburrek T, Meise A, Schmalenstroeer J, Haeb-Umbach R. Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation Models. In: 2024 18th International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE; 2024. doi:10.1109/iwaenc61483.2024.10694103

LibreCat | Files available | DOI

2024 | Journal Article | LibreCat-ID: 52958 |

Boeddeker C, Subramanian AS, Wichern G, Haeb-Umbach R, Le Roux J. TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2024;32:1185-1197. doi:10.1109/taslp.2024.3350887

LibreCat | Files available | DOI | Download (ext.)

2024 | Conference Paper | LibreCat-ID: 57085 |

Cord-Landwehr T, Boeddeker C, Haeb-Umbach R. Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models. In: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2024. doi:10.1109/ICASSP49660.2025.10888445

LibreCat | DOI | Download (ext.)

2024 | Report | LibreCat-ID: 57161

Werning A, Haeb-Umbach R. UPB-NT Submission to DCASE24: Dataset Pruning for Targeted Knowledge Distillation.; 2024.

LibreCat

2024 | Conference Paper | LibreCat-ID: 57160

Werning A, Haeb-Umbach R. Target-Specific Dataset Pruning for Compression of Audio Tagging Models. In: 32nd European Signal Processing Conference (EUSIPCO 2024). ; 2024.

LibreCat | Files available

2024 | Conference Paper | LibreCat-ID: 57099

Xie Y, Kuhlmann M, Rautenberg F, Tan Z-H, Häb-Umbach R. Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder. In: 2024 32nd European Signal Processing Conference (EUSIPCO). ; 2024:436–440.

LibreCat

2024 | Conference Paper | LibreCat-ID: 56004 |

von Neumann T, Boeddeker C, Cord-Landwehr T, Delcroix M, Haeb-Umbach R. Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization. In: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW). IEEE; 2024. doi:10.1109/icasspw62465.2024.10625894

LibreCat | Files available | DOI

2024 | Conference Paper | LibreCat-ID: 53659

Cord-Landwehr T, Boeddeker C, Zorilă C, Doddipatla R, Haeb-Umbach R. Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2024. doi:10.1109/icassp48485.2024.10445911

LibreCat | DOI

2024 | Conference Paper | LibreCat-ID: 56272 |

Boeddeker C, Cord-Landwehr T, Haeb-Umbach R. Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment. In: Interspeech 2024. ISCA; 2024. doi:10.21437/interspeech.2024-1286

LibreCat | DOI | Download (ext.)

2024 | Conference Paper | LibreCat-ID: 57659 |

Vieting P, Berger S, von Neumann T, Boeddeker C, Schlüter R, Haeb-Umbach R. Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription. In: 2024 IEEE Spoken Language Technology Workshop (SLT). ; 2024.

LibreCat | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 48269 |

Gburrek T, Schmalenstroeer J, Haeb-Umbach R. On the Integration of Sampling Rate Synchronization and Acoustic Beamforming. In: European Signal Processing Conference (EUSIPCO). ; 2023.

LibreCat | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 48270 |

Schmalenstroeer J, Gburrek T, Haeb-Umbach R. LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices. In: ITG Conference on Speech Communication. ; 2023.

LibreCat | Files available

2023 | Conference Paper | LibreCat-ID: 48355 |

Rautenberg F, Kuhlmann M, Wiechmann J, Seebauer F, Wagner P, Haeb-Umbach R. On Feature Importance and Interpretability of Speaker Representations. In: ITG Conference on Speech Communication. ; 2023.

LibreCat | Files available | Download (ext.) | arXiv

2023 | Conference Paper | LibreCat-ID: 48410 |

Wiechmann J, Rautenberg F, Wagner P, Haeb-Umbach R. Explaining voice characteristics to novice voice practitioners-How successful is it? In: 20th International Congress of the Phonetic Sciences (ICPhS) . ; 2023.

LibreCat | Files available | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 48391

Aralikatti R, Boeddeker C, Wichern G, Subramanian A, Le Roux J. Reverberation as Supervision For Speech Separation. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2023. doi:10.1109/icassp49357.2023.10095022

LibreCat | DOI

2023 | Conference Paper | LibreCat-ID: 46069

Seebauer F, Kuhlmann M, Haeb-Umbach R, Wagner P. Re-examining the quality dimensions of synthetic speech. In: 12th Speech Synthesis Workshop (SSW) 2023. ; 2023.

LibreCat

2023 | Journal Article | LibreCat-ID: 35602 |

von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2023;31:576-589. doi:10.1109/taslp.2022.3228629

LibreCat | Files available | DOI

2023 | Conference Paper | LibreCat-ID: 49109 |

Gburrek T, Schmalenstroeer J, Haeb-Umbach R. Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks. In: Proc. Asilomar Conference on Signals, Systems, and Computers. ; 2023.

LibreCat | Files available

2023 | Conference Paper | LibreCat-ID: 44849 |

Rautenberg F, Kuhlmann M, Ebbers J, et al. Speech Disentanglement for Analysis and Modification of Acoustic and Perceptual Speaker Characteristics. In: Fortschritte Der Akustik - DAGA 2023. ; 2023:1409-1412.

LibreCat | Files available | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 49111

Ebbers J, Haeb-Umbach R, Serizel R. Post-Processing Independent Evaluation of Sound Event Detection Systems. In: Proceedings of the 8th Detection and Classification of Acoustic Scenes and Events 2023 Workshop (DCASE2023). ; 2023:36–40.

LibreCat | Files available

2023 | Conference Paper | LibreCat-ID: 57098

Seebauer F, Kuhlmann M, Häb-Umbach R, Wagner P. DISCERNING DIMENSIONS OF QUALITY FOR STATE OF THE ART SYNTHETIC SPEECH. In: Proceedings of the 20th International Congress of Phonetic Sciences. ; 2023.

LibreCat

2023 | Conference Paper | LibreCat-ID: 57086

Kuhlmann M, Meise A, Seebauer F, Wagner P, Häb-Umbach R. Investigating Speaker Embedding Disentanglement on Natural Read Speech. In: Speech Communication; 15th ITG Conference. ; 2023:121–125.

LibreCat

2023 | Conference Paper | LibreCat-ID: 48281 |

von Neumann T, Boeddeker C, Kinoshita K, Delcroix M, Haeb-Umbach R. On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2023. doi:10.1109/icassp49357.2023.10094784

LibreCat | Files available | DOI | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 48275 |

von Neumann T, Boeddeker C, Delcroix M, Haeb-Umbach R. MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. In: Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments. ; 2023.

LibreCat | Files available | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 47128 |

Cord-Landwehr T, Boeddeker C, Zorilă C, Doddipatla R, Haeb-Umbach R. Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2023. doi:10.1109/icassp49357.2023.10095370

LibreCat | Files available | DOI

2023 | Conference Paper | LibreCat-ID: 47129 |

Cord-Landwehr T, Boeddeker C, Zorilă C, Doddipatla R, Haeb-Umbach R. A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures. In: INTERSPEECH 2023. ISCA; 2023. doi:10.21437/interspeech.2023-1379

LibreCat | Files available | DOI

2023 | Conference Paper | LibreCat-ID: 54439 |

Boeddeker C, Cord-Landwehr T, von Neumann T, Haeb-Umbach R. Multi-stage diarization refinement for the CHiME-7 DASR scenario. In: 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023). ISCA; 2023. doi:10.21437/chime.2023-10

LibreCat | DOI | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 48390 |

Berger S, Vieting P, Boeddeker C, Schlüter R, Haeb-Umbach R. Mixture Encoder for Joint Speech Separation and Recognition. In: INTERSPEECH 2023. ISCA; 2023. doi:10.21437/interspeech.2023-1815

LibreCat | DOI | Download (ext.)

2022 | Journal Article | LibreCat-ID: 33669 |

Zhang W, Chang X, Boeddeker C, Nakatani T, Watanabe S, Qian Y. End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party. IEEE/ACM Transactions on Audio, Speech, and Language Processing. Published online 2022. doi:10.1109/TASLP.2022.3209942

LibreCat | Files available | DOI

2022 | Conference Paper | LibreCat-ID: 33471

Heitkämper J, Schmalenstroeer J, Haeb-Umbach R. Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels. In: Proceedings of the 30th European Signal Processing Conference (EUSIPCO).

LibreCat | Files available

2022 | Conference Paper | LibreCat-ID: 33806

Afifi H, Karl H, Gburrek T, Schmalenstroeer J. Data-driven Time Synchronization in Wireless Multimedia Networks. In: 2022 International Wireless Communications and Mobile Computing (IWCMC). IEEE; 2022. doi:10.1109/iwcmc55113.2022.9824980

LibreCat | DOI

2022 | Conference Paper | LibreCat-ID: 33847 |

Cord-Landwehr T, von Neumann T, Boeddeker C, Haeb-Umbach R. MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). ; 2022.

LibreCat | Files available | arXiv

2022 | Conference Paper | LibreCat-ID: 33807 |

Gburrek T, Schmalenstroeer J, Haeb-Umbach R. On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes. In: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2022. doi:10.1109/icassp43922.2022.9746284

LibreCat | Files available | DOI

2022 | Journal Article | LibreCat-ID: 33451 |

Grimm C, Fei T, Warsitz E, Farhoud R, Breddermann T, Haeb-Umbach R. Warping of Radar Data Into Camera Image for Cross-Modal Supervision in Automotive Applications. IEEE Transactions on Vehicular Technology. 2022;71(9):9435-9449. doi:10.1109/TVT.2022.3182411

LibreCat | Files available | DOI

2022 | Conference Paper | LibreCat-ID: 33696 |

Wiechmann J, Glarner T, Rautenberg F, Wagner P, Haeb-Umbach R. Technically enabled explaining of voice characteristics. In: 18. Phonetik Und Phonologie Im Deutschsprachigen Raum (P&P). ; 2022.

LibreCat | Files available

2022 | Conference Paper | LibreCat-ID: 33857 |

Kuhlmann M, Seebauer F, Ebbers J, Wagner P, Haeb-Umbach R. Investigation into Target Speaking Rate Adaptation for Voice Conversion. In: Interspeech 2022. ISCA; 2022. doi:10.21437/interspeech.2022-10740

LibreCat | Files available | DOI | Download (ext.)

2022 | Conference Paper | LibreCat-ID: 33808 |

Gburrek T, Schmalenstroeer J, Heitkaemper J, Haeb-Umbach R. Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE; 2022. doi:10.1109/IWAENC53105.2022.9914772

LibreCat | Files available | DOI

2022 | Conference Paper | LibreCat-ID: 34072 |

Ebbers J, Haeb-Umbach R, Serizel R. Threshold Independent Evaluation of Sound Event Detection Scores. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2022.

LibreCat | Files available

2022 | Report | LibreCat-ID: 49113

Ebbers J, Haeb-Umbach R. Pre-Training And Self-Training For Sound Event Detection In Domestic Environments.; 2022.

LibreCat | Files available

2022 | Conference Paper | LibreCat-ID: 33848 |

Cord-Landwehr T, Boeddeker C, von Neumann T, Zorila C, Doddipatla R, Haeb-Umbach R. Monaural source separation: From anechoic to reverberant environments. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE; 2022.

LibreCat | Files available | arXiv

2022 | Conference Paper | LibreCat-ID: 33819 |

von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. In: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2022. doi:10.1109/icassp43922.2022.9746757

LibreCat | Files available | DOI

2022 | Misc | LibreCat-ID: 33816 |

Gburrek T, Boeddeker C, von Neumann T, Cord-Landwehr T, Schmalenstroeer J, Haeb-Umbach R. A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv; 2022. doi:10.48550/ARXIV.2205.00944

LibreCat | Files available | DOI

2022 | Conference Paper | LibreCat-ID: 33954 |

Boeddeker C, Cord-Landwehr T, von Neumann T, Haeb-Umbach R. An Initialization Scheme for Meeting Separation with Spatial Mixture Models. In: Interspeech 2022. ISCA; 2022. doi:10.21437/interspeech.2022-10929

LibreCat | DOI | Download (ext.)

2022 | Conference Paper | LibreCat-ID: 33958

Kinoshita K, von Neumann T, Delcroix M, Boeddeker C, Haeb-Umbach R. Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. In: Proc. Interspeech 2022. ISCA; 2022:1486-1490. doi:10.21437/Interspeech.2022-11408

LibreCat | DOI | Download (ext.)

2021 | Journal Article | LibreCat-ID: 21065 |

Haeb-Umbach R, Heymann J, Drude L, Watanabe S, Delcroix M, Nakatani T. Far-Field Automatic Speech Recognition. Proceedings of the IEEE. 2021;109(2):124-148. doi:10.1109/JPROC.2020.3018668

LibreCat | Files available | DOI

2021 | Conference Paper | LibreCat-ID: 28256

Zhang W, Boeddeker C, Watanabe S, et al. End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2021. doi:10.1109/icassp39728.2021.9414464

LibreCat | DOI

2021 | Conference Paper | LibreCat-ID: 28262

Li C, Shi J, Zhang W, et al. ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration. In: 2021 IEEE Spoken Language Technology Workshop (SLT). ; 2021. doi:10.1109/slt48900.2021.9383615

LibreCat | DOI

Publications at Paderborn University

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

333 Publications

Filters and Search Terms

Search

Filter Publications

Display / Sort

Export / Embed

Publications at Paderborn University

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

333 Publications

Filters and Search Terms

Search

Filter Publications

Display / Sort

Export / Embed

Export Options