331 Publications
2024 | Preprint | LibreCat-ID: 56273
S. Cornell et al., “The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization,” arXiv:2407.16447. 2024.
2024 | Conference Paper | LibreCat-ID: 57031
T. Gburrek, A. Meise, J. Schmalenstroeer, and R. Haeb-Umbach, “Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation Models,” 2024, doi: 10.1109/iwaenc61483.2024.10694103.
2024 | Report | LibreCat-ID: 57161
A. Werning and R. Haeb-Umbach, UPB-NT submission to DCASE24: Dataset pruning for targeted knowledge distillation. 2024.
2024 | Conference Paper | LibreCat-ID: 57160
A. Werning and R. Haeb-Umbach, “Target-Specific Dataset Pruning for Compression of Audio Tagging Models,” presented at the 32nd European Signal Processing Conference, Lyon, 2024.
2024 | Conference Paper | LibreCat-ID: 57099
Y. Xie, M. Kuhlmann, F. Rautenberg, Z.-H. Tan, and R. Haeb-Umbach, “Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder,” in 2024 32nd European Signal Processing Conference (EUSIPCO), 2024, pp. 436–440.
2024 | Conference Paper | LibreCat-ID: 56004
T. von Neumann, C. Boeddeker, T. Cord-Landwehr, M. Delcroix, and R. Haeb-Umbach, “Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization,” 2024, doi: 10.1109/icasspw62465.2024.10625894.
2024 | Journal Article | LibreCat-ID: 52958
C. Boeddeker, A. S. Subramanian, G. Wichern, R. Haeb-Umbach, and J. Le Roux, “TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 1185–1197, 2024, doi: 10.1109/taslp.2024.3350887.
2024 | Conference Paper | LibreCat-ID: 53659
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios,” presented at the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul, 2024, doi: 10.1109/icassp48485.2024.10445911.
2024 | Preprint | LibreCat-ID: 57085
T. Cord-Landwehr, C. Boeddeker, and R. Haeb-Umbach, “Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models.” 2024.
2024 | Conference Paper | LibreCat-ID: 56272
C. Boeddeker, T. Cord-Landwehr, and R. Haeb-Umbach, “Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment,” 2024, doi: 10.21437/interspeech.2024-1286.
2024 | Conference Paper | LibreCat-ID: 57659
P. Vieting, S. Berger, T. von Neumann, C. Boeddeker, R. Schlüter, and R. Haeb-Umbach, “Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription,” 2024.
2023 | Conference Paper | LibreCat-ID: 48269
T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “On the Integration of Sampling Rate Synchronization and Acoustic Beamforming,” presented at the European Signal Processing Conference (EUSIPCO), Helsinki, 2023.
2023 | Conference Paper | LibreCat-ID: 48270
J. Schmalenstroeer, T. Gburrek, and R. Haeb-Umbach, “LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices,” presented at the ITG Conference on Speech Communication, Aachen, 2023.
2023 | Conference Paper | LibreCat-ID: 48355
F. Rautenberg, M. Kuhlmann, J. Wiechmann, F. Seebauer, P. Wagner, and R. Haeb-Umbach, “On Feature Importance and Interpretability of Speaker Representations,” presented at the ITG Conference on Speech Communication, Aachen, 2023.
2023 | Conference Paper | LibreCat-ID: 48410
J. Wiechmann, F. Rautenberg, P. Wagner, and R. Haeb-Umbach, “Explaining voice characteristics to novice voice practitioners-How successful is it?,” 2023.
2023 | Conference Paper | LibreCat-ID: 48391
R. Aralikatti, C. Boeddeker, G. Wichern, A. Subramanian, and J. Le Roux, “Reverberation as Supervision For Speech Separation,” 2023, doi: 10.1109/icassp49357.2023.10095022.
2023 | Conference Paper | LibreCat-ID: 46069
F. Seebauer, M. Kuhlmann, R. Haeb-Umbach, and P. Wagner, “Re-examining the quality dimensions of synthetic speech,” 2023.
2023 | Journal Article | LibreCat-ID: 35602
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 576–589, 2023, doi: 10.1109/taslp.2022.3228629.
2023 | Conference Paper | LibreCat-ID: 49109
T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks,” presented at the 57th Asilomar Conference on Signals, Systems, and Computers, 2023.
2023 | Conference Paper | LibreCat-ID: 44849
F. Rautenberg et al., “Speech Disentanglement for Analysis and Modification of Acoustic and Perceptual Speaker Characteristics,” in Fortschritte der Akustik - DAGA 2023, Hamburg, 2023, pp. 1409–1412.