Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).
We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.
331 Publications
2024 | Preprint | LibreCat-ID: 56273 |

The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization
S. Cornell, T. Park, S. Huang, C. Boeddeker, X. Chang, M. Maciejewski, M. Wiesner, P. Garcia, S. Watanabe, ArXiv:2407.16447 (2024).
LibreCat
| Download (ext.)
| arXiv
S. Cornell, T. Park, S. Huang, C. Boeddeker, X. Chang, M. Maciejewski, M. Wiesner, P. Garcia, S. Watanabe, ArXiv:2407.16447 (2024).
2024 | Conference Paper | LibreCat-ID: 57031 |

Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation Models
T. Gburrek, A. Meise, J. Schmalenstroeer, R. Haeb-Umbach, in: 2024 18th International Workshop on Acoustic Signal Enhancement (IWAENC), IEEE, 2024.
LibreCat
| Files available
| DOI
T. Gburrek, A. Meise, J. Schmalenstroeer, R. Haeb-Umbach, in: 2024 18th International Workshop on Acoustic Signal Enhancement (IWAENC), IEEE, 2024.
2024 | Report | LibreCat-ID: 57161
UPB-NT submission to DCASE24: Dataset pruning for targeted knowledge distillation
A. Werning, R. Haeb-Umbach, UPB-NT Submission to DCASE24: Dataset Pruning for Targeted Knowledge Distillation, 2024.
LibreCat
A. Werning, R. Haeb-Umbach, UPB-NT Submission to DCASE24: Dataset Pruning for Targeted Knowledge Distillation, 2024.
2024 | Conference Paper | LibreCat-ID: 57160
Target-Specific Dataset Pruning for Compression of Audio Tagging Models
A. Werning, R. Haeb-Umbach, in: 32nd European Signal Processing Conference (EUSIPCO 2024), 2024.
LibreCat
| Files available
A. Werning, R. Haeb-Umbach, in: 32nd European Signal Processing Conference (EUSIPCO 2024), 2024.
2024 | Conference Paper | LibreCat-ID: 57099
Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder
Y. Xie, M. Kuhlmann, F. Rautenberg, Z.-H. Tan, R. Häb-Umbach, in: 2024 32nd European Signal Processing Conference (EUSIPCO), 2024, pp. 436–440.
LibreCat
Y. Xie, M. Kuhlmann, F. Rautenberg, Z.-H. Tan, R. Häb-Umbach, in: 2024 32nd European Signal Processing Conference (EUSIPCO), 2024, pp. 436–440.
2024 | Conference Paper | LibreCat-ID: 56004 |

Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization
T. von Neumann, C. Boeddeker, T. Cord-Landwehr, M. Delcroix, R. Haeb-Umbach, in: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW), IEEE, 2024.
LibreCat
| Files available
| DOI
T. von Neumann, C. Boeddeker, T. Cord-Landwehr, M. Delcroix, R. Haeb-Umbach, in: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW), IEEE, 2024.
2024 | Journal Article | LibreCat-ID: 52958 |

TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings
C. Boeddeker, A.S. Subramanian, G. Wichern, R. Haeb-Umbach, J. Le Roux, IEEE/ACM Transactions on Audio, Speech, and Language Processing 32 (2024) 1185–1197.
LibreCat
| DOI
| Download (ext.)
C. Boeddeker, A.S. Subramanian, G. Wichern, R. Haeb-Umbach, J. Le Roux, IEEE/ACM Transactions on Audio, Speech, and Language Processing 32 (2024) 1185–1197.
2024 | Conference Paper | LibreCat-ID: 53659
Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, R. Haeb-Umbach, in: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2024.
LibreCat
| DOI
T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, R. Haeb-Umbach, in: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2024.
2024 | Preprint | LibreCat-ID: 57085 |

Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models
T. Cord-Landwehr, C. Boeddeker, R. Haeb-Umbach, (2024).
LibreCat
| Download (ext.)
T. Cord-Landwehr, C. Boeddeker, R. Haeb-Umbach, (2024).
2024 | Conference Paper | LibreCat-ID: 56272 |

Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment
C. Boeddeker, T. Cord-Landwehr, R. Haeb-Umbach, in: Interspeech 2024, ISCA, 2024.
LibreCat
| DOI
| Download (ext.)
C. Boeddeker, T. Cord-Landwehr, R. Haeb-Umbach, in: Interspeech 2024, ISCA, 2024.
2024 | Conference Paper | LibreCat-ID: 57659 |

Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription
P. Vieting, S. Berger, T. von Neumann, C. Boeddeker, R. Schlüter, R. Haeb-Umbach, in: 2024 IEEE Spoken Language Technology Workshop (SLT), 2024.
LibreCat
| Download (ext.)
P. Vieting, S. Berger, T. von Neumann, C. Boeddeker, R. Schlüter, R. Haeb-Umbach, in: 2024 IEEE Spoken Language Technology Workshop (SLT), 2024.
2023 | Conference Paper | LibreCat-ID: 48269 |

On the Integration of Sampling Rate Synchronization and Acoustic Beamforming
T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, in: European Signal Processing Conference (EUSIPCO), 2023.
LibreCat
| Download (ext.)
T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, in: European Signal Processing Conference (EUSIPCO), 2023.
2023 | Conference Paper | LibreCat-ID: 48270 |

LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices
J. Schmalenstroeer, T. Gburrek, R. Haeb-Umbach, in: ITG Conference on Speech Communication, 2023.
LibreCat
| Files available
J. Schmalenstroeer, T. Gburrek, R. Haeb-Umbach, in: ITG Conference on Speech Communication, 2023.
2023 | Conference Paper | LibreCat-ID: 48355 |

On Feature Importance and Interpretability of Speaker Representations
F. Rautenberg, M. Kuhlmann, J. Wiechmann, F. Seebauer, P. Wagner, R. Haeb-Umbach, in: ITG Conference on Speech Communication, 2023.
LibreCat
| Files available
| Download (ext.)
| arXiv
F. Rautenberg, M. Kuhlmann, J. Wiechmann, F. Seebauer, P. Wagner, R. Haeb-Umbach, in: ITG Conference on Speech Communication, 2023.
2023 | Conference Paper | LibreCat-ID: 48410 |

Explaining voice characteristics to novice voice practitioners-How successful is it?
J. Wiechmann, F. Rautenberg, P. Wagner, R. Haeb-Umbach, in: 20th International Congress of the Phonetic Sciences (ICPhS) , 2023.
LibreCat
| Files available
| Download (ext.)
J. Wiechmann, F. Rautenberg, P. Wagner, R. Haeb-Umbach, in: 20th International Congress of the Phonetic Sciences (ICPhS) , 2023.
2023 | Conference Paper | LibreCat-ID: 48391
Reverberation as Supervision For Speech Separation
R. Aralikatti, C. Boeddeker, G. Wichern, A. Subramanian, J. Le Roux, in: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023.
LibreCat
| DOI
R. Aralikatti, C. Boeddeker, G. Wichern, A. Subramanian, J. Le Roux, in: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023.
2023 | Conference Paper | LibreCat-ID: 46069
Re-examining the quality dimensions of synthetic speech
F. Seebauer, M. Kuhlmann, R. Haeb-Umbach, P. Wagner, in: 12th Speech Synthesis Workshop (SSW) 2023, 2023.
LibreCat
F. Seebauer, M. Kuhlmann, R. Haeb-Umbach, P. Wagner, in: 12th Speech Synthesis Workshop (SSW) 2023, 2023.
2023 | Journal Article | LibreCat-ID: 35602 |

Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, R. Haeb-Umbach, IEEE/ACM Transactions on Audio, Speech, and Language Processing 31 (2023) 576–589.
LibreCat
| Files available
| DOI
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, R. Haeb-Umbach, IEEE/ACM Transactions on Audio, Speech, and Language Processing 31 (2023) 576–589.
2023 | Conference Paper | LibreCat-ID: 49109 |

Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks
T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, in: Proc. Asilomar Conference on Signals, Systems, and Computers, 2023.
LibreCat
| Files available
T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, in: Proc. Asilomar Conference on Signals, Systems, and Computers, 2023.
2023 | Conference Paper | LibreCat-ID: 44849 |

Speech Disentanglement for Analysis and Modification of Acoustic and Perceptual Speaker Characteristics
F. Rautenberg, M. Kuhlmann, J. Ebbers, J. Wiechmann, F. Seebauer, P. Wagner, R. Haeb-Umbach, in: Fortschritte Der Akustik - DAGA 2023, 2023, pp. 1409–1412.
LibreCat
| Files available
| Download (ext.)
F. Rautenberg, M. Kuhlmann, J. Ebbers, J. Wiechmann, F. Seebauer, P. Wagner, R. Haeb-Umbach, in: Fortschritte Der Akustik - DAGA 2023, 2023, pp. 1409–1412.