Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).
We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.
331 Publications
2023 | Conference Paper | LibreCat-ID: 49111
J. Ebbers, R. Haeb-Umbach, and R. Serizel, “Post-Processing Independent Evaluation of Sound Event Detection Systems,” in Proceedings of the 8th Detection and Classification of Acoustic Scenes and Events 2023 Workshop (DCASE2023), 2023, pp. 36–40.
LibreCat
| Files available
2023 | Conference Paper | LibreCat-ID: 57098
F. Seebauer, M. Kuhlmann, R. Häb-Umbach, and P. Wagner, “DISCERNING DIMENSIONS OF QUALITY FOR STATE OF THE ART SYNTHETIC SPEECH,” presented at the International Congress of Phonetic Sciences (ICPhS), Prague, 2023.
LibreCat
2023 | Conference Paper | LibreCat-ID: 57086
M. Kuhlmann, A. Meise, F. Seebauer, P. Wagner, and R. Häb-Umbach, “Investigating Speaker Embedding Disentanglement on Natural Read Speech,” in Speech Communication; 15th ITG Conference, 2023, pp. 121–125.
LibreCat
2023 | Conference Paper | LibreCat-ID: 48281 |

T. von Neumann, C. Boeddeker, K. Kinoshita, M. Delcroix, and R. Haeb-Umbach, “On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems,” 2023, doi: 10.1109/icassp49357.2023.10094784.
LibreCat
| Files available
| DOI
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48275 |

T. von Neumann, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems,” presented at the CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin, 2023.
LibreCat
| Files available
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 47128 |

T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization,” presented at the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes, 2023, doi: 10.1109/icassp49357.2023.10095370.
LibreCat
| Files available
| DOI
2023 | Conference Paper | LibreCat-ID: 47129 |

T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures,” 2023, doi: 10.21437/interspeech.2023-1379.
LibreCat
| Files available
| DOI
2023 | Conference Paper | LibreCat-ID: 54439 |

C. Boeddeker, T. Cord-Landwehr, T. von Neumann, and R. Haeb-Umbach, “Multi-stage diarization refinement for the CHiME-7 DASR scenario,” 2023, doi: 10.21437/chime.2023-10.
LibreCat
| DOI
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48390 |

S. Berger, P. Vieting, C. Boeddeker, R. Schlüter, and R. Haeb-Umbach, “Mixture Encoder for Joint Speech Separation and Recognition,” 2023, doi: 10.21437/interspeech.2023-1815.
LibreCat
| DOI
| Download (ext.)
2022 | Journal Article | LibreCat-ID: 33669 |

W. Zhang, X. Chang, C. Boeddeker, T. Nakatani, S. Watanabe, and Y. Qian, “End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022, doi: 10.1109/TASLP.2022.3209942.
LibreCat
| Files available
| DOI
2022 | Conference Paper | LibreCat-ID: 33471
J. Heitkämper, J. Schmalenstroeer, and R. Haeb-Umbach, “Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels,” presented at the 30th European Signal Processing Conference (EUSIPCO), Belgrad.
LibreCat
| Files available
2022 | Conference Paper | LibreCat-ID: 33806
H. Afifi, H. Karl, T. Gburrek, and J. Schmalenstroeer, “Data-driven Time Synchronization in Wireless Multimedia Networks,” 2022, doi: 10.1109/iwcmc55113.2022.9824980.
LibreCat
| DOI
2022 | Conference Paper | LibreCat-ID: 33847 |

T. Cord-Landwehr, T. von Neumann, C. Boeddeker, and R. Haeb-Umbach, “MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator,” presented at the 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg, 2022.
LibreCat
| Files available
| arXiv
2022 | Conference Paper | LibreCat-ID: 33807 |

T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes,” 2022, doi: 10.1109/icassp43922.2022.9746284.
LibreCat
| Files available
| DOI
2022 | Journal Article | LibreCat-ID: 33451 |

C. Grimm, T. Fei, E. Warsitz, R. Farhoud, T. Breddermann, and R. Haeb-Umbach, “Warping of Radar Data Into Camera Image for Cross-Modal Supervision in Automotive Applications,” IEEE Transactions on Vehicular Technology, vol. 71, no. 9, pp. 9435–9449, 2022, doi: 10.1109/TVT.2022.3182411.
LibreCat
| Files available
| DOI
2022 | Conference Paper | LibreCat-ID: 33696 |

J. Wiechmann, T. Glarner, F. Rautenberg, P. Wagner, and R. Haeb-Umbach, “Technically enabled explaining of voice characteristics,” Bielefeld, 2022.
LibreCat
| Files available
2022 | Conference Paper | LibreCat-ID: 33857 |

M. Kuhlmann, F. Seebauer, J. Ebbers, P. Wagner, and R. Haeb-Umbach, “Investigation into Target Speaking Rate Adaptation for Voice Conversion,” 2022, doi: 10.21437/interspeech.2022-10740.
LibreCat
| Files available
| DOI
| Download (ext.)
2022 | Conference Paper | LibreCat-ID: 33808 |

T. Gburrek, J. Schmalenstroeer, J. Heitkaemper, and R. Haeb-Umbach, “Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription,” presented at the 17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022), Bamberg, Germany , 2022, doi: 10.1109/IWAENC53105.2022.9914772.
LibreCat
| Files available
| DOI
2022 | Conference Paper | LibreCat-ID: 34072 |

J. Ebbers, R. Haeb-Umbach, and R. Serizel, “Threshold Independent Evaluation of Sound Event Detection Scores,” 2022.
LibreCat
| Files available
2022 | Report | LibreCat-ID: 49113
J. Ebbers and R. Haeb-Umbach, Pre-Training And Self-Training For Sound Event Detection In Domestic Environments. 2022.
LibreCat
| Files available