Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

330 Publications


2023 | Conference Paper | LibreCat-ID: 48275 | OA
MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems
T. von Neumann, C. Boeddeker, M. Delcroix, R. Haeb-Umbach, in: Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments, 2023.
LibreCat | Files available | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 49109 | OA
Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks
T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, in: Proc. Asilomar Conference on Signals, Systems, and Computers, 2023.
LibreCat | Files available
 

2023 | Conference Paper | LibreCat-ID: 44849 | OA
Speech Disentanglement for Analysis and Modification of Acoustic and Perceptual Speaker Characteristics
F. Rautenberg, M. Kuhlmann, J. Ebbers, J. Wiechmann, F. Seebauer, P. Wagner, R. Haeb-Umbach, in: Fortschritte Der Akustik - DAGA 2023, 2023, pp. 1409–1412.
LibreCat | Files available | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 54439 | OA
Multi-stage diarization refinement for the CHiME-7 DASR scenario
C. Boeddeker, T. Cord-Landwehr, T. von Neumann, R. Haeb-Umbach, in: 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023), ISCA, 2023.
LibreCat | DOI | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 49111
Post-Processing Independent Evaluation of Sound Event Detection Systems
J. Ebbers, R. Haeb-Umbach, R. Serizel, in: Proceedings of the 8th Detection and Classification of Acoustic Scenes and Events 2023 Workshop (DCASE2023), Tampere, Finland, 2023, pp. 36–40.
LibreCat | Files available
 

2023 | Conference Paper | LibreCat-ID: 48281 | OA
On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems
T. von Neumann, C. Boeddeker, K. Kinoshita, M. Delcroix, R. Haeb-Umbach, in: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023.
LibreCat | Files available | DOI | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 57098
DISCERNING DIMENSIONS OF QUALITY FOR STATE OF THE ART SYNTHETIC SPEECH
F. Seebauer, M. Kuhlmann, R. Häb-Umbach, P. Wagner, in: Proceedings of the 20th International Congress of Phonetic Sciences, 2023.
LibreCat
 

2023 | Conference Paper | LibreCat-ID: 57086
Investigating Speaker Embedding Disentanglement on Natural Read Speech
M. Kuhlmann, A. Meise, F. Seebauer, P. Wagner, R. Häb-Umbach, in: Speech Communication; 15th ITG Conference, 2023, pp. 121–125.
LibreCat
 

2022 | Journal Article | LibreCat-ID: 33669 | OA
End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party
W. Zhang, X. Chang, C. Boeddeker, T. Nakatani, S. Watanabe, Y. Qian, IEEE/ACM Transactions on Audio, Speech, and Language Processing (2022).
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 33954 | OA
An Initialization Scheme for Meeting Separation with Spatial Mixture Models
C. Boeddeker, T. Cord-Landwehr, T. von Neumann, R. Haeb-Umbach, in: Interspeech 2022, ISCA, 2022.
LibreCat | DOI | Download (ext.)
 

2022 | Conference Paper | LibreCat-ID: 33471
Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels
J. Heitkämper, J. Schmalenstroeer, R. Haeb-Umbach, in: Proceedings of the 30th European Signal Processing Conference (EUSIPCO), Belgrad, n.d.
LibreCat | Files available
 

2022 | Conference Paper | LibreCat-ID: 33806
Data-driven Time Synchronization in Wireless Multimedia Networks
H. Afifi, H. Karl, T. Gburrek, J. Schmalenstroeer, in: 2022 International Wireless Communications and Mobile Computing (IWCMC), IEEE, 2022.
LibreCat | DOI
 

2022 | Conference Paper | LibreCat-ID: 33958
Utterance-by-utterance overlap-aware neural diarization with Graph-PIT
K. Kinoshita, T. von Neumann, M. Delcroix, C. Boeddeker, R. Haeb-Umbach, in: Proc. Interspeech 2022, ISCA, 2022, pp. 1486–1490.
LibreCat | DOI
 

2022 | Conference Paper | LibreCat-ID: 33819 | OA
SA-SDR: A Novel Loss Function for Separation of Meeting Style Data
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, R. Haeb-Umbach, in: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2022.
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 33847 | OA
MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator
T. Cord-Landwehr, T. von Neumann, C. Boeddeker, R. Haeb-Umbach, in: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.
LibreCat | Files available | arXiv
 

2022 | Conference Paper | LibreCat-ID: 33848 | OA
Monaural source separation: From anechoic to reverberant environments
T. Cord-Landwehr, C. Boeddeker, T. von Neumann, C. Zorila, R. Doddipatla, R. Haeb-Umbach, in: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), IEEE, Bamberg, 2022.
LibreCat | Files available | arXiv
 

2022 | Conference Paper | LibreCat-ID: 33807 | OA
On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes
T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, in: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2022.
LibreCat | Files available | DOI
 

2022 | Journal Article | LibreCat-ID: 33451 | OA
Warping of Radar Data Into Camera Image for Cross-Modal Supervision in Automotive Applications
C. Grimm, T. Fei, E. Warsitz, R. Farhoud, T. Breddermann, R. Haeb-Umbach, IEEE Transactions on Vehicular Technology 71 (2022) 9435–9449.
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 33696 | OA
Technically enabled explaining of voice characteristics
J. Wiechmann, T. Glarner, F. Rautenberg, P. Wagner, R. Haeb-Umbach, in: 18. Phonetik Und Phonologie Im Deutschsprachigen Raum (P&P), 2022.
LibreCat | Files available
 

2022 | Conference Paper | LibreCat-ID: 33857 | OA
Investigation into Target Speaking Rate Adaptation for Voice Conversion
M. Kuhlmann, F. Seebauer, J. Ebbers, P. Wagner, R. Haeb-Umbach, in: Interspeech 2022, ISCA, 2022.
LibreCat | Files available | DOI | Download (ext.)
 

Filters and Search Terms

department=54

Search

Filter Publications

Display / Sort

Export / Embed