Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

305 Publications


2023 | Journal Article | LibreCat-ID: 35602 | OA
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 576–589, 2023, doi: 10.1109/taslp.2022.3228629.
LibreCat | Files available | DOI
 

2023 | Conference Paper | LibreCat-ID: 48281 | OA
T. von Neumann, C. Boeddeker, K. Kinoshita, M. Delcroix, and R. Haeb-Umbach, “On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems,” 2023, doi: 10.1109/icassp49357.2023.10094784.
LibreCat | Files available | DOI | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 48275 | OA
T. von Neumann, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems,” presented at the CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin, 2023.
LibreCat | Files available | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 49109 | OA
T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks,” presented at the 57th Asilomar Conference on Signals, Systems, and Computers, 2023.
LibreCat | Files available
 

2023 | Conference Paper | LibreCat-ID: 44849 | OA
F. Rautenberg et al., “Speech Disentanglement for Analysis and Modification of Acoustic and Perceptual Speaker Characteristics,” in Fortschritte der Akustik - DAGA 2023, Hamburg, 2023, pp. 1409–1412.
LibreCat | Files available | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 54439 | OA
C. Boeddeker, T. Cord-Landwehr, T. von Neumann, and R. Haeb-Umbach, “Multi-stage diarization refinement for the CHiME-7 DASR scenario,” 2023, doi: 10.21437/chime.2023-10.
LibreCat | DOI | Download (ext.)
 

2022 | Conference Paper | LibreCat-ID: 33954 | OA
C. Boeddeker, T. Cord-Landwehr, T. von Neumann, and R. Haeb-Umbach, “An Initialization Scheme for Meeting Separation with Spatial Mixture Models,” 2022, doi: 10.21437/interspeech.2022-10929.
LibreCat | DOI | Download (ext.)
 

2022 | Conference Paper | LibreCat-ID: 33471
J. Heitkämper, J. Schmalenstroeer, and R. Haeb-Umbach, “Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels,” presented at the 30th European Signal Processing Conference (EUSIPCO), Belgrad.
LibreCat | Files available
 

2022 | Conference Paper | LibreCat-ID: 33958
K. Kinoshita, T. von Neumann, M. Delcroix, C. Boeddeker, and R. Haeb-Umbach, “Utterance-by-utterance overlap-aware neural diarization with Graph-PIT,” in Proc. Interspeech 2022, 2022, pp. 1486–1490, doi: 10.21437/Interspeech.2022-11408.
LibreCat | DOI
 

2022 | Conference Paper | LibreCat-ID: 33819 | OA
T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “SA-SDR: A Novel Loss Function for Separation of Meeting Style Data,” 2022, doi: 10.1109/icassp43922.2022.9746757.
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 33847 | OA
T. Cord-Landwehr, T. von Neumann, C. Boeddeker, and R. Haeb-Umbach, “MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator,” presented at the 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg, 2022.
LibreCat | Files available | arXiv
 

2022 | Conference Paper | LibreCat-ID: 33848 | OA
T. Cord-Landwehr, C. Boeddeker, T. von Neumann, C. Zorila, R. Doddipatla, and R. Haeb-Umbach, “Monaural source separation: From anechoic to reverberant environments,” presented at the 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.
LibreCat | Files available | arXiv
 

2022 | Conference Paper | LibreCat-ID: 33807 | OA
T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes,” 2022, doi: 10.1109/icassp43922.2022.9746284.
LibreCat | Files available | DOI
 

2022 | Journal Article | LibreCat-ID: 33451 | OA
C. Grimm, T. Fei, E. Warsitz, R. Farhoud, T. Breddermann, and R. Haeb-Umbach, “Warping of Radar Data Into Camera Image for Cross-Modal Supervision in Automotive Applications,” IEEE Transactions on Vehicular Technology, vol. 71, no. 9, pp. 9435–9449, 2022, doi: 10.1109/TVT.2022.3182411.
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 33696 | OA
J. Wiechmann, T. Glarner, F. Rautenberg, P. Wagner, and R. Haeb-Umbach, “Technically enabled explaining of voice characteristics,” Bielefeld, 2022.
LibreCat | Files available
 

2022 | Conference Paper | LibreCat-ID: 33857 | OA
M. Kuhlmann, F. Seebauer, J. Ebbers, P. Wagner, and R. Haeb-Umbach, “Investigation into Target Speaking Rate Adaptation for Voice Conversion,” 2022, doi: 10.21437/interspeech.2022-10740.
LibreCat | Files available | DOI | Download (ext.)
 

2022 | Conference Paper | LibreCat-ID: 33808 | OA
T. Gburrek, J. Schmalenstroeer, J. Heitkaemper, and R. Haeb-Umbach, “Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription,” presented at the 17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022), Bamberg, Germany , 2022, doi: 10.1109/IWAENC53105.2022.9914772.
LibreCat | Files available | DOI
 

2022 | Misc | LibreCat-ID: 33816 | OA
T. Gburrek, C. Boeddeker, T. von Neumann, T. Cord-Landwehr, J. Schmalenstroeer, and R. Haeb-Umbach, A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv, 2022.
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 34072 | OA
J. Ebbers, R. Haeb-Umbach, and R. Serizel, “Threshold Independent Evaluation of Sound Event Detection Scores,” 2022.
LibreCat | Files available
 

2021 | Journal Article | LibreCat-ID: 21065 | OA
R. Haeb-Umbach, J. Heymann, L. Drude, S. Watanabe, M. Delcroix, and T. Nakatani, “Far-Field Automatic Speech Recognition,” Proceedings of the IEEE, vol. 109, no. 2, pp. 124–148, 2021.
LibreCat | Files available | DOI
 

Filters and Search Terms

(person=242)

Search

Filter Publications

Display / Sort

Citation Style: IEEE

Export / Embed