Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).
We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.
333 Publications
2023 | Conference Paper | LibreCat-ID: 48390 |

Berger S, Vieting P, Boeddeker C, Schlüter R, Haeb-Umbach R. Mixture Encoder for Joint Speech Separation and Recognition. In: INTERSPEECH 2023. ISCA; 2023. doi:10.21437/interspeech.2023-1815
LibreCat
| DOI
| Download (ext.)
2022 | Journal Article | LibreCat-ID: 33669 |

Zhang W, Chang X, Boeddeker C, Nakatani T, Watanabe S, Qian Y. End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party. IEEE/ACM Transactions on Audio, Speech, and Language Processing. Published online 2022. doi:10.1109/TASLP.2022.3209942
LibreCat
| Files available
| DOI
2022 | Conference Paper | LibreCat-ID: 33471
Heitkämper J, Schmalenstroeer J, Haeb-Umbach R. Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels. In: Proceedings of the 30th European Signal Processing Conference (EUSIPCO).
LibreCat
| Files available
2022 | Conference Paper | LibreCat-ID: 33806
Afifi H, Karl H, Gburrek T, Schmalenstroeer J. Data-driven Time Synchronization in Wireless Multimedia Networks. In: 2022 International Wireless Communications and Mobile Computing (IWCMC). IEEE; 2022. doi:10.1109/iwcmc55113.2022.9824980
LibreCat
| DOI
2022 | Conference Paper | LibreCat-ID: 33847 |

Cord-Landwehr T, von Neumann T, Boeddeker C, Haeb-Umbach R. MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). ; 2022.
LibreCat
| Files available
| arXiv
2022 | Conference Paper | LibreCat-ID: 33807 |

Gburrek T, Schmalenstroeer J, Haeb-Umbach R. On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes. In: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2022. doi:10.1109/icassp43922.2022.9746284
LibreCat
| Files available
| DOI
2022 | Journal Article | LibreCat-ID: 33451 |

Grimm C, Fei T, Warsitz E, Farhoud R, Breddermann T, Haeb-Umbach R. Warping of Radar Data Into Camera Image for Cross-Modal Supervision in Automotive Applications. IEEE Transactions on Vehicular Technology. 2022;71(9):9435-9449. doi:10.1109/TVT.2022.3182411
LibreCat
| Files available
| DOI
2022 | Conference Paper | LibreCat-ID: 33696 |

Wiechmann J, Glarner T, Rautenberg F, Wagner P, Haeb-Umbach R. Technically enabled explaining of voice characteristics. In: 18. Phonetik Und Phonologie Im Deutschsprachigen Raum (P&P). ; 2022.
LibreCat
| Files available
2022 | Conference Paper | LibreCat-ID: 33857 |

Kuhlmann M, Seebauer F, Ebbers J, Wagner P, Haeb-Umbach R. Investigation into Target Speaking Rate Adaptation for Voice Conversion. In: Interspeech 2022. ISCA; 2022. doi:10.21437/interspeech.2022-10740
LibreCat
| Files available
| DOI
| Download (ext.)
2022 | Conference Paper | LibreCat-ID: 33808 |

Gburrek T, Schmalenstroeer J, Heitkaemper J, Haeb-Umbach R. Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE; 2022. doi:10.1109/IWAENC53105.2022.9914772
LibreCat
| Files available
| DOI
2022 | Conference Paper | LibreCat-ID: 34072 |

Ebbers J, Haeb-Umbach R, Serizel R. Threshold Independent Evaluation of Sound Event Detection Scores. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2022.
LibreCat
| Files available
2022 | Report | LibreCat-ID: 49113
Ebbers J, Haeb-Umbach R. Pre-Training And Self-Training For Sound Event Detection In Domestic Environments.; 2022.
LibreCat
| Files available
2022 | Conference Paper | LibreCat-ID: 33848 |

Cord-Landwehr T, Boeddeker C, von Neumann T, Zorila C, Doddipatla R, Haeb-Umbach R. Monaural source separation: From anechoic to reverberant environments. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE; 2022.
LibreCat
| Files available
| arXiv
2022 | Conference Paper | LibreCat-ID: 33819 |

von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. In: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2022. doi:10.1109/icassp43922.2022.9746757
LibreCat
| Files available
| DOI
2022 | Misc | LibreCat-ID: 33816 |

Gburrek T, Boeddeker C, von Neumann T, Cord-Landwehr T, Schmalenstroeer J, Haeb-Umbach R. A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv; 2022. doi:10.48550/ARXIV.2205.00944
LibreCat
| Files available
| DOI
2022 | Conference Paper | LibreCat-ID: 33954 |

Boeddeker C, Cord-Landwehr T, von Neumann T, Haeb-Umbach R. An Initialization Scheme for Meeting Separation with Spatial Mixture Models. In: Interspeech 2022. ISCA; 2022. doi:10.21437/interspeech.2022-10929
LibreCat
| DOI
| Download (ext.)
2022 | Conference Paper | LibreCat-ID: 33958
Kinoshita K, von Neumann T, Delcroix M, Boeddeker C, Haeb-Umbach R. Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. In: Proc. Interspeech 2022. ISCA; 2022:1486-1490. doi:10.21437/Interspeech.2022-11408
LibreCat
| DOI
| Download (ext.)
2021 | Journal Article | LibreCat-ID: 21065 |

Haeb-Umbach R, Heymann J, Drude L, Watanabe S, Delcroix M, Nakatani T. Far-Field Automatic Speech Recognition. Proceedings of the IEEE. 2021;109(2):124-148. doi:10.1109/JPROC.2020.3018668
LibreCat
| Files available
| DOI
2021 | Conference Paper | LibreCat-ID: 28256
Zhang W, Boeddeker C, Watanabe S, et al. End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2021. doi:10.1109/icassp39728.2021.9414464
LibreCat
| DOI
2021 | Conference Paper | LibreCat-ID: 28262
Li C, Shi J, Zhang W, et al. ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration. In: 2021 IEEE Spoken Language Technology Workshop (SLT). ; 2021. doi:10.1109/slt48900.2021.9383615
LibreCat
| DOI