333 Publications
2023 | Conference Paper | LibreCat-ID: 48390
Berger, S., Vieting, P., Boeddeker, C., Schlüter, R., & Haeb-Umbach, R. (2023). Mixture Encoder for Joint Speech Separation and Recognition. INTERSPEECH 2023. https://doi.org/10.21437/interspeech.2023-1815
2022 | Journal Article | LibreCat-ID: 33669
Zhang, W., Chang, X., Boeddeker, C., Nakatani, T., Watanabe, S., & Qian, Y. (2022). End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party. IEEE/ACM Transactions on Audio, Speech, and Language Processing. https://doi.org/10.1109/TASLP.2022.3209942
2022 | Conference Paper | LibreCat-ID: 33471
Heitkämper, J., Schmalenstroeer, J., & Haeb-Umbach, R. (n.d.). Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels. Proceedings of the 30th European Signal Processing Conference (EUSIPCO). 30th European Signal Processing Conference (EUSIPCO), Belgrade.
2022 | Conference Paper | LibreCat-ID: 33806
Afifi, H., Karl, H., Gburrek, T., & Schmalenstroeer, J. (2022). Data-driven Time Synchronization in Wireless Multimedia Networks. 2022 International Wireless Communications and Mobile Computing (IWCMC). https://doi.org/10.1109/iwcmc55113.2022.9824980
2022 | Conference Paper | LibreCat-ID: 33847
Cord-Landwehr, T., von Neumann, T., Boeddeker, C., & Haeb-Umbach, R. (2022). MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg.
2022 | Conference Paper | LibreCat-ID: 33807
Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2022). On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp43922.2022.9746284
2022 | Journal Article | LibreCat-ID: 33451
Grimm, C., Fei, T., Warsitz, E., Farhoud, R., Breddermann, T., & Haeb-Umbach, R. (2022). Warping of Radar Data Into Camera Image for Cross-Modal Supervision in Automotive Applications. IEEE Transactions on Vehicular Technology, 71(9), 9435–9449. https://doi.org/10.1109/TVT.2022.3182411
2022 | Conference Paper | LibreCat-ID: 33696
Wiechmann, J., Glarner, T., Rautenberg, F., Wagner, P., & Haeb-Umbach, R. (2022). Technically enabled explaining of voice characteristics. 18. Phonetik Und Phonologie Im Deutschsprachigen Raum (P&P).
2022 | Conference Paper | LibreCat-ID: 33857
Kuhlmann, M., Seebauer, F., Ebbers, J., Wagner, P., & Haeb-Umbach, R. (2022). Investigation into Target Speaking Rate Adaptation for Voice Conversion. Interspeech 2022. https://doi.org/10.21437/interspeech.2022-10740
2022 | Conference Paper | LibreCat-ID: 33808
Gburrek, T., Schmalenstroeer, J., Heitkaemper, J., & Haeb-Umbach, R. (2022). Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022), Bamberg, Germany. https://doi.org/10.1109/IWAENC53105.2022.9914772
2022 | Conference Paper | LibreCat-ID: 34072
Ebbers, J., Haeb-Umbach, R., & Serizel, R. (2022). Threshold Independent Evaluation of Sound Event Detection Scores. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
2022 | Report | LibreCat-ID: 49113
Ebbers, J., & Haeb-Umbach, R. (2022). Pre-Training And Self-Training For Sound Event Detection In Domestic Environments.
2022 | Conference Paper | LibreCat-ID: 33848
Cord-Landwehr, T., Boeddeker, C., von Neumann, T., Zorila, C., Doddipatla, R., & Haeb-Umbach, R. (2022). Monaural source separation: From anechoic to reverberant environments. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 2022 International Workshop on Acoustic Signal Enhancement (IWAENC).
2022 | Conference Paper | LibreCat-ID: 33819
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2022). SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp43922.2022.9746757
2022 | Misc | LibreCat-ID: 33816
Gburrek, T., Boeddeker, C., von Neumann, T., Cord-Landwehr, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2022). A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv. https://doi.org/10.48550/ARXIV.2205.00944
2022 | Conference Paper | LibreCat-ID: 33954
Boeddeker, C., Cord-Landwehr, T., von Neumann, T., & Haeb-Umbach, R. (2022). An Initialization Scheme for Meeting Separation with Spatial Mixture Models. Interspeech 2022. https://doi.org/10.21437/interspeech.2022-10929
2022 | Conference Paper | LibreCat-ID: 33958
Kinoshita, K., von Neumann, T., Delcroix, M., Boeddeker, C., & Haeb-Umbach, R. (2022). Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. Proc. Interspeech 2022, 1486–1490. https://doi.org/10.21437/Interspeech.2022-11408
2021 | Journal Article | LibreCat-ID: 21065
Haeb-Umbach, R., Heymann, J., Drude, L., Watanabe, S., Delcroix, M., & Nakatani, T. (2021). Far-Field Automatic Speech Recognition. Proceedings of the IEEE, 109(2), 124–148. https://doi.org/10.1109/JPROC.2020.3018668
2021 | Conference Paper | LibreCat-ID: 28256
Zhang, W., Boeddeker, C., Watanabe, S., Nakatani, T., Delcroix, M., Kinoshita, K., Ochiai, T., Kamo, N., Haeb-Umbach, R., & Qian, Y. (2021). End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp39728.2021.9414464
2021 | Conference Paper | LibreCat-ID: 28262
Li, C., Shi, J., Zhang, W., Subramanian, A. S., Chang, X., Kamo, N., Hira, M., Hayashi, T., Boeddeker, C., Chen, Z., & Watanabe, S. (2021). ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration. 2021 IEEE Spoken Language Technology Workshop (SLT). https://doi.org/10.1109/slt48900.2021.9383615