Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

318 Publications


2022 | Conference Paper | LibreCat-ID: 33806
Afifi, H., Karl, H., Gburrek, T., & Schmalenstroeer, J. (2022). Data-driven Time Synchronization in Wireless Multimedia Networks. 2022 International Wireless Communications and Mobile Computing (IWCMC). https://doi.org/10.1109/iwcmc55113.2022.9824980
LibreCat | DOI
 

2022 | Conference Paper | LibreCat-ID: 33958
Kinoshita, K., von Neumann, T., Delcroix, M., Boeddeker, C., & Haeb-Umbach, R. (2022). Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. Proc. Interspeech 2022, 1486–1490. https://doi.org/10.21437/Interspeech.2022-11408
LibreCat | DOI
 

2022 | Conference Paper | LibreCat-ID: 33819 | OA
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2022). SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp43922.2022.9746757
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 33847 | OA
Cord-Landwehr, T., von Neumann, T., Boeddeker, C., & Haeb-Umbach, R. (2022). MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg.
LibreCat | Files available | arXiv
 

2022 | Conference Paper | LibreCat-ID: 33848 | OA
Cord-Landwehr, T., Boeddeker, C., von Neumann, T., Zorila, C., Doddipatla, R., & Haeb-Umbach, R. (2022). Monaural source separation: From anechoic to reverberant environments. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 2022 International Workshop on Acoustic Signal Enhancement (IWAENC).
LibreCat | Files available | arXiv
 

2022 | Conference Paper | LibreCat-ID: 33807 | OA
Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2022). On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp43922.2022.9746284
LibreCat | Files available | DOI
 

2022 | Journal Article | LibreCat-ID: 33451 | OA
Grimm, C., Fei, T., Warsitz, E., Farhoud, R., Breddermann, T., & Haeb-Umbach, R. (2022). Warping of Radar Data Into Camera Image for Cross-Modal Supervision in Automotive Applications. IEEE Transactions on Vehicular Technology, 71(9), 9435–9449. https://doi.org/10.1109/TVT.2022.3182411
LibreCat | Files available | DOI
 

2022 | Report | LibreCat-ID: 49113
Ebbers, J., & Haeb-Umbach, R. (2022). Pre-Training And Self-Training For Sound Event Detection In Domestic Environments.
LibreCat | Files available
 

2022 | Conference Paper | LibreCat-ID: 33696 | OA
Wiechmann, J., Glarner, T., Rautenberg, F., Wagner, P., & Haeb-Umbach, R. (2022). Technically enabled explaining of voice characteristics. 18. Phonetik Und Phonologie Im Deutschsprachigen Raum (P&P).
LibreCat | Files available
 

2022 | Conference Paper | LibreCat-ID: 33857 | OA
Kuhlmann, M., Seebauer, F., Ebbers, J., Wagner, P., & Haeb-Umbach, R. (2022). Investigation into Target Speaking Rate Adaptation for Voice Conversion. Interspeech 2022. https://doi.org/10.21437/interspeech.2022-10740
LibreCat | Files available | DOI | Download (ext.)
 

2022 | Conference Paper | LibreCat-ID: 33808 | OA
Gburrek, T., Schmalenstroeer, J., Heitkaemper, J., & Haeb-Umbach, R. (2022). Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022), Bamberg, Germany . https://doi.org/10.1109/IWAENC53105.2022.9914772
LibreCat | Files available | DOI
 

2022 | Misc | LibreCat-ID: 33816 | OA
Gburrek, T., Boeddeker, C., von Neumann, T., Cord-Landwehr, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2022). A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv. https://doi.org/10.48550/ARXIV.2205.00944
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 34072 | OA
Ebbers, J., Haeb-Umbach, R., & Serizel, R. (2022). Threshold Independent Evaluation of Sound Event Detection Scores. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
LibreCat | Files available
 

2021 | Journal Article | LibreCat-ID: 21065 | OA
Haeb-Umbach, R., Heymann, J., Drude, L., Watanabe, S., Delcroix, M., & Nakatani, T. (2021). Far-Field Automatic Speech Recognition. Proceedings of the IEEE, 109(2), 124–148. https://doi.org/10.1109/JPROC.2020.3018668
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 28256
Zhang, W., Boeddeker, C., Watanabe, S., Nakatani, T., Delcroix, M., Kinoshita, K., Ochiai, T., Kamo, N., Haeb-Umbach, R., & Qian, Y. (2021). End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp39728.2021.9414464
LibreCat | DOI
 

2021 | Conference Paper | LibreCat-ID: 28262
Li, C., Shi, J., Zhang, W., Subramanian, A. S., Chang, X., Kamo, N., Hira, M., Hayashi, T., Boeddeker, C., Chen, Z., & Watanabe, S. (2021). ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration. 2021 IEEE Spoken Language Technology Workshop (SLT). https://doi.org/10.1109/slt48900.2021.9383615
LibreCat | DOI
 

2021 | Conference Paper | LibreCat-ID: 28261
Li, C., Luo, Y., Han, C., Li, J., Yoshioka, T., Zhou, T., Delcroix, M., Kinoshita, K., Boeddeker, C., Qian, Y., Watanabe, S., & Chen, Z. (2021). Dual-Path RNN for Long Recording Speech Separation. 2021 IEEE Spoken Language Technology Workshop (SLT). https://doi.org/10.1109/slt48900.2021.9383514
LibreCat | DOI
 

2021 | Conference Paper | LibreCat-ID: 24000
Heitkaemper, J., Schmalenstroeer, J., Ion, V., & Haeb-Umbach, R. (2021). A Database for Research on Detection and Enhancement of Speech Transmitted over HF links. Speech Communication; 14th ITG-Symposium, 1–5.
LibreCat
 

2021 | Conference Paper | LibreCat-ID: 44843 | OA
Boeddeker, C., Rautenberg, F., & Haeb-Umbach, R. (2021). A Comparison and Combination of Unsupervised Blind Source Separation  Techniques. ITG Conference on Speech Communication. ITG Conference on Speech Communication, Kiel.
LibreCat | Files available | Download (ext.) | arXiv
 

2021 | Conference Paper | LibreCat-ID: 28259 | OA
Boeddeker, C., Zhang, W., Nakatani, T., Kinoshita, K., Ochiai, T., Delcroix, M., Kamo, N., Qian, Y., & Haeb-Umbach, R. (2021). Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp39728.2021.9414661
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 23998 | OA
Schmalenstroeer, J., Heitkaemper, J., Ullmann, J., & Haeb-Umbach, R. (2021). Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech. 29th European Signal Processing Conference (EUSIPCO), 1–5.
LibreCat | Download (ext.)
 

2021 | Journal Article | LibreCat-ID: 22528 | OA
Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2021). Geometry calibration in wireless acoustic sensor networks utilizing DoA and distance information. EURASIP Journal on Audio, Speech, and Music Processing. https://doi.org/10.1186/s13636-021-00210-x
LibreCat | DOI | Download (ext.)
 

2021 | Conference Paper | LibreCat-ID: 23994 | OA
Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2021). Iterative Geometry Calibration from Distance Estimates for Wireless Acoustic Sensor Networks. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp39728.2021.9413831
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 23999 | OA
Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2021). On Source-Microphone Distance Estimation Using Convolutional Recurrent Neural Networks. Speech Communication; 14th ITG-Symposium, 1–5.
LibreCat | Files available
 

2021 | Conference Paper | LibreCat-ID: 23997 | OA
Chinaev, A., Enzner, G., Gburrek, T., & Schmalenstroeer, J. (2021). Online Estimation of Sampling Rate Offsets in Wireless Acoustic Sensor Networks with Packet Loss. 29th European Signal Processing Conference (EUSIPCO), 1–5.
LibreCat | Download (ext.)
 

2021 | Conference Paper | LibreCat-ID: 29304 | OA
Ebbers, J., Kuhlmann, M., Cord-Landwehr, T., & Haeb-Umbach, R. (2021). Contrastive Predictive Coding Supported Factorized Variational Autoencoder for Unsupervised Learning of Disentangled Speech Representations. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 3860–3864.
LibreCat | Files available
 

2021 | Conference Paper | LibreCat-ID: 26770 | OA
von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2021). Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. Interspeech 2021. Interspeech. https://doi.org/10.21437/interspeech.2021-1177
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 29173 | OA
von Neumann, T., Boeddeker, C., Kinoshita, K., Delcroix, M., & Haeb-Umbach, R. (2021). Speeding Up Permutation Invariant Training for Source Separation. Speech Communication; 14th ITG Conference. Speech Communication; 14th ITG Conference, Kiel.
LibreCat | Files available
 

2021 | Conference Paper | LibreCat-ID: 29308 | OA
Ebbers, J., & Haeb-Umbach, R. (2021). Self-Trained Audio Tagging and Sound Event Detection in Domestic Environments. Proceedings of the 6th Detection and Classification of Acoustic Scenes and Events 2021 Workshop (DCASE2021), 226–230.
LibreCat | Files available
 

2021 | Conference Paper | LibreCat-ID: 29306 | OA
Ebbers, J., Keyser, M. C., & Haeb-Umbach, R. (2021). Adapting Sound Recognition to A New Environment Via Self-Training. Proceedings of the 29th European Signal Processing Conference (EUSIPCO), 1135–1139.
LibreCat | Files available
 

2021 | Journal Article | LibreCat-ID: 24456 | OA
Rohlfing, K. J., Cimiano, P., Scharlau, I., Matzner, T., Buhl, H. M., Buschmeier, H., Esposito, E., Grimminger, A., Hammer, B., Haeb-Umbach, R., Horwath, I., Hüllermeier, E., Kern, F., Kopp, S., Thommes, K., Ngonga Ngomo, A.-C., Schulte, C., Wachsmuth, H., Wagner, P., & Wrede, B. (2021). Explanation as a Social Practice: Toward a Conceptual Framework for the Social Design of AI Systems. IEEE Transactions on Cognitive and Developmental Systems, 13(3), 717–728. https://doi.org/10.1109/tcds.2020.3044366
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 17763 | OA
Haeb-Umbach, R. (2020). Sprachtechnologien für Digitale Assistenten. In R. Böck, I. Siegert, & A. Wendemuth (Eds.), Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2020 (pp. 227–234). TUDpress, Dresden.
LibreCat | Download (ext.)
 

2020 | Conference Paper | LibreCat-ID: 20695 | OA
Boeddeker, C., Nakatani, T., Kinoshita, K., & Haeb-Umbach, R. (2020). Jointly Optimal Dereverberation and Beamforming. In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp40776.2020.9054393
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 20700 | OA
Boeddeker, C., Cord-Landwehr, T., Heitkaemper, J., Zorila, C., Hayakawa, D., Li, M., … Haeb-Umbach, R. (2020). Towards a speaker diarization system for the CHiME 2020 dinner party transcription. In Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments.
LibreCat | Files available
 

2020 | Journal Article | LibreCat-ID: 17598 | OA
Nakatani, T., Boeddeker, C., Kinoshita, K., Ikeshita, R., Delcroix, M., & Haeb-Umbach, R. (2020). Jointly optimal denoising, dereverberation, and source separation. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 1–1. https://doi.org/10.1109/TASLP.2020.3013118
LibreCat | DOI | Download (ext.)
 

2020 | Conference Paper | LibreCat-ID: 20504
Heitkaemper, J., Jakobeit, D., Boeddeker, C., Drude, L., & Haeb-Umbach, R. (2020). Demystifying TasNet: A Dissecting Approach. ICASSP 2020 Virtual Barcelona Spain.
LibreCat | Files available
 

2020 | Preprint | LibreCat-ID: 28263
Watanabe, S., Mandel, M., Barker, J., Vincent, E., Arora, A., Chang, X., Khudanpur, S., Manohar, V., Povey, D., Raj, D., Snyder, D., Subramanian, A. S., Trmal, J., Yair, B. B., Boeddeker, C., Ni, Z., Fujita, Y., Horiguchi, S., Kanda, N., … Ryant, N. (2020). CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for  Unsegmented Recordings. In arXiv:2004.09249.
LibreCat
 

2020 | Conference Paper | LibreCat-ID: 20505
Heitkaemper, J., Schmalenstroeer, J., & Haeb-Umbach, R. (2020). Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments. INTERSPEECH 2020 Virtual Shanghai China.
LibreCat | Files available
 

2020 | Conference Paper | LibreCat-ID: 20762 | OA
von Neumann, T., Kinoshita, K., Drude, L., Boeddeker, C., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). End-to-End Training of Time Domain Audio Separation and Recognition. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 7004–7008. https://doi.org/10.1109/ICASSP40776.2020.9053461
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 20764 | OA
von Neumann, T., Boeddeker, C., Drude, L., Kinoshita, K., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. Proc. Interspeech 2020, 3097–3101. https://doi.org/10.21437/Interspeech.2020-2519
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 18651 | OA
Gburrek, T., Schmalenstroeer, J., Brendel, A., Kellermann, W., & Haeb-Umbach, R. (2020). Deep Neural Network based Distance Estimation for Geometry Calibration in Acoustic Sensor Network. European Signal Processing Conference (EUSIPCO).
LibreCat | Files available
 

2020 | Conference Paper | LibreCat-ID: 20766 | OA
Kinoshita, K., von Neumann, T., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation. Proc. Interspeech 2020, 2652–2656. https://doi.org/10.21437/Interspeech.2020-2388
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 20753 | OA
Ebbers, J., & Haeb-Umbach, R. (2020). Forward-Backward Convolutional Recurrent Neural Networks and Tag-Conditioned Convolutional Neural Networks for Weakly Labeled Semi-Supervised Sound Event Detection. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2020 Workshop (DCASE2020).
LibreCat | Files available
 

2019 | Journal Article | LibreCat-ID: 17762
Haeb-Umbach, R. (2019). Lektionen für Alexa \& Co?! Forschung, 44(1), 12–15. https://doi.org/10.1002/fors.201970104
LibreCat | DOI
 

2019 | Journal Article | LibreCat-ID: 19446 | OA
Drude, L., Heitkaemper, J., Boeddeker, C., & Haeb-Umbach, R. (2019). SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition. ArXiv E-Prints.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 11965 | OA
Drude, L., Heymann, J., & Haeb-Umbach, R. (2019). Unsupervised training of neural mask-based beamforming. In INTERSPEECH 2019, Graz, Austria.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 12874 | OA
Drude, L., Hasenklever, D., & Haeb-Umbach, R. (2019). Unsupervised Training of a Deep Clustering Model for Multichannel Blind Source Separation. In ICASSP 2019, Brighton, UK.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 12875 | OA
Heymann, J., Drude, L., Haeb-Umbach, R., Kinoshita, K., & Nakatani, T. (2019). Joint Optimization of Neural Network-based WPE Dereverberation and Acoustic Model for Robust Online ASR. In ICASSP 2019, Brighton, UK.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 12876 | OA
Kurz, G., Gilitschenski, I., Pfaff, F., Drude, L., Hanebeck, U. D., Haeb-Umbach, R., & Siegwart, R. Y. (2019). Directional Statistics and Filtering Using libDirectional. In Journal of Statistical Software 89(4).
LibreCat | Files available
 

2019 | Journal Article | LibreCat-ID: 12890 | OA
Drude, L., & Haeb-Umbach, R. (2019). Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation. IEEE Journal of Selected Topics in Signal Processing. https://doi.org/10.1109/JSTSP.2019.2912565
LibreCat | Files available | DOI
 

Filters and Search Terms

department=54

Search

Filter Publications

Display / Sort

Citation Style: APA

Export / Embed