Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

318 Publications


2022 | Conference Paper | LibreCat-ID: 33808 | OA
Gburrek T, Schmalenstroeer J, Heitkaemper J, Haeb-Umbach R. Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE; 2022. doi:10.1109/IWAENC53105.2022.9914772
LibreCat | Files available | DOI
 

2022 | Misc | LibreCat-ID: 33816 | OA
Gburrek T, Boeddeker C, von Neumann T, Cord-Landwehr T, Schmalenstroeer J, Haeb-Umbach R. A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv; 2022. doi:10.48550/ARXIV.2205.00944
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 34072 | OA
Ebbers J, Haeb-Umbach R, Serizel R. Threshold Independent Evaluation of Sound Event Detection Scores. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2022.
LibreCat | Files available
 

2021 | Journal Article | LibreCat-ID: 21065 | OA
Haeb-Umbach R, Heymann J, Drude L, Watanabe S, Delcroix M, Nakatani T. Far-Field Automatic Speech Recognition. Proceedings of the IEEE. 2021;109(2):124-148. doi:10.1109/JPROC.2020.3018668
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 28256
Zhang W, Boeddeker C, Watanabe S, et al. End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2021. doi:10.1109/icassp39728.2021.9414464
LibreCat | DOI
 

2021 | Conference Paper | LibreCat-ID: 28262
Li C, Shi J, Zhang W, et al. ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration. In: 2021 IEEE Spoken Language Technology Workshop (SLT). ; 2021. doi:10.1109/slt48900.2021.9383615
LibreCat | DOI
 

2021 | Conference Paper | LibreCat-ID: 28261
Li C, Luo Y, Han C, et al. Dual-Path RNN for Long Recording Speech Separation. In: 2021 IEEE Spoken Language Technology Workshop (SLT). ; 2021. doi:10.1109/slt48900.2021.9383514
LibreCat | DOI
 

2021 | Conference Paper | LibreCat-ID: 24000
Heitkaemper J, Schmalenstroeer J, Ion V, Haeb-Umbach R. A Database for Research on Detection and Enhancement of Speech Transmitted over HF links. In: Speech Communication; 14th ITG-Symposium. ; 2021:1-5.
LibreCat
 

2021 | Conference Paper | LibreCat-ID: 44843 | OA
Boeddeker C, Rautenberg F, Haeb-Umbach R. A Comparison and Combination of Unsupervised Blind Source Separation  Techniques. In: ITG Conference on Speech Communication. ; 2021.
LibreCat | Files available | Download (ext.) | arXiv
 

2021 | Conference Paper | LibreCat-ID: 28259 | OA
Boeddeker C, Zhang W, Nakatani T, et al. Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2021. doi:10.1109/icassp39728.2021.9414661
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 23998 | OA
Schmalenstroeer J, Heitkaemper J, Ullmann J, Haeb-Umbach R. Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech. In: 29th European Signal Processing Conference (EUSIPCO). ; 2021:1-5.
LibreCat | Download (ext.)
 

2021 | Journal Article | LibreCat-ID: 22528 | OA
Gburrek T, Schmalenstroeer J, Haeb-Umbach R. Geometry calibration in wireless acoustic sensor networks utilizing DoA and distance information. EURASIP Journal on Audio, Speech, and Music Processing. Published online 2021. doi:10.1186/s13636-021-00210-x
LibreCat | DOI | Download (ext.)
 

2021 | Conference Paper | LibreCat-ID: 23994 | OA
Gburrek T, Schmalenstroeer J, Haeb-Umbach R. Iterative Geometry Calibration from Distance Estimates for Wireless Acoustic Sensor Networks. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2021. doi:10.1109/icassp39728.2021.9413831
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 23999 | OA
Gburrek T, Schmalenstroeer J, Haeb-Umbach R. On Source-Microphone Distance Estimation Using Convolutional Recurrent Neural Networks. In: Speech Communication; 14th ITG-Symposium. ; 2021:1-5.
LibreCat | Files available
 

2021 | Conference Paper | LibreCat-ID: 23997 | OA
Chinaev A, Enzner G, Gburrek T, Schmalenstroeer J. Online Estimation of Sampling Rate Offsets in Wireless Acoustic Sensor Networks with Packet Loss. In: 29th European Signal Processing Conference (EUSIPCO). ; 2021:1-5.
LibreCat | Download (ext.)
 

2021 | Conference Paper | LibreCat-ID: 29304 | OA
Ebbers J, Kuhlmann M, Cord-Landwehr T, Haeb-Umbach R. Contrastive Predictive Coding Supported Factorized Variational Autoencoder for Unsupervised Learning of Disentangled Speech Representations. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2021:3860–3864.
LibreCat | Files available
 

2021 | Conference Paper | LibreCat-ID: 26770 | OA
von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. In: Interspeech 2021. ; 2021. doi:10.21437/interspeech.2021-1177
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 29173 | OA
von Neumann T, Boeddeker C, Kinoshita K, Delcroix M, Haeb-Umbach R. Speeding Up Permutation Invariant Training for Source Separation. In: Speech Communication; 14th ITG Conference. ; 2021.
LibreCat | Files available
 

2021 | Conference Paper | LibreCat-ID: 29308 | OA
Ebbers J, Haeb-Umbach R. Self-Trained Audio Tagging and Sound Event Detection in Domestic Environments. In: Proceedings of the 6th Detection and Classification of Acoustic Scenes and Events 2021 Workshop (DCASE2021). ; 2021:226–230.
LibreCat | Files available
 

2021 | Conference Paper | LibreCat-ID: 29306 | OA
Ebbers J, Keyser MC, Haeb-Umbach R. Adapting Sound Recognition to A New Environment Via Self-Training. In: Proceedings of the 29th European Signal Processing Conference (EUSIPCO). ; 2021:1135–1139.
LibreCat | Files available
 

2021 | Journal Article | LibreCat-ID: 24456 | OA
Rohlfing KJ, Cimiano P, Scharlau I, et al. Explanation as a Social Practice: Toward a Conceptual Framework for the Social Design of AI Systems. IEEE Transactions on Cognitive and Developmental Systems. 2021;13(3):717-728. doi:10.1109/tcds.2020.3044366
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 17763 | OA
Haeb-Umbach R. Sprachtechnologien für Digitale Assistenten. In: Böck R, Siegert I, Wendemuth A, eds. Studientexte Zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2020. TUDpress, Dresden; 2020:227-234.
LibreCat | Download (ext.)
 

2020 | Conference Paper | LibreCat-ID: 20695 | OA
Boeddeker C, Nakatani T, Kinoshita K, Haeb-Umbach R. Jointly Optimal Dereverberation and Beamforming. In: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2020. doi:10.1109/icassp40776.2020.9054393
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 20700 | OA
Boeddeker C, Cord-Landwehr T, Heitkaemper J, et al. Towards a speaker diarization system for the CHiME 2020 dinner party transcription. In: Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments. ; 2020.
LibreCat | Files available
 

2020 | Journal Article | LibreCat-ID: 17598 | OA
Nakatani T, Boeddeker C, Kinoshita K, Ikeshita R, Delcroix M, Haeb-Umbach R. Jointly optimal denoising, dereverberation, and source separation. IEEE/ACM Transactions on Audio, Speech, and Language Processing. Published online 2020:1-1. doi:10.1109/TASLP.2020.3013118
LibreCat | DOI | Download (ext.)
 

2020 | Conference Paper | LibreCat-ID: 20504
Heitkaemper J, Jakobeit D, Boeddeker C, Drude L, Haeb-Umbach R. Demystifying TasNet: A Dissecting Approach. In: ICASSP 2020 Virtual Barcelona Spain. ; 2020.
LibreCat | Files available
 

2020 | Preprint | LibreCat-ID: 28263
Watanabe S, Mandel M, Barker J, et al. CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for  Unsegmented Recordings. arXiv:200409249. Published online 2020.
LibreCat
 

2020 | Conference Paper | LibreCat-ID: 20505
Heitkaemper J, Schmalenstroeer J, Haeb-Umbach R. Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments. In: INTERSPEECH 2020 Virtual Shanghai China. ; 2020.
LibreCat | Files available
 

2020 | Conference Paper | LibreCat-ID: 20762 | OA
von Neumann T, Kinoshita K, Drude L, et al. End-to-End Training of Time Domain Audio Separation and Recognition. In: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2020:7004-7008. doi:10.1109/ICASSP40776.2020.9053461
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 20764 | OA
von Neumann T, Boeddeker C, Drude L, et al. Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. In: Proc. Interspeech 2020. ; 2020:3097-3101. doi:10.21437/Interspeech.2020-2519
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 18651 | OA
Gburrek T, Schmalenstroeer J, Brendel A, Kellermann W, Haeb-Umbach R. Deep Neural Network based Distance Estimation for Geometry Calibration in Acoustic Sensor Network. In: European Signal Processing Conference (EUSIPCO). ; 2020.
LibreCat | Files available
 

2020 | Conference Paper | LibreCat-ID: 20766 | OA
Kinoshita K, von Neumann T, Delcroix M, Nakatani T, Haeb-Umbach R. Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation. In: Proc. Interspeech 2020. ; 2020:2652-2656. doi:10.21437/Interspeech.2020-2388
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 20753 | OA
Ebbers J, Haeb-Umbach R. Forward-Backward Convolutional Recurrent Neural Networks and Tag-Conditioned Convolutional Neural Networks for Weakly Labeled Semi-Supervised Sound Event Detection. In: Proceedings of the Detection and Classification of Acoustic Scenes and Events 2020 Workshop (DCASE2020). ; 2020.
LibreCat | Files available
 

2019 | Journal Article | LibreCat-ID: 17762
Haeb-Umbach R. Lektionen für Alexa \& Co?! forschung. 2019;44(1):12-15. doi:10.1002/fors.201970104
LibreCat | DOI
 

2019 | Journal Article | LibreCat-ID: 19446 | OA
Drude L, Heitkaemper J, Boeddeker C, Haeb-Umbach R. SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition. ArXiv e-prints. 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 11965 | OA
Drude L, Heymann J, Haeb-Umbach R. Unsupervised training of neural mask-based beamforming. In: INTERSPEECH 2019, Graz, Austria. ; 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 12874 | OA
Drude L, Hasenklever D, Haeb-Umbach R. Unsupervised Training of a Deep Clustering Model for Multichannel Blind Source Separation. In: ICASSP 2019, Brighton, UK. ; 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 12875 | OA
Heymann J, Drude L, Haeb-Umbach R, Kinoshita K, Nakatani T. Joint Optimization of Neural Network-based WPE Dereverberation and Acoustic Model for Robust Online ASR. In: ICASSP 2019, Brighton, UK. ; 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 12876 | OA
Kurz G, Gilitschenski I, Pfaff F, et al. Directional Statistics and Filtering Using libDirectional. In: Journal of Statistical Software 89(4). ; 2019.
LibreCat | Files available
 

2019 | Journal Article | LibreCat-ID: 12890 | OA
Drude L, Haeb-Umbach R. Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation. IEEE Journal of Selected Topics in Signal Processing. 2019. doi:10.1109/JSTSP.2019.2912565
LibreCat | Files available | DOI
 

2019 | Conference Paper | LibreCat-ID: 15812 | OA
Heymann J, Khe Chai Sim BL. Improving CTC Using Stimulated Learning for Sequence Modeling. In: ICASSP 2019, Brighton, UK. ; 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 15816 | OA
Zorila C, Boeddeker C, Doddipatla R, Haeb-Umbach R. An Investigation Into the Effectiveness of Enhancement in ASR Training and Test for Chime-5 Dinner Party Transcription. In: ASRU 2019, Sentosa, Singapore. ; 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 14822 | OA
Heitkaemper J, Feher T, Freitag M, Haeb-Umbach R. A Study on Online Source Extraction in the Presence of Changing Speaker Positions. In: International Conference on Statistical Language and Speech Processing 2019, Ljubljana, Slovenia. ; 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 14824 | OA
Martin-Donas JM, Heitkaemper J, Haeb-Umbach R, Gomez AM, Peinado AM. Multi-Channel Block-Online Source Extraction based on Utterance Adaptation. In: INTERSPEECH 2019, Graz, Austria. ; 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 14826 | OA
Kanda N, Boeddeker C, Heitkaemper J, Fujita Y, Horiguchi S, Haeb-Umbach R. Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR. In: INTERSPEECH 2019, Graz, Austria. ; 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 13271 | OA
von Neumann T, Kinoshita K, Delcroix M, Araki S, Nakatani T, Haeb-Umbach R. All-neural Online Source Separation, Counting, and Diarization for Meeting Analysis. In: ICASSP 2019, Brighton, UK. ; 2019.
LibreCat | Files available
 

2019 | Journal Article | LibreCat-ID: 15814 | OA
Haeb-Umbach R, Watanabe S, Nakatani T, et al. Speech Processing for Digital Home Assistance: Combining Signal Processing With Deep-Learning Techniques. IEEE Signal Processing Magazine. 2019;36(6):111-124. doi:10.1109/MSP.2019.2918706
LibreCat | Files available | DOI
 

2019 | Journal Article | LibreCat-ID: 19450 | OA
Haeb-Umbach R. Lektionen für Alexa & Co?! DFG forschung 1/2019. Published online 2019:12-15. doi:10.1002/fors.201970104
LibreCat | Files available | DOI
 

2019 | Conference Paper | LibreCat-ID: 15237 | OA
Gburrek T, Glarner T, Ebbers J, Haeb-Umbach R, Wagner P. Unsupervised Learning of a Disentangled Speech Representation for Voice Conversion. In: Proc. 10th ISCA Speech Synthesis Workshop. ; 2019:81-86. doi:10.21437/SSW.2019-15
LibreCat | Files available | DOI | Download (ext.)
 

2019 | Conference Paper | LibreCat-ID: 15794 | OA
Ebbers J, Haeb-Umbach R. Convolutional Recurrent Neural Network and Data Augmentation for Audio Tagging with Noisy Labels and Minimal Supervision. In: DCASE2019 Workshop, New York, USA. ; 2019.
LibreCat | Files available
 

Filters and Search Terms

department=54

Search

Filter Publications

Display / Sort

Citation Style: AMA

Export / Embed