Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

267 Publications


2021 | Journal Article | LibreCat-ID: 22528
Geometry calibration in wireless acoustic sensor networks utilizing DoA and distance information
T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, EURASIP Journal on Audio, Speech, and Music Processing (2021).
LibreCat | DOI
 

2021 | Journal Article | LibreCat-ID: 21065
Far-Field Automatic Speech Recognition
R. Haeb-Umbach, J. Heymann, L. Drude, S. Watanabe, M. Delcroix, T. Nakatani, Proceedings of the IEEE 109 (2021) 124–148.
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 20695
Jointly Optimal Dereverberation and Beamforming
C. Boeddeker, T. Nakatani, K. Kinoshita, R. Haeb-Umbach, in: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020.
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 20753
Forward-Backward Convolutional Recurrent Neural Networks and Tag-Conditioned Convolutional Neural Networks for Weakly Labeled Semi-Supervised Sound Event Detection
J. Ebbers, R. Haeb-Umbach, in: Proceedings of the Detection and Classification of Acoustic Scenes and Events 2020 Workshop (DCASE2020), 2020.
LibreCat | Files available
 

2020 | Conference Paper | LibreCat-ID: 17763
Sprachtechnologien für Digitale Assistenten
R. Haeb-Umbach, in: R. Böck, I. Siegert, A. Wendemuth (Eds.), Studientexte Zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2020, TUDpress, Dresden, 2020, pp. 227–234.
LibreCat | Download (ext.)
 

2020 | Conference Paper | LibreCat-ID: 20766
Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation
K. Kinoshita, T.C. von Neumann, M. Delcroix, T. Nakatani, R. Haeb-Umbach, in: Proc. Interspeech 2020, 2020, pp. 2652–2656.
LibreCat | Files available | DOI
 

2020 | Journal Article | LibreCat-ID: 17598
Jointly optimal denoising, dereverberation, and source separation
T. Nakatani, C. Boeddeker, K. Kinoshita, R. Ikeshita, M. Delcroix, R. Haeb-Umbach, IEEE/ACM Transactions on Audio, Speech, and Language Processing (2020) 1–1.
LibreCat | DOI | Download (ext.)
 

2020 | Conference Paper | LibreCat-ID: 20700
Towards a speaker diarization system for the CHiME 2020 dinner party transcription
C. Boeddeker, T. Cord-Landwehr, J. Heitkaemper, C. Zorila, D. Hayakawa, M. Li, M. Liu, R. Doddipatla, R. Haeb-Umbach, in: Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments, 2020.
LibreCat | Files available
 

2020 | Conference Paper | LibreCat-ID: 20762
End-to-End Training of Time Domain Audio Separation and Recognition
T.C. von Neumann, K. Kinoshita, L. Drude, C. Boeddeker, M. Delcroix, T. Nakatani, R. Haeb-Umbach, in: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp. 7004–7008.
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 20504
Demystifying TasNet: A Dissecting Approach
J. Heitkaemper, D. Jakobeit, C. Boeddeker, L. Drude, R. Haeb-Umbach, in: ICASSP 2020 Virtual Barcelona Spain, 2020.
LibreCat | Files available
 

2020 | Conference Paper | LibreCat-ID: 18651
Deep Neural Network based Distance Estimation for Geometry Calibration in Acoustic Sensor Network
T. Gburrek, J. Schmalenstroeer, A. Brendel, W. Kellermann, R. Haeb-Umbach, in: European Signal Processing Conference (EUSIPCO), 2020.
LibreCat
 

2020 | Conference Paper | LibreCat-ID: 20505
Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments
J. Heitkaemper, J. Schmalenströer, R. Haeb-Umbach, in: INTERSPEECH 2020 Virtual Shanghai China, 2020.
LibreCat | Files available
 

2020 | Conference Paper | LibreCat-ID: 20764
Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR
T.C. von Neumann, C. Boeddeker, L. Drude, K. Kinoshita, M. Delcroix, T. Nakatani, R. Haeb-Umbach, T. von Neuann, in: Proc. Interspeech 2020, 2020, pp. 3097–3101.
LibreCat | Files available | DOI
 

2019 | Conference Paper | LibreCat-ID: 12875
Joint Optimization of Neural Network-based WPE Dereverberation and Acoustic Model for Robust Online ASR
J. Heymann, L. Drude, R. Haeb-Umbach, K. Kinoshita, T. Nakatani, in: ICASSP 2019, Brighton, UK, 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 14826
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR
N. Kanda, C. Boeddeker, J. Heitkaemper, Y. Fujita, S. Horiguchi, R. Haeb-Umbach, in: INTERSPEECH 2019, Graz, Austria, 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 15792
Privacy-preserving Variational Information Feature Extraction for Domestic Activity Monitoring Versus Speaker Identification
A. Nelus, J. Ebbers, R. Haeb-Umbach, R. Martin, in: INTERSPEECH 2019, Graz, Austria, 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 15812
Improving CTC Using Stimulated Learning for Sequence Modeling
J. Heymann, B.L. Khe Chai Sim, in: ICASSP 2019, Brighton, UK, 2019.
LibreCat | Files available
 

2019 | Journal Article | LibreCat-ID: 17762
Lektionen für Alexa \& Co?!
R. Haeb-Umbach, Forschung 44 (2019) 12–15.
LibreCat | DOI
 

2019 | Journal Article | LibreCat-ID: 19446
SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition
L. Drude, J. Heitkaemper, C. Boeddeker, R. Haeb-Umbach, ArXiv E-Prints (2019).
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 12876
Directional Statistics and Filtering Using libDirectional
G. Kurz, I. Gilitschenski, F. Pfaff, L. Drude, U.D. Hanebeck, R. Haeb-Umbach, R.Y. Siegwart, in: Journal of Statistical Software 89(4), 2019.
LibreCat | Files available
 

Filters and Search Terms

department=54

Search

Filter Publications

Display / Sort

Export / Embed