Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

317 Publications


2024 | Journal Article | LibreCat-ID: 52958 | OA
Boeddeker C, Subramanian AS, Wichern G, Haeb-Umbach R, Le Roux J. TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2024;32:1185-1197. doi:10.1109/taslp.2024.3350887
LibreCat | DOI | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 48269 | OA
Gburrek T, Schmalenstroeer J, Haeb-Umbach R. On the Integration of Sampling Rate Synchronization and Acoustic Beamforming. In: European Signal Processing Conference (EUSIPCO). ; 2023.
LibreCat | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 47128 | OA
Cord-Landwehr T, Boeddeker C, Zorilă C, Doddipatla R, Haeb-Umbach R. Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2023. doi:10.1109/icassp49357.2023.10095370
LibreCat | Files available | DOI
 

2023 | Conference Paper | LibreCat-ID: 48270 | OA
Schmalenstroeer J, Gburrek T, Haeb-Umbach R. LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices. In: ITG Conference on Speech Communication. ; 2023.
LibreCat | Files available
 

2023 | Conference Paper | LibreCat-ID: 47129 | OA
Cord-Landwehr T, Boeddeker C, Zorilă C, Doddipatla R, Haeb-Umbach R. A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures. In: INTERSPEECH 2023. ISCA; 2023. doi:10.21437/interspeech.2023-1379
LibreCat | Files available | DOI
 

2023 | Conference Paper | LibreCat-ID: 48355 | OA
Rautenberg F, Kuhlmann M, Wiechmann J, Seebauer F, Wagner P, Haeb-Umbach R. On Feature Importance and Interpretability of Speaker Representations. In: ITG Conference on Speech Communication. ; 2023.
LibreCat | Files available | Download (ext.) | arXiv
 

2023 | Conference Paper | LibreCat-ID: 48410 | OA
Wiechmann J, Rautenberg F, Wagner P, Haeb-Umbach R. Explaining voice characteristics to novice voice practitioners-How successful is it? In: 20th International Congress of the Phonetic Sciences (ICPhS) . ; 2023.
LibreCat | Files available | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 48391
Aralikatti R, Boeddeker C, Wichern G, Subramanian A, Le Roux J. Reverberation as Supervision For Speech Separation. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2023. doi:10.1109/icassp49357.2023.10095022
LibreCat | DOI
 

2023 | Conference Paper | LibreCat-ID: 48390
Berger S, Vieting P, Boeddeker C, Schlüter R, Haeb-Umbach R. Mixture Encoder for Joint Speech Separation and Recognition. In: INTERSPEECH 2023. ISCA; 2023. doi:10.21437/interspeech.2023-1815
LibreCat | DOI
 

2023 | Conference Paper | LibreCat-ID: 46069
Seebauer F, Kuhlmann M, Haeb-Umbach R, Wagner P. Re-examining the quality dimensions of synthetic speech. In: 12th Speech Synthesis Workshop (SSW) 2023. ; 2023.
LibreCat
 

2023 | Journal Article | LibreCat-ID: 35602 | OA
von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2023;31:576-589. doi:10.1109/taslp.2022.3228629
LibreCat | Files available | DOI
 

2023 | Conference Paper | LibreCat-ID: 48281 | OA
von Neumann T, Boeddeker C, Kinoshita K, Delcroix M, Haeb-Umbach R. On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2023. doi:10.1109/icassp49357.2023.10094784
LibreCat | Files available | DOI | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 48275 | OA
von Neumann T, Boeddeker C, Delcroix M, Haeb-Umbach R. MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. In: Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments. ; 2023.
LibreCat | Files available | Download (ext.)
 

2023 | Conference Paper | LibreCat-ID: 49109 | OA
Gburrek T, Schmalenstroeer J, Haeb-Umbach R. Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks. In: Proc. Asilomar Conference on Signals, Systems, and Computers. ; 2023.
LibreCat | Files available
 

2023 | Conference Paper | LibreCat-ID: 49111
Ebbers J, Haeb-Umbach R, Serizel R. Post-Processing Independent Evaluation of Sound Event Detection Systems. In: Proceedings of the 8th Detection and Classification of Acoustic Scenes and Events 2023 Workshop (DCASE2023). ; 2023:36–40.
LibreCat | Files available
 

2023 | Conference Paper | LibreCat-ID: 44849 | OA
Rautenberg F, Kuhlmann M, Ebbers J, et al. Speech Disentanglement for Analysis and Modification of Acoustic and Perceptual Speaker Characteristics. In: Fortschritte Der Akustik - DAGA 2023. ; 2023:1409-1412.
LibreCat | Files available | Download (ext.)
 

2022 | Journal Article | LibreCat-ID: 33669 | OA
Zhang W, Chang X, Boeddeker C, Nakatani T, Watanabe S, Qian Y. End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party. IEEE/ACM Transactions on Audio, Speech, and Language Processing. Published online 2022. doi:10.1109/TASLP.2022.3209942
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 33954 | OA
Boeddeker C, Cord-Landwehr T, von Neumann T, Haeb-Umbach R. An Initialization Scheme for Meeting Separation with Spatial Mixture Models. In: Interspeech 2022. ISCA; 2022. doi:10.21437/interspeech.2022-10929
LibreCat | DOI | Download (ext.)
 

2022 | Conference Paper | LibreCat-ID: 33471
Heitkämper J, Schmalenstroeer J, Haeb-Umbach R. Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels. In: Proceedings of the 30th European Signal Processing Conference (EUSIPCO).
LibreCat | Files available
 

2022 | Conference Paper | LibreCat-ID: 33806
Afifi H, Karl H, Gburrek T, Schmalenstroeer J. Data-driven Time Synchronization in Wireless Multimedia Networks. In: 2022 International Wireless Communications and Mobile Computing (IWCMC). IEEE; 2022. doi:10.1109/iwcmc55113.2022.9824980
LibreCat | DOI
 

2022 | Conference Paper | LibreCat-ID: 33958
Kinoshita K, von Neumann T, Delcroix M, Boeddeker C, Haeb-Umbach R. Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. In: Proc. Interspeech 2022. ISCA; 2022:1486-1490. doi:10.21437/Interspeech.2022-11408
LibreCat | DOI
 

2022 | Conference Paper | LibreCat-ID: 33819 | OA
von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. In: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2022. doi:10.1109/icassp43922.2022.9746757
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 33847 | OA
Cord-Landwehr T, von Neumann T, Boeddeker C, Haeb-Umbach R. MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). ; 2022.
LibreCat | Files available | arXiv
 

2022 | Conference Paper | LibreCat-ID: 33848 | OA
Cord-Landwehr T, Boeddeker C, von Neumann T, Zorila C, Doddipatla R, Haeb-Umbach R. Monaural source separation: From anechoic to reverberant environments. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE; 2022.
LibreCat | Files available | arXiv
 

2022 | Conference Paper | LibreCat-ID: 33807 | OA
Gburrek T, Schmalenstroeer J, Haeb-Umbach R. On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes. In: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2022. doi:10.1109/icassp43922.2022.9746284
LibreCat | Files available | DOI
 

2022 | Journal Article | LibreCat-ID: 33451 | OA
Grimm C, Fei T, Warsitz E, Farhoud R, Breddermann T, Haeb-Umbach R. Warping of Radar Data Into Camera Image for Cross-Modal Supervision in Automotive Applications. IEEE Transactions on Vehicular Technology. 2022;71(9):9435-9449. doi:10.1109/TVT.2022.3182411
LibreCat | Files available | DOI
 

2022 | Report | LibreCat-ID: 49113
Ebbers J, Haeb-Umbach R. Pre-Training And Self-Training For Sound Event Detection In Domestic Environments.; 2022.
LibreCat | Files available
 

2022 | Conference Paper | LibreCat-ID: 33696 | OA
Wiechmann J, Glarner T, Rautenberg F, Wagner P, Haeb-Umbach R. Technically enabled explaining of voice characteristics. In: 18. Phonetik Und Phonologie Im Deutschsprachigen Raum (P&P). ; 2022.
LibreCat | Files available
 

2022 | Conference Paper | LibreCat-ID: 33857 | OA
Kuhlmann M, Seebauer F, Ebbers J, Wagner P, Haeb-Umbach R. Investigation into Target Speaking Rate Adaptation for Voice Conversion. In: Interspeech 2022. ISCA; 2022. doi:10.21437/interspeech.2022-10740
LibreCat | Files available | DOI | Download (ext.)
 

2022 | Conference Paper | LibreCat-ID: 33808 | OA
Gburrek T, Schmalenstroeer J, Heitkaemper J, Haeb-Umbach R. Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription. In: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE; 2022. doi:10.1109/IWAENC53105.2022.9914772
LibreCat | Files available | DOI
 

2022 | Misc | LibreCat-ID: 33816 | OA
Gburrek T, Boeddeker C, von Neumann T, Cord-Landwehr T, Schmalenstroeer J, Haeb-Umbach R. A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv; 2022. doi:10.48550/ARXIV.2205.00944
LibreCat | Files available | DOI
 

2022 | Conference Paper | LibreCat-ID: 34072 | OA
Ebbers J, Haeb-Umbach R, Serizel R. Threshold Independent Evaluation of Sound Event Detection Scores. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2022.
LibreCat | Files available
 

2021 | Journal Article | LibreCat-ID: 21065 | OA
Haeb-Umbach R, Heymann J, Drude L, Watanabe S, Delcroix M, Nakatani T. Far-Field Automatic Speech Recognition. Proceedings of the IEEE. 2021;109(2):124-148. doi:10.1109/JPROC.2020.3018668
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 28256
Zhang W, Boeddeker C, Watanabe S, et al. End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2021. doi:10.1109/icassp39728.2021.9414464
LibreCat | DOI
 

2021 | Conference Paper | LibreCat-ID: 28262
Li C, Shi J, Zhang W, et al. ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration. In: 2021 IEEE Spoken Language Technology Workshop (SLT). ; 2021. doi:10.1109/slt48900.2021.9383615
LibreCat | DOI
 

2021 | Conference Paper | LibreCat-ID: 28261
Li C, Luo Y, Han C, et al. Dual-Path RNN for Long Recording Speech Separation. In: 2021 IEEE Spoken Language Technology Workshop (SLT). ; 2021. doi:10.1109/slt48900.2021.9383514
LibreCat | DOI
 

2021 | Conference Paper | LibreCat-ID: 24000
Heitkaemper J, Schmalenstroeer J, Ion V, Haeb-Umbach R. A Database for Research on Detection and Enhancement of Speech Transmitted over HF links. In: Speech Communication; 14th ITG-Symposium. ; 2021:1-5.
LibreCat
 

2021 | Conference Paper | LibreCat-ID: 44843 | OA
Boeddeker C, Rautenberg F, Haeb-Umbach R. A Comparison and Combination of Unsupervised Blind Source Separation  Techniques. In: ITG Conference on Speech Communication. ; 2021.
LibreCat | Files available | Download (ext.) | arXiv
 

2021 | Conference Paper | LibreCat-ID: 28259 | OA
Boeddeker C, Zhang W, Nakatani T, et al. Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2021. doi:10.1109/icassp39728.2021.9414661
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 23998 | OA
Schmalenstroeer J, Heitkaemper J, Ullmann J, Haeb-Umbach R. Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech. In: 29th European Signal Processing Conference (EUSIPCO). ; 2021:1-5.
LibreCat | Download (ext.)
 

2021 | Journal Article | LibreCat-ID: 22528 | OA
Gburrek T, Schmalenstroeer J, Haeb-Umbach R. Geometry calibration in wireless acoustic sensor networks utilizing DoA and distance information. EURASIP Journal on Audio, Speech, and Music Processing. Published online 2021. doi:10.1186/s13636-021-00210-x
LibreCat | DOI | Download (ext.)
 

2021 | Conference Paper | LibreCat-ID: 23994 | OA
Gburrek T, Schmalenstroeer J, Haeb-Umbach R. Iterative Geometry Calibration from Distance Estimates for Wireless Acoustic Sensor Networks. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2021. doi:10.1109/icassp39728.2021.9413831
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 23999 | OA
Gburrek T, Schmalenstroeer J, Haeb-Umbach R. On Source-Microphone Distance Estimation Using Convolutional Recurrent Neural Networks. In: Speech Communication; 14th ITG-Symposium. ; 2021:1-5.
LibreCat | Files available
 

2021 | Conference Paper | LibreCat-ID: 23997 | OA
Chinaev A, Enzner G, Gburrek T, Schmalenstroeer J. Online Estimation of Sampling Rate Offsets in Wireless Acoustic Sensor Networks with Packet Loss. In: 29th European Signal Processing Conference (EUSIPCO). ; 2021:1-5.
LibreCat | Download (ext.)
 

2021 | Conference Paper | LibreCat-ID: 29304 | OA
Ebbers J, Kuhlmann M, Cord-Landwehr T, Haeb-Umbach R. Contrastive Predictive Coding Supported Factorized Variational Autoencoder for Unsupervised Learning of Disentangled Speech Representations. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2021:3860–3864.
LibreCat | Files available
 

2021 | Conference Paper | LibreCat-ID: 26770 | OA
von Neumann T, Kinoshita K, Boeddeker C, Delcroix M, Haeb-Umbach R. Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. In: Interspeech 2021. ; 2021. doi:10.21437/interspeech.2021-1177
LibreCat | Files available | DOI
 

2021 | Conference Paper | LibreCat-ID: 29173 | OA
von Neumann T, Boeddeker C, Kinoshita K, Delcroix M, Haeb-Umbach R. Speeding Up Permutation Invariant Training for Source Separation. In: Speech Communication; 14th ITG Conference. ; 2021.
LibreCat | Files available
 

2021 | Conference Paper | LibreCat-ID: 29308 | OA
Ebbers J, Haeb-Umbach R. Self-Trained Audio Tagging and Sound Event Detection in Domestic Environments. In: Proceedings of the 6th Detection and Classification of Acoustic Scenes and Events 2021 Workshop (DCASE2021). ; 2021:226–230.
LibreCat | Files available
 

2021 | Conference Paper | LibreCat-ID: 29306 | OA
Ebbers J, Keyser MC, Haeb-Umbach R. Adapting Sound Recognition to A New Environment Via Self-Training. In: Proceedings of the 29th European Signal Processing Conference (EUSIPCO). ; 2021:1135–1139.
LibreCat | Files available
 

2021 | Journal Article | LibreCat-ID: 24456 | OA
Rohlfing KJ, Cimiano P, Scharlau I, et al. Explanation as a Social Practice: Toward a Conceptual Framework for the Social Design of AI Systems. IEEE Transactions on Cognitive and Developmental Systems. 2021;13(3):717-728. doi:10.1109/tcds.2020.3044366
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 17763 | OA
Haeb-Umbach R. Sprachtechnologien für Digitale Assistenten. In: Böck R, Siegert I, Wendemuth A, eds. Studientexte Zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2020. TUDpress, Dresden; 2020:227-234.
LibreCat | Download (ext.)
 

2020 | Conference Paper | LibreCat-ID: 20695 | OA
Boeddeker C, Nakatani T, Kinoshita K, Haeb-Umbach R. Jointly Optimal Dereverberation and Beamforming. In: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2020. doi:10.1109/icassp40776.2020.9054393
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 20700 | OA
Boeddeker C, Cord-Landwehr T, Heitkaemper J, et al. Towards a speaker diarization system for the CHiME 2020 dinner party transcription. In: Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments. ; 2020.
LibreCat | Files available
 

2020 | Journal Article | LibreCat-ID: 17598 | OA
Nakatani T, Boeddeker C, Kinoshita K, Ikeshita R, Delcroix M, Haeb-Umbach R. Jointly optimal denoising, dereverberation, and source separation. IEEE/ACM Transactions on Audio, Speech, and Language Processing. Published online 2020:1-1. doi:10.1109/TASLP.2020.3013118
LibreCat | DOI | Download (ext.)
 

2020 | Conference Paper | LibreCat-ID: 20504
Heitkaemper J, Jakobeit D, Boeddeker C, Drude L, Haeb-Umbach R. Demystifying TasNet: A Dissecting Approach. In: ICASSP 2020 Virtual Barcelona Spain. ; 2020.
LibreCat | Files available
 

2020 | Preprint | LibreCat-ID: 28263
Watanabe S, Mandel M, Barker J, et al. CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for  Unsegmented Recordings. arXiv:200409249. Published online 2020.
LibreCat
 

2020 | Conference Paper | LibreCat-ID: 20505
Heitkaemper J, Schmalenstroeer J, Haeb-Umbach R. Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments. In: INTERSPEECH 2020 Virtual Shanghai China. ; 2020.
LibreCat | Files available
 

2020 | Conference Paper | LibreCat-ID: 20762 | OA
von Neumann T, Kinoshita K, Drude L, et al. End-to-End Training of Time Domain Audio Separation and Recognition. In: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2020:7004-7008. doi:10.1109/ICASSP40776.2020.9053461
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 20764 | OA
von Neumann T, Boeddeker C, Drude L, et al. Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. In: Proc. Interspeech 2020. ; 2020:3097-3101. doi:10.21437/Interspeech.2020-2519
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 18651 | OA
Gburrek T, Schmalenstroeer J, Brendel A, Kellermann W, Haeb-Umbach R. Deep Neural Network based Distance Estimation for Geometry Calibration in Acoustic Sensor Network. In: European Signal Processing Conference (EUSIPCO). ; 2020.
LibreCat | Files available
 

2020 | Conference Paper | LibreCat-ID: 20766 | OA
Kinoshita K, von Neumann T, Delcroix M, Nakatani T, Haeb-Umbach R. Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation. In: Proc. Interspeech 2020. ; 2020:2652-2656. doi:10.21437/Interspeech.2020-2388
LibreCat | Files available | DOI
 

2020 | Conference Paper | LibreCat-ID: 20753 | OA
Ebbers J, Haeb-Umbach R. Forward-Backward Convolutional Recurrent Neural Networks and Tag-Conditioned Convolutional Neural Networks for Weakly Labeled Semi-Supervised Sound Event Detection. In: Proceedings of the Detection and Classification of Acoustic Scenes and Events 2020 Workshop (DCASE2020). ; 2020.
LibreCat | Files available
 

2019 | Journal Article | LibreCat-ID: 17762
Haeb-Umbach R. Lektionen für Alexa \& Co?! forschung. 2019;44(1):12-15. doi:10.1002/fors.201970104
LibreCat | DOI
 

2019 | Journal Article | LibreCat-ID: 19446 | OA
Drude L, Heitkaemper J, Boeddeker C, Haeb-Umbach R. SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition. ArXiv e-prints. 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 11965 | OA
Drude L, Heymann J, Haeb-Umbach R. Unsupervised training of neural mask-based beamforming. In: INTERSPEECH 2019, Graz, Austria. ; 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 12874 | OA
Drude L, Hasenklever D, Haeb-Umbach R. Unsupervised Training of a Deep Clustering Model for Multichannel Blind Source Separation. In: ICASSP 2019, Brighton, UK. ; 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 12875 | OA
Heymann J, Drude L, Haeb-Umbach R, Kinoshita K, Nakatani T. Joint Optimization of Neural Network-based WPE Dereverberation and Acoustic Model for Robust Online ASR. In: ICASSP 2019, Brighton, UK. ; 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 12876 | OA
Kurz G, Gilitschenski I, Pfaff F, et al. Directional Statistics and Filtering Using libDirectional. In: Journal of Statistical Software 89(4). ; 2019.
LibreCat | Files available
 

2019 | Journal Article | LibreCat-ID: 12890 | OA
Drude L, Haeb-Umbach R. Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation. IEEE Journal of Selected Topics in Signal Processing. 2019. doi:10.1109/JSTSP.2019.2912565
LibreCat | Files available | DOI
 

2019 | Conference Paper | LibreCat-ID: 15812 | OA
Heymann J, Khe Chai Sim BL. Improving CTC Using Stimulated Learning for Sequence Modeling. In: ICASSP 2019, Brighton, UK. ; 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 15816 | OA
Zorila C, Boeddeker C, Doddipatla R, Haeb-Umbach R. An Investigation Into the Effectiveness of Enhancement in ASR Training and Test for Chime-5 Dinner Party Transcription. In: ASRU 2019, Sentosa, Singapore. ; 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 14822 | OA
Heitkaemper J, Feher T, Freitag M, Haeb-Umbach R. A Study on Online Source Extraction in the Presence of Changing Speaker Positions. In: International Conference on Statistical Language and Speech Processing 2019, Ljubljana, Slovenia. ; 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 14824 | OA
Martin-Donas JM, Heitkaemper J, Haeb-Umbach R, Gomez AM, Peinado AM. Multi-Channel Block-Online Source Extraction based on Utterance Adaptation. In: INTERSPEECH 2019, Graz, Austria. ; 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 14826 | OA
Kanda N, Boeddeker C, Heitkaemper J, Fujita Y, Horiguchi S, Haeb-Umbach R. Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR. In: INTERSPEECH 2019, Graz, Austria. ; 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 13271 | OA
von Neumann T, Kinoshita K, Delcroix M, Araki S, Nakatani T, Haeb-Umbach R. All-neural Online Source Separation, Counting, and Diarization for Meeting Analysis. In: ICASSP 2019, Brighton, UK. ; 2019.
LibreCat | Files available
 

2019 | Journal Article | LibreCat-ID: 15814 | OA
Haeb-Umbach R, Watanabe S, Nakatani T, et al. Speech Processing for Digital Home Assistance: Combining Signal Processing With Deep-Learning Techniques. IEEE Signal Processing Magazine. 2019;36(6):111-124. doi:10.1109/MSP.2019.2918706
LibreCat | Files available | DOI
 

2019 | Journal Article | LibreCat-ID: 19450 | OA
Haeb-Umbach R. Lektionen für Alexa & Co?! DFG forschung 1/2019. Published online 2019:12-15. doi:10.1002/fors.201970104
LibreCat | Files available | DOI
 

2019 | Conference Paper | LibreCat-ID: 15237 | OA
Gburrek T, Glarner T, Ebbers J, Haeb-Umbach R, Wagner P. Unsupervised Learning of a Disentangled Speech Representation for Voice Conversion. In: Proc. 10th ISCA Speech Synthesis Workshop. ; 2019:81-86. doi:10.21437/SSW.2019-15
LibreCat | Files available | DOI | Download (ext.)
 

2019 | Conference Paper | LibreCat-ID: 15794 | OA
Ebbers J, Haeb-Umbach R. Convolutional Recurrent Neural Network and Data Augmentation for Audio Tagging with Noisy Labels and Minimal Supervision. In: DCASE2019 Workshop, New York, USA. ; 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 15796 | OA
Ebbers J, Drude L, Haeb-Umbach R, Brendel A, Kellermann W. Weakly Supervised Sound Activity Detection and Event Classification in Acoustic Sensor Networks. In: CAMSAP 2019, Guadeloupe, West Indies. ; 2019.
LibreCat | Files available
 

2019 | Conference Paper | LibreCat-ID: 15792 | OA
Nelus A, Ebbers J, Haeb-Umbach R, Martin R. Privacy-preserving Variational Information Feature Extraction for Domestic Activity Monitoring Versus Speaker Identification. In: INTERSPEECH 2019, Graz, Austria. ; 2019.
LibreCat | Files available
 

2018 | Conference Paper | LibreCat-ID: 18107
Heymann J, Bacchiani M, Sainath TN. Performance of Mask Based Statistical Beamforming in a Smart Home Scenario. In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ; 2018:6722-6726. doi:10.1109/ICASSP.2018.8462372
LibreCat | DOI
 

2018 | Conference Paper | LibreCat-ID: 11760 | OA
Ebbers J, Nelus A, Martin R, Haeb-Umbach R. Evaluation of Modulation-MFCC Features and DNN Classification for Acoustic Event Detection. In: DAGA 2018, München. ; 2018.
LibreCat | Download (ext.)
 

2018 | Conference Paper | LibreCat-ID: 11835 | OA
Heymann J, Drude L, Haeb-Umbach R, Kinoshita K, Nakatani T. Frame-Online DNN-WPE Dereverberation. In: IWAENC 2018, Tokio, Japan. ; 2018.
LibreCat | Files available | Download (ext.)
 

2018 | Conference Paper | LibreCat-ID: 11837 | OA
Heitkaemper J, Heymann J, Haeb-Umbach R. Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming. In: ITG 2018, Oldenburg, Germany. ; 2018.
LibreCat | Files available | Download (ext.)
 

2018 | Conference Paper | LibreCat-ID: 11872 | OA
Drude L, Boeddeker C, Heymann J, et al. Integration neural network based beamforming and weighted prediction error dereverberation. In: INTERSPEECH 2018, Hyderabad, India. ; 2018.
LibreCat | Files available | Download (ext.)
 

2018 | Conference Paper | LibreCat-ID: 11873 | OA
Drude L, Heymann J, Boeddeker C, Haeb-Umbach R. NARA-WPE: A Python package for weighted prediction error dereverberation in Numpy and Tensorflow for online and offline processing. In: ITG 2018, Oldenburg, Germany. ; 2018.
LibreCat | Files available | Download (ext.)
 

2018 | Journal Article | LibreCat-ID: 11916 | OA
Despotovic V, Walter O, Haeb-Umbach R. Machine learning techniques for semantic analysis of dysarthric speech: An experimental study. Speech Communication 99 (2018) 242-251 (Elsevier BV). 2018.
LibreCat | Download (ext.)
 

2018 | Conference Paper | LibreCat-ID: 12898 | OA
Drude L, von Neumann T, Haeb-Umbach R. Deep Attractor Networks for Speaker Re-Identifikation and Blind Source Separation. In: ICASSP 2018, Calgary, Canada. ; 2018.
LibreCat | Files available | Download (ext.)
 

2018 | Conference Paper | LibreCat-ID: 12900 | OA
Drude L, Higuchi, Takuya , Kinoshita K, Nakatani T, Haeb-Umbach R. Dual Frequency- and Block-Permutation Alignment for Deep Learning Based Block-Online Blind Source Separation. In: ICASSP 2018, Calgary, Canada. ; 2018.
LibreCat | Files available | Download (ext.)
 

2018 | Conference Paper | LibreCat-ID: 12901 | OA
Boeddeker C, Erdogan H, Yoshioka T, Haeb-Umbach R. Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition. In: ICASSP 2018, Calgary, Canada. ; 2018.
LibreCat | Files available | Download (ext.)
 

2018 | Conference Paper | LibreCat-ID: 29923 | OA
Watanabe S, Hori T, Karita S, et al. ESPnet: End-to-End Speech Processing Toolkit. In: INTERSPEECH 2018, Hyderabad, India. ; 2018:2207–2211. doi:10.21437/Interspeech.2018-1456
LibreCat | Files available | DOI
 

2018 | Conference Paper | LibreCat-ID: 12899 | OA
Boeddeker C, Heitkaemper J, Schmalenstroeer J, Drude L, Heymann J, Haeb-Umbach R. Front-End Processing for the CHiME-5 Dinner Party Scenario. In: Proc. CHiME 2018 Workshop on Speech Processing in Everyday Environments, Hyderabad, India. ; 2018.
LibreCat | Files available | Download (ext.)
 

2018 | Conference Paper | LibreCat-ID: 6859
Afifi H, Schmalenstroeer J, Ullmann J, Haeb-Umbach R, Karl H. MARVELO - A Framework for Signal Processing in Wireless Acoustic Sensor Networks. In: Speech Communication; 13th ITG-Symposium. ; 2018:1-5.
LibreCat
 

2018 | Conference Paper | LibreCat-ID: 11747 | OA
Grimm C, Breddermann T, Farhoud R, Fei T, Warsitz E, Haeb-Umbach R. Discrimination of Stationary from Moving Targets with Recurrent Neural Networks in Automotive Radar. In: International Conference on Microwaves for Intelligent Mobility (ICMIM) 2018. ; 2018.
LibreCat | Download (ext.)
 

2018 | Conference Paper | LibreCat-ID: 11907 | OA
Glarner T, Hanebrink P, Ebbers J, Haeb-Umbach R. Full Bayesian Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery. In: INTERSPEECH 2018, Hyderabad, India. ; 2018.
LibreCat | Files available | Download (ext.)
 

2018 | Conference Paper | LibreCat-ID: 11838 | OA
Schmalenstroeer J, Haeb-Umbach R. Efficient Sampling Rate Offset Compensation - An Overlap-Save Based Approach. In: 26th European Signal Processing Conference (EUSIPCO 2018). ; 2018.
LibreCat | Download (ext.)
 

2018 | Conference Paper | LibreCat-ID: 11876 | OA
Kitza M, Michel W, Boeddeker C, et al. The RWTH/UPB System Combination for the CHiME 2018 Workshop. In: Proc. CHiME 2018 Workshop on Speech Processing in Everyday Environments, Hyderabad, India. ; 2018.
LibreCat | Download (ext.)
 

2018 | Conference Paper | LibreCat-ID: 11836 | OA
Ebbers J, Heitkaemper J, Schmalenstroeer J, Haeb-Umbach R. Benchmarking Neural Network Architectures for Acoustic Sensor Networks. In: ITG 2018, Oldenburg, Germany. ; 2018.
LibreCat | Files available | Download (ext.)
 

2018 | Conference Paper | LibreCat-ID: 11839 | OA
Schmalenstroeer J, Haeb-Umbach R. Insights into the Interplay of Sampling Rate Offsets and MVDR Beamforming. In: ITG 2018, Oldenburg, Germany. ; 2018.
LibreCat | Download (ext.)
 

2017 | Conference Paper | LibreCat-ID: 11717 | OA
Arora P, Haeb-Umbach R. A Study on Transfer Learning for Acoustic Event Detection in a Real Life Scenario. In: IEEE 19th International Workshop on Multimedia Signal Processing (MMSP). ; 2017.
LibreCat | Files available | Download (ext.)
 

2017 | Report | LibreCat-ID: 11735 | OA
Boeddeker C, Hanebrink P, Drude L, Heymann J, Haeb-Umbach R. On the Computation of Complex-Valued Gradients with Application to Statistically Optimum Beamforming.; 2017.
LibreCat | Download (ext.)
 

2017 | Conference Paper | LibreCat-ID: 11736 | OA
Boeddeker C, Hanebrink P, Drude L, Heymann J, Haeb-Umbach R. Optimizing Neural-Network Supported Acoustic Beamforming by Algorithmic Differentiation. In: Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP). ; 2017.
LibreCat | Download (ext.)
 

2017 | Conference Paper | LibreCat-ID: 11737 | OA
Chinaev A, Haeb-Umbach R. A Generalized Log-Spectral Amplitude Estimator for Single-Channel Speech Enhancement. In: Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP). ; 2017.
LibreCat | Files available | Download (ext.)
 

2017 | Conference Paper | LibreCat-ID: 11754 | OA
Drude L, Haeb-Umbach R. Tight integration of spatial and spectral features for BSS with Deep Clustering embeddings. In: INTERSPEECH 2017, Stockholm, Schweden. ; 2017.
LibreCat | Files available | Download (ext.)
 

2017 | Conference Paper | LibreCat-ID: 11770 | OA
Glarner T, Boenninghoff B, Walter O, Haeb-Umbach R. Leveraging Text Data for Word Segmentation for Underresourced Languages. In: INTERSPEECH 2017, Stockholm, Schweden. ; 2017.
LibreCat | Files available | Download (ext.)
 

2017 | Conference Paper | LibreCat-ID: 11809 | OA
Heymann J, Drude L, Boeddeker C, Hanebrink P, Haeb-Umbach R. BEAMNET: End-to-End Training of a Beamformer-Supported Multi-Channel ASR System. In: Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP). ; 2017.
LibreCat | Files available | Download (ext.)
 

2017 | Journal Article | LibreCat-ID: 11811 | OA
Heymann J, Drude L, Haeb-Umbach R. A Generic Neural Acoustic Beamforming Architecture for Robust Multi-Channel Speech Processing. Computer Speech and Language. 2017.
LibreCat | Download (ext.)
 

2017 | Patent | LibreCat-ID: 12081
Jacob F, Schmalenstroeer J. Building or Enclosure Termination Closing and/or Opening Apparatus, and Method for Operating a Building or Enclosure Termination. 2017.
LibreCat
 

2017 | Conference Paper | LibreCat-ID: 11763 | OA
Fei T, Grimm C, Farhoud R, Breddermann T, Warsitz E, Haeb-Umbach R. A Novel Target Separation Algorithm Applied to The Two-Dimensional Spectrum for FMCW Automotive Radar Systems. In: IEEE International Conference on Microwave, Communications, Anthenas and Electronic Systems. ; 2017.
LibreCat | Download (ext.)
 

2017 | Conference Paper | LibreCat-ID: 11772 | OA
Grimm C, Breddermann T, Farhoud R, Fei T, Warsitz E, Haeb-Umbach R. Hypothesis Test for the Detection of Moving Targets in Automotive Radar. In: IEEE International Conference on Microwave, Communications, Anthenas and Electronic Systems (COMCAS). ; 2017.
LibreCat | Download (ext.)
 

2017 | Conference Paper | LibreCat-ID: 11759 | OA
Ebbers J, Heymann J, Drude L, Glarner T, Haeb-Umbach R, Raj B. Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery. In: INTERSPEECH 2017, Stockholm, Schweden. ; 2017.
LibreCat | Files available | Download (ext.)
 

2017 | Conference Paper | LibreCat-ID: 11895 | OA
Schmalenstroeer J, Heymann J, Drude L, Boeddeker C, Haeb-Umbach R. Multi-Stage Coherence Drift Based Sampling Rate Synchronization for Acoustic Beamforming. In: IEEE 19th International Workshop on Multimedia Signal Processing (MMSP). ; 2017.
LibreCat | Files available | Download (ext.)
 

2017 | Conference Paper | LibreCat-ID: 11773 | OA
Grimm C, Farhoud R, Fei T, Warsitz E, Haeb-Umbach R. Detection of Moving Targets in Automotive Radar with Distorted Ego-Velocity Information. In: IEEE Microwaves, Radar and Remote Sensing Symposium (MRRS). ; 2017.
LibreCat | Download (ext.)
 

2016 | Conference Paper | LibreCat-ID: 11738 | OA
Chinaev A, Haeb-Umbach R. A Priori SNR Estimation Using a Generalized Decision Directed Approach. In: INTERSPEECH 2016, San Francisco, USA. ; 2016.
LibreCat | Files available | Download (ext.)
 

2016 | Conference Paper | LibreCat-ID: 11743 | OA
Chinaev A, Heitkaemper J, Haeb-Umbach R. A Priori SNR Estimation Using Weibull Mixture Model. In: 12. ITG Fachtagung Sprachkommunikation (ITG 2016). ; 2016.
LibreCat | Files available | Download (ext.)
 

2016 | Conference Paper | LibreCat-ID: 11744 | OA
Chinaev A, Heymann J, Drude L, Haeb-Umbach R. Noise-Presence-Probability-Based Noise PSD Estimation by Using DNNs. In: 12. ITG Fachtagung Sprachkommunikation (ITG 2016). ; 2016.
LibreCat | Files available | Download (ext.)
 

2016 | Conference Paper | LibreCat-ID: 11751 | OA
Drude L, Boeddeker C, Haeb-Umbach R. Blind Speech Separation based on Complex Spherical k-Mode Clustering. In: Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP). ; 2016.
LibreCat | Files available | Download (ext.)
 

2016 | Conference Paper | LibreCat-ID: 11756 | OA
Drude L, Raj B, Haeb-Umbach R. On the appropriateness of complex-valued neural networks for speech enhancement. In: INTERSPEECH 2016, San Francisco, USA. ; 2016.
LibreCat | Files available | Download (ext.)
 

2016 | Conference Paper | LibreCat-ID: 11771 | OA
Glarner T, Mahdi Momenzadeh M, Drude L, Haeb-Umbach R. Factor Graph Decoding for Speech Presence Probability Estimation. In: 12. ITG Fachtagung Sprachkommunikation (ITG 2016). ; 2016.
LibreCat | Files available | Download (ext.)
 

2016 | Conference Paper | LibreCat-ID: 11812 | OA
Heymann J, Drude L, Haeb-Umbach R. Neural Network Based Spectral Mask Estimation for Acoustic Beamforming. In: Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP). ; 2016.
LibreCat | Files available | Download (ext.)
 

2016 | Conference Paper | LibreCat-ID: 11829 | OA
Jacob F, Haeb-Umbach R. On the Bias of Direction of Arrival Estimation Using Linear Microphone Arrays. In: 12. ITG Fachtagung Sprachkommunikation (ITG 2016). ; 2016.
LibreCat | Files available | Download (ext.)
 

2016 | Conference Paper | LibreCat-ID: 11834 | OA
Heymann J, Drude L, Haeb-Umbach R. Wide Residual BLSTM Network with Discriminative Speaker Adaptation for Robust Speech Recognition. In: Computer Speech and Language. ; 2016.
LibreCat | Files available | Download (ext.)
 

2016 | Journal Article | LibreCat-ID: 11840 | OA
Kinoshita K, Delcroix M, Gannot S, et al. A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research. EURASIP Journal on Advances in Signal Processing. 2016.
LibreCat | Download (ext.)
 

2016 | Journal Article | LibreCat-ID: 11886
Plinge A, Jacob F, Haeb-Umbach R, Fink GA. Acoustic Microphone Geometry Calibration: An overview and experimental evaluation of state-of-the-art algorithms. IEEE Signal Processing Magazine. 2016;33(4):14-29. doi:10.1109/MSP.2016.2555198
LibreCat | DOI
 

2016 | Conference Paper | LibreCat-ID: 11908 | OA
Menne T, Heymann J, Alexandridis A, et al. The RWTH/UPB/FORTH System Combination for the 4th CHiME Challenge Evaluation. In: Computer Speech and Language. ; 2016.
LibreCat | Download (ext.)
 

2016 | Conference Paper | LibreCat-ID: 11920 | OA
Walter O, Haeb-Umbach R. Unsupervised Word Discovery from Speech using Bayesian Hierarchical Models. In: 38th German Conference on Pattern Recognition (GCPR 2016). ; 2016.
LibreCat | Files available | Download (ext.)
 

2016 | Conference Paper | LibreCat-ID: 11890 | OA
Schmalenstroeer J, Haeb-Umbach R. Investigations into Bluetooth Low Energy Localization Precision Limits. In: 24th European Signal Processing Conference (EUSIPCO 2016). ; 2016.
LibreCat | Files available | Download (ext.)
 

2015 | Conference Paper | LibreCat-ID: 11739 | OA
Chinaev A, Haeb-Umbach R. On Optimal Smoothing in Minimum Statistics Based Noise Tracking. In: Interspeech 2015. ; 2015:1785-1789.
LibreCat | Files available | Download (ext.)
 

2015 | Conference Paper | LibreCat-ID: 11748 | OA
Despotovic V, Walter O, Haeb-Umbach R. Semantic Analysis of Spoken Input using Markov Logic Networks. In: INTERSPEECH 2015. ; 2015.
LibreCat | Files available | Download (ext.)
 

2015 | Conference Paper | LibreCat-ID: 11755 | OA
Drude L, Jacob F, Haeb-Umbach R. DOA-Estimation based on a Complex Watson Kernel Method. In: 23th European Signal Processing Conference (EUSIPCO 2015). ; 2015.
LibreCat | Files available | Download (ext.)
 

2015 | Conference Paper | LibreCat-ID: 11810
Heymann J, Drude L, Chinaev A, Haeb-Umbach R. BLSTM supported GEV Beamformer Front-End for the 3RD CHiME Challenge. In: Automatic Speech Recognition and Understanding Workshop (ASRU 2015). ; 2015.
LibreCat
 

2015 | Conference Paper | LibreCat-ID: 11813 | OA
Heymann J, Haeb-Umbach R, Golik P, Schlueter R. Unsupervised adaptation of a denoising autoencoder by Bayesian Feature Enhancement for reverberant asr under mismatch conditions. In: Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference On. ; 2015:5053-5057. doi:10.1109/ICASSP.2015.7178933
LibreCat | DOI | Download (ext.)
 

2015 | Journal Article | LibreCat-ID: 11830 | OA
Jacob F, Haeb-Umbach R. Absolute Geometry Calibration of Distributed Microphone Arrays in an Audio-Visual Sensor Network. ArXiv e-prints. 2015.
LibreCat | Download (ext.)
 

2015 | Book | LibreCat-ID: 11868 | OA
Li J, Deng L, Haeb-Umbach R, Gong Y. Robust Automatic Speech Recognition. Elsevier; 2015.
LibreCat | Files available | Download (ext.)
 

2015 | Conference Paper | LibreCat-ID: 11875 | OA
Marchi E, Schuller B, Baron-Cohen S, et al. Typicality and Emotion in the Voice of Children with Autism Spectrum Condition: Evidence Across Three Languages. In: INTERSPEECH 2015. ; 2015.
LibreCat | Download (ext.)
 

2015 | Conference Paper | LibreCat-ID: 11919 | OA
Walter O, Drude L, Haeb-Umbach R. Source Counting in Speech Mixtures by Nonparametric Bayesian Estimation of an infinite Gaussian Mixture Model. In: 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2015). ; 2015.
LibreCat | Files available | Download (ext.)
 

2015 | Journal Article | LibreCat-ID: 11922 | OA
Walter O, Haeb-Umbach R, Mokbel B, Paassen B, Hammer B. Autonomous Learning of Representations. KI - Kuenstliche Intelligenz. 2015:1-13. doi:http://dx.doi.org/10.1007/s13218-015-0372-1
LibreCat | DOI | Download (ext.)
 

2015 | Report | LibreCat-ID: 11923 | OA
Walter O, Haeb-Umbach R, Strunk J, P. Himmelmann N. Lexicon Discovery for Language Preservation Using Unsupervised Word Segmentation with Pitman-Yor Language Models (FGNT-2015-01).; 2015.
LibreCat | Download (ext.)
 

2015 | Conference Paper | LibreCat-ID: 11874 | OA
Hoang MK, Schmalenstroeer J, Haeb-Umbach R. Aligning training models with smartphone properties in WiFi fingerprinting based indoor localization. In: 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2015). ; 2015.
LibreCat | Download (ext.)
 

2014 | Conference Paper | LibreCat-ID: 11746 | OA
Chinaev A, Puels M, Haeb-Umbach R. Spectral Noise Tracking for Improved Nonstationary Noise Robust ASR. In: 11. ITG Fachtagung Sprachkommunikation (ITG 2014). ; 2014.
LibreCat | Files available | Download (ext.)
 

2014 | Conference Paper | LibreCat-ID: 11752 | OA
Drude L, Chinaev A, Tran Vu DH, Haeb-Umbach R. Source Counting in Speech Mixtures Using a Variational EM Approach for Complexwatson Mixture Models. In: 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014). ; 2014.
LibreCat | Files available | Download (ext.)
 

2014 | Conference Paper | LibreCat-ID: 11753 | OA
Drude L, Chinaev A, Tran Vu DH, Haeb-Umbach R. Towards Online Source Counting in Speech Mixtures Applying a Variational EM for Complex Watson Mixture Models. In: 14th International Workshop on Acoustic Signal Enhancement (IWAENC 2014). ; 2014:213-217.
LibreCat | Files available | Download (ext.)
 

2014 | Conference Paper | LibreCat-ID: 11814 | OA
Heymann J, Walter O, Haeb-Umbach R, Raj B. Iterative Bayesian Word Segmentation for Unspuervised Vocabulary Discovery from Phoneme Lattices. In: 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014). ; 2014.
LibreCat | Files available | Download (ext.)
 

2014 | Conference Paper | LibreCat-ID: 11831 | OA
Jacob F, Haeb-Umbach R. Coordinate Mapping Between an Acoustic and Visual Sensor Network in the Shape Domain for a Joint Self-Calibrating Speaker Tracking. In: 11. ITG Fachtagung Sprachkommunikation (ITG 2014). ; 2014.
LibreCat | Files available | Download (ext.)
 

2014 | Journal Article | LibreCat-ID: 11861
Leutnant V, Krueger A, Haeb-Umbach R. A New Observation Model in the Logarithmic Mel Power Spectral Domain for the Automatic Recognition of Noisy Reverberant Speech. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2014;22(1):95-109. doi:10.1109/TASLP.2013.2285480
LibreCat | DOI
 

2014 | Journal Article | LibreCat-ID: 11867 | OA
Li J, Deng L, Gong Y, Haeb-Umbach R. An Overview of Noise-Robust Automatic Speech Recognition. IEEE Transactions on Audio, Speech and Language Processing. 2014;22(4):745-777. doi:10.1109/TASLP.2014.2304637
LibreCat | DOI | Download (ext.)
 

2014 | Conference Paper | LibreCat-ID: 11918 | OA
Walter O, Despotovic V, Haeb-Umbach R, Gemmeke J, Ons B, Van hamme H. An Evaluation of Unsupervised Acoustic Model Training for a Dysarthric Speech Interface. In: INTERSPEECH 2014. ; 2014.
LibreCat | Files available | Download (ext.)
 

2014 | Journal Article | LibreCat-ID: 11898 | OA
Schmalenstroeer J, Jebramcik P, Haeb-Umbach R. A combined hardware-software approach for acoustic sensor network synchronization . Signal Processing. 2014;(0). doi:http://dx.doi.org/10.1016/j.sigpro.2014.06.030
LibreCat | DOI | Download (ext.)
 

2014 | Conference Paper | LibreCat-ID: 11897 | OA
Schmalenstroeer J, Jebramcik P, Haeb-Umbach R. A Gossiping Approach to Sampling Clock Synchronization in Wireless Acoustic Sensor Networks. In: 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014). ; 2014.
LibreCat | Files available | Download (ext.)
 

2014 | Conference Paper | LibreCat-ID: 11903 | OA
Schmalenstroeer J, Zhao W, Haeb-Umbach R. Online Observation Error Model Estimation for Acoustic Sensor Network Synchronization. In: 11. ITG Fachtagung Sprachkommunikation (ITG 2014). ; 2014.
LibreCat | Files available | Download (ext.)
 

2013 | Conference Paper | LibreCat-ID: 11716
Abdelaziz AH, Zeiler S, Kolossa D, Leutnant V, Haeb-Umbach R. GMM-based significance decoding. In: Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference On. ; 2013:6827-6831. doi:10.1109/ICASSP.2013.6638984
LibreCat | DOI
 

2013 | Conference Paper | LibreCat-ID: 11740 | OA
Chinaev A, Haeb-Umbach R. MAP-based Estimation of the Parameters of a Gaussian Mixture Model in the Presence of Noisy Observations. In: 38th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013). ; 2013:3352-3356. doi:10.1109/ICASSP.2013.6638279
LibreCat | Files available | DOI | Download (ext.)
 

2013 | Conference Paper | LibreCat-ID: 11742 | OA
Chinaev A, Haeb-Umbach R, Taghia J, Martin R. Improved Single-Channel Nonstationary Noise Tracking by an Optimized MAP-based Postprocessor. In: 38th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013). ; 2013:7477-7481. doi:10.1109/ICASSP.2013.6639116
LibreCat | Files available | DOI | Download (ext.)
 

2013 | Conference Paper | LibreCat-ID: 11762 | OA
Enzner G, Schmid D, Haeb-Umbach R. On the Acoustic Channel Identification in Multi-Microphone Systems via Adaptive Blind Signal Enhancement Techniques. In: 21th European Signal Processing Conference (EUSIPCO 2013). ; 2013.
LibreCat | Download (ext.)
 

2013 | Conference Paper | LibreCat-ID: 11815 | OA
Heymann J, Walter O, Haeb-Umbach R, Raj B. Unsupervised Word Segmentation from Noisy Input. In: Automatic Speech Recognition and Understanding Workshop (ASRU 2013). ; 2013.
LibreCat | Files available | Download (ext.)
 

2013 | Conference Paper | LibreCat-ID: 11816 | OA
Hoang MK, Haeb-Umbach R. Parameter estimation and classification of censored Gaussian data with application to WiFi indoor positioning. In: 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013). ; 2013:3721-3725. doi:10.1109/ICASSP.2013.6638353
LibreCat | Files available | DOI | Download (ext.)
 

2013 | Conference Paper | LibreCat-ID: 11841 | OA
Kinoshita K, Delcroix M, Yoshioka T, et al. The reverb challenge: a common evaluation framework for dereverberation and recognition of reverberant speech. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics . ; 2013:22-23.
LibreCat | Download (ext.)
 

2013 | Journal Article | LibreCat-ID: 11862
Leutnant V, Krueger A, Haeb-Umbach R. Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing. 2013;21(8):1640-1652. doi:10.1109/TASL.2013.2258013
LibreCat | DOI
 

2013 | Conference Paper | LibreCat-ID: 11909 | OA
Tran Vu DH, Haeb-Umbach R. Blind Speech Separation Exploiting Temporal and Spectral Correlations Using Turbo Decoding of 2D-HMMs. In: 21th European Signal Processing Conference (EUSIPCO 2013). ; 2013.
LibreCat | Files available | Download (ext.)
 

2013 | Conference Paper | LibreCat-ID: 11917
Vu DHT, Haeb-Umbach R. Using the turbo principle for exploiting temporal and spectral correlations in speech presence probability estimation. In: 38th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013). ; 2013:863-867. doi:10.1109/ICASSP.2013.6637771
LibreCat | DOI
 

2013 | Conference Paper | LibreCat-ID: 11921 | OA
Walter O, Haeb-Umbach R, Chaudhuri S, Raj B. Unsupervised Word Discovery from Phonetic Input Using Nested Pitman-Yor Language Modeling. In: IEEE International Conference on Robotics and Automation (ICRA 2013). ; 2013.
LibreCat | Files available | Download (ext.)
 

2013 | Conference Paper | LibreCat-ID: 11924 | OA
Walter O, Korthals T, Haeb-Umbach R, Raj B. Hierarchical System for Word Discovery Exploiting DTW-Based Initialization. In: Automatic Speech Recognition and Understanding Workshop (ASRU 2013). ; 2013.
LibreCat | Files available | Download (ext.)
 

2013 | Report | LibreCat-ID: 11926 | OA
Walter O, Schmalenstroeer J, Haeb-Umbach R. A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01).; 2013.
LibreCat | Download (ext.)
 

2013 | Conference Paper | LibreCat-ID: 11832 | OA
Jacob F, Schmalenstroeer J, Haeb-Umbach R. DoA-Based Microphone Array Position Self-Calibration Using Circular Statistic. In: 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013). ; 2013:116-120. doi:10.1109/ICASSP.2013.6637620
LibreCat | Files available | DOI | Download (ext.)
 

2013 | Conference Paper | LibreCat-ID: 11891 | OA
Schmalenstroeer J, Haeb-Umbach R. Sampling Rate Synchronisation in Acoustic Sensor Networks with a Pre-Trained Clock Skew Error Model. In: 21th European Signal Processing Conference (EUSIPCO 2013). ; 2013.
LibreCat | Files available | Download (ext.)
 

2013 | Conference Paper | LibreCat-ID: 11818 | OA
Hoang MK, Schmitz S, Drueke C, Vu DHT, Schmalenstroeer J, Haeb-Umbach R. Server based indoor navigation using RSSI and inertial sensor information. In: Positioning Navigation and Communication (WPNC), 2013 10th Workshop On. ; 2013:1-6. doi:10.1109/WPNC.2013.6533263
LibreCat | Files available | DOI | Download (ext.)
 

2013 | Conference Paper | LibreCat-ID: 11817 | OA
Hoang MK, Schmalenstroeer J, Drueke C, Tran Vu DH, Haeb-Umbach R. A Hidden Markov Model for Indoor User Tracking Based on WiFi Fingerprinting and Step Detection. In: 21th European Signal Processing Conference (EUSIPCO 2013). ; 2013.
LibreCat | Files available | Download (ext.)
 

2012 | Conference Paper | LibreCat-ID: 11741 | OA
Chinaev A, Haeb-Umbach R. Quality Analysis and Optimization of the MAP-based Noise Power Spectral Density Tracker. In: Speech Communication; 10. ITG Symposium; Proceedings. ; 2012.
LibreCat | Files available | Download (ext.)
 

2012 | Conference Paper | LibreCat-ID: 11745 | OA
Chinaev A, Krueger A, Tran Vu DH, Haeb-Umbach R. Improved Noise Power Spectral Density Tracking by a MAP-based Postprocessor. In: 37th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2012). ; 2012.
LibreCat | Files available | Download (ext.)
 

2012 | Book Chapter | LibreCat-ID: 11844
Krueger A, Haeb-Umbach R. Reverberant Speech Recognition. In: Techniques for Noise Robustness in Automatic Speech Recognition. Wiley; 2012.
LibreCat
 

2012 | Conference Paper | LibreCat-ID: 11849 | OA
Krueger A, Walter O, Leutnant V, Haeb-Umbach R. Bayesian Feature Enhancement for ASR of Noisy Reverberant Real-World Data. In: Proc. Interspeech. Portland, USA; 2012.
LibreCat | Download (ext.)
 

2012 | Journal Article | LibreCat-ID: 11863 | OA
Leutnant V, Krueger A, Haeb-Umbach R. Investigations Into a Statistical Observation Model for Logarithmic Mel Power Spectral Density Features of Noisy Reverberant Speech. Speech Communication; 10 ITG Symposium; Proceedings of. 2012:1-4.
LibreCat | Download (ext.)
 

2012 | Conference Paper | LibreCat-ID: 11864 | OA
Leutnant V, Krueger A, Haeb-Umbach R. A Statistical Observation Model For Noisy Reverberant Speech Features and its Application to Robust ASR. In: Signal Processing, Communications and Computing (ICSPCC), 2012 IEEE International Conference On. ; 2012.
LibreCat | Download (ext.)
 

2012 | Report | LibreCat-ID: 11865 | OA
Leutnant V, Krueger A, Haeb-Umbach R. Derivation of the Power Compensation Constant in the Observation Model for Reverberant Speech in the Logarithmic Mel Power Spectral Domain.; 2012.
LibreCat | Download (ext.)
 

2012 | Conference Paper | LibreCat-ID: 11910
Tran Vu DH, Haeb-Umbach R. Exploiting Temporal Correlations in Joint Multichannel Speech Separation and Noise Suppression using Hidden Markov Models. In: International Workshop on Acoustic Signal Enhancement (IWAENC2012). ; 2012.
LibreCat
 

2012 | Conference Paper | LibreCat-ID: 11833 | OA
Jacob F, Schmalenstroeer J, Haeb-Umbach R. Microphone Array Position Self-Calibration from Reverberant Speech Input. In: International Workshop on Acoustic Signal Enhancement (IWAENC 2012). ; 2012.
LibreCat | Files available | Download (ext.)
 

2012 | Conference Paper | LibreCat-ID: 11925 | OA
Walter O, Schmalenstroeer J, Engler A, Haeb-Umbach R. Smartphone-Based Sensor Fusion for Improved Vehicular Navigation. In: 9th Workshop on Positioning Navigation and Communication (WPNC 2012). ; 2012.
LibreCat | Download (ext.)
 

2011 | Conference Paper | LibreCat-ID: 11721 | OA
Bevermeier M, Flanke S, Haeb-Umbach R, Stehr J. A Platform for efficient Supply Chain Management Support in Logistics. In: International Workshop on Intelligent Transportation (WIT 2011). ; 2011.
LibreCat | Download (ext.)
 

2011 | Book Chapter | LibreCat-ID: 11774
Haeb-Umbach R. Uncertainty Decoding and Conditional Bayesian Estimation. In: Haeb-Umbach R, Kolossa D, eds. Robust Speech Recognition of Uncertain or Missing Data. Springer; 2011.
LibreCat
 

2011 | Book Chapter | LibreCat-ID: 11775
Haeb-Umbach R. Können Computer sprechen und hören, sollen sie es überhaupt können? Sprachverarbeitung und ambiente Intelligenz. In: Baustelle Informationsgesellschaft Und Universität Heute. Ferdinand Schoeningh Verlag, Paderborn; 2011.
LibreCat
 

2011 | Journal Article | LibreCat-ID: 11807
Herbig T, Gerl F, Minker W, Haeb-Umbach R. Adaptive Systems for Unsupervised Speaker Tracking and Speech Recognition. Evolving Systems. 2011;2(3):199-214.
LibreCat
 

2011 | Book Chapter | LibreCat-ID: 11843
Krueger A, Haeb-Umbach R. A Model-Based Approach to Joint Compensation of Noise and Reverberation for Speech Recognition. In: Haeb-Umbach R, Kolossa D, eds. Robust Speech Recognition of Uncertain or Missing Data. Springer; 2011.
LibreCat
 

2011 | Conference Paper | LibreCat-ID: 11845 | OA
Krueger A, Haeb-Umbach R. MAP-based estimation of the parameters of non-stationary Gaussian processes from noisy observations. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2011). ; 2011:3596-3599. doi:10.1109/ICASSP.2011.5946256
LibreCat | DOI | Download (ext.)
 

2011 | Journal Article | LibreCat-ID: 11850 | OA
Krueger A, Warsitz E, Haeb-Umbach R. Speech Enhancement With a GSC-Like Structure Employing Eigenvector-Based Transfer Function Ratios Estimation. IEEE Transactions on Audio, Speech, and Language Processing. 2011;19(1):206-219. doi:10.1109/TASL.2010.2047324
LibreCat | DOI | Download (ext.)
 

2011 | Book Chapter | LibreCat-ID: 11856
Leutnant V, Haeb-Umbach R. Conditional Bayesian Estimation Employing a Phase-Sensitive Observation Model for Noise Robust Speech Recognition. In: Haeb-Umbach R, Kolossa D, eds. Robust Speech Recognition of Uncertain or Missing Data. Springer; 2011.
LibreCat
 

2011 | Conference Paper | LibreCat-ID: 11866 | OA
Leutnant V, Krueger A, Haeb-Umbach R. A versatile Gaussian splitting approach to non-linear state estimation and its application to noise-robust ASR. In: Interspeech 2011. ; 2011.
LibreCat | Download (ext.)
 

2011 | Conference Paper | LibreCat-ID: 11911 | OA
Tran Vu DH, Haeb-Umbach R. On Initial Seed Selection for Frequency Domain Blind Speech Separation. In: Interspeech 2011. ; 2011.
LibreCat | Download (ext.)
 

2011 | Book (Editor) | LibreCat-ID: 11945 | OA
Kolossa D, Haeb-Umbach R, eds. Robust Speech Recognition of Uncertain or Missing Data --- Theory and Applications. Springer; 2011.
LibreCat | Download (ext.)
 

2011 | Conference Paper | LibreCat-ID: 11889 | OA
Schmalenstroeer J, Bartek M, Haeb-Umbach R. Unsupervised learning of acoustic events using dynamic time warping and hierarchical K-means++ clustering. In: Interspeech 2011. ; 2011.
LibreCat | Download (ext.)
 

2011 | Conference Paper | LibreCat-ID: 11896 | OA
Schmalenstroeer J, Jacob F, Haeb-Umbach R, Hennecke M, Fink GA. Unsupervised Geometry Calibration of Acoustic Sensor Networks Using Source Correspondences. In: Interspeech 2011. ; 2011.
LibreCat | Download (ext.)
 

2011 | Conference Paper | LibreCat-ID: 9456 | OA
Schmalenstroeer J, Bartek M, Haeb-Umbach R. Investigations into Features for Robust Classification into Broad Acoustic Categories. In: 37. Deutsche Jahrestagung Fuer Akustik (DAGA 2011). ; 2011.
LibreCat | Download (ext.)
 

2010 | Conference Paper | LibreCat-ID: 11726 | OA
Bevermeier M, Walter O, Peschke S, Haeb-Umbach R. Barometric height estimation combined with map-matching in a loosely-coupled Kalman-filter. In: 7th Workshop on Positioning Navigation and Communication (WPNC 2010). ; 2010:128-134. doi:10.1109/WPNC.2010.5650745
LibreCat | DOI | Download (ext.)
 

2010 | Journal Article | LibreCat-ID: 11846 | OA
Krueger A, Haeb-Umbach R. Model-Based Feature Enhancement for Reverberant Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing. 2010;18(7):1692-1707. doi:10.1109/TASL.2010.2049684
LibreCat | DOI | Download (ext.)
 

2010 | Conference Paper | LibreCat-ID: 11857 | OA
Leutnant V, Haeb-Umbach R. Options for Modelling Temporal Statistical Dependencies in an Acoustic Model for ASR. In: 36. Deutsche Jahrestagung Fuer Akustik (DAGA 2010). ; 2010.
LibreCat | Download (ext.)
 

2010 | Conference Paper | LibreCat-ID: 11858 | OA
Leutnant V, Haeb-Umbach R. On the Exploitation of Hidden Markov Models and Linear Dynamic Models in a Hybrid Decoder Architecture for Continuous Speech Recognition. In: Interspeech 2010. ; 2010.
LibreCat | Download (ext.)
 

2010 | Conference Paper | LibreCat-ID: 11887 | OA
Raj B, Wilson KW, Krueger A, Haeb-Umbach R. Ungrounded Independent Non-Negative Factor Analysis. In: Interspeech 2010. ; 2010.
LibreCat | Download (ext.)
 

2010 | Conference Paper | LibreCat-ID: 11912 | OA
Tran Vu DH, Haeb-Umbach R. An EM Approach to Integrated Multichannel Speech Separation and Noise Suppression. In: International Workshop on Acoustic Echo and Noise Control (IWAENC 2010). ; 2010.
LibreCat | Download (ext.)
 

2010 | Conference Paper | LibreCat-ID: 11913 | OA
Tran Vu DH, Haeb-Umbach R. Blind speech separation employing directional statistics in an Expectation Maximization framework. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010). ; 2010:241-244. doi:10.1109/ICASSP.2010.5495994
LibreCat | DOI | Download (ext.)
 

2010 | Journal Article | LibreCat-ID: 11892 | OA
Schmalenstroeer J, Haeb-Umbach R. Online Diarization of Streaming Audio-Visual Data for Smart Environments. IEEE Journal of Selected Topics in Signal Processing. 2010;4(5):845-856. doi:10.1109/JSTSP.2010.2050519
LibreCat | DOI | Download (ext.)
 

Filters and Search Terms

(department=54)

status=public

Search

Filter Publications

Display / Sort

Sorted by: Publishing Year
Citation Style: AMA

Export / Embed