318 Publications
2024 | Conference Paper | LibreCat-ID: 57031 |

Gburrek, T., Meise, A., Schmalenstroeer, J., & Haeb-Umbach, R. (2024). Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation Models. 2024 18th International Workshop on Acoustic Signal Enhancement (IWAENC). https://doi.org/10.1109/iwaenc61483.2024.10694103
LibreCat
| Files available
| DOI
2024 | Report | LibreCat-ID: 57161
Werning, A., & Haeb-Umbach, R. (2024). UPB-NT submission to DCASE24: Dataset pruning for targeted knowledge distillation.
LibreCat
2024 | Conference Paper | LibreCat-ID: 57160
Werning, A., & Haeb-Umbach, R. (2024). Target-Specific Dataset Pruning for Compression of Audio Tagging Models. 32nd European Signal Processing Conference (EUSIPCO 2024). 32nd European Signal Processing Conference, Lyon.
LibreCat
| Files available
2024 | Conference Paper | LibreCat-ID: 57099
Xie, Y., Kuhlmann, M., Rautenberg, F., Tan, Z.-H., & Häb-Umbach, R. (2024). Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder. 2024 32nd European Signal Processing Conference (EUSIPCO), 436–440.
LibreCat
2024 | Conference Paper | LibreCat-ID: 56004 |

von Neumann, T., Boeddeker, C., Cord-Landwehr, T., Delcroix, M., & Haeb-Umbach, R. (2024). Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization. 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW). https://doi.org/10.1109/icasspw62465.2024.10625894
LibreCat
| Files available
| DOI
2024 | Journal Article | LibreCat-ID: 52958 |

Boeddeker, C., Subramanian, A. S., Wichern, G., Haeb-Umbach, R., & Le Roux, J. (2024). TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 32, 1185–1197. https://doi.org/10.1109/taslp.2024.3350887
LibreCat
| DOI
| Download (ext.)
2024 | Conference Paper | LibreCat-ID: 53659
Cord-Landwehr, T., Boeddeker, C., Zorilă, C., Doddipatla, R., & Haeb-Umbach, R. (2024). Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul. https://doi.org/10.1109/icassp48485.2024.10445911
LibreCat
| DOI
2024 | Preprint | LibreCat-ID: 57085 |

Cord-Landwehr, T., Boeddeker, C., & Haeb-Umbach, R. (2024). Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models.
LibreCat
| Download (ext.)
2024 | Conference Paper | LibreCat-ID: 56272 |

Boeddeker, C., Cord-Landwehr, T., & Haeb-Umbach, R. (2024). Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment. Interspeech 2024. https://doi.org/10.21437/interspeech.2024-1286
LibreCat
| DOI
| Download (ext.)
2024 | Conference Paper | LibreCat-ID: 57659 |

Vieting, P., Berger, S., von Neumann, T., Boeddeker, C., Schlüter, R., & Haeb-Umbach, R. (2024). Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription. 2024 IEEE Spoken Language Technology Workshop (SLT).
LibreCat
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48269 |

Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2023). On the Integration of Sampling Rate Synchronization and Acoustic Beamforming. European Signal Processing Conference (EUSIPCO). European Signal Processing Conference (EUSIPCO), Helsinki.
LibreCat
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48270 |

Schmalenstroeer, J., Gburrek, T., & Haeb-Umbach, R. (2023). LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices. ITG Conference on Speech Communication. ITG Conference on Speech Communication, Aachen.
LibreCat
| Files available
2023 | Conference Paper | LibreCat-ID: 48355 |

Rautenberg, F., Kuhlmann, M., Wiechmann, J., Seebauer, F., Wagner, P., & Haeb-Umbach, R. (2023). On Feature Importance and Interpretability of Speaker Representations. ITG Conference on Speech Communication. ITG Conference on Speech Communication, Aachen.
LibreCat
| Files available
| Download (ext.)
| arXiv
2023 | Conference Paper | LibreCat-ID: 48410 |

Wiechmann, J., Rautenberg, F., Wagner, P., & Haeb-Umbach, R. (2023). Explaining voice characteristics to novice voice practitioners-How successful is it? 20th International Congress of the Phonetic Sciences (ICPhS) .
LibreCat
| Files available
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 46069
Seebauer, F., Kuhlmann, M., Haeb-Umbach, R., & Wagner, P. (2023). Re-examining the quality dimensions of synthetic speech. 12th Speech Synthesis Workshop (SSW) 2023.
LibreCat
2023 | Journal Article | LibreCat-ID: 35602 |

von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31, 576–589. https://doi.org/10.1109/taslp.2022.3228629
LibreCat
| Files available
| DOI
2023 | Conference Paper | LibreCat-ID: 49109 |

Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2023). Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks. Proc. Asilomar Conference on Signals, Systems, and Computers. 57th Asilomar Conference on Signals, Systems, and Computers.
LibreCat
| Files available
2023 | Conference Paper | LibreCat-ID: 44849 |

Rautenberg, F., Kuhlmann, M., Ebbers, J., Wiechmann, J., Seebauer, F., Wagner, P., & Haeb-Umbach, R. (2023). Speech Disentanglement for Analysis and Modification of Acoustic and Perceptual Speaker Characteristics. Fortschritte Der Akustik - DAGA 2023, 1409–1412.
LibreCat
| Files available
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 49111
Ebbers, J., Haeb-Umbach, R., & Serizel, R. (2023). Post-Processing Independent Evaluation of Sound Event Detection Systems. Proceedings of the 8th Detection and Classification of Acoustic Scenes and Events 2023 Workshop (DCASE2023), 36–40.
LibreCat
| Files available
2023 | Conference Paper | LibreCat-ID: 57098
Seebauer, F., Kuhlmann, M., Häb-Umbach, R., & Wagner, P. (2023). DISCERNING DIMENSIONS OF QUALITY FOR STATE OF THE ART SYNTHETIC SPEECH. Proceedings of the 20th International Congress of Phonetic Sciences. International Congress of Phonetic Sciences (ICPhS), Prague.
LibreCat
2023 | Conference Paper | LibreCat-ID: 57086
Kuhlmann, M., Meise, A., Seebauer, F., Wagner, P., & Häb-Umbach, R. (2023). Investigating Speaker Embedding Disentanglement on Natural Read Speech. Speech Communication; 15th ITG Conference, 121–125.
LibreCat
2023 | Conference Paper | LibreCat-ID: 48281 |

von Neumann, T., Boeddeker, C., Kinoshita, K., Delcroix, M., & Haeb-Umbach, R. (2023). On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp49357.2023.10094784
LibreCat
| Files available
| DOI
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48275 |

von Neumann, T., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments. CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin.
LibreCat
| Files available
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 47128 |

Cord-Landwehr, T., Boeddeker, C., Zorilă, C., Doddipatla, R., & Haeb-Umbach, R. (2023). Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes. https://doi.org/10.1109/icassp49357.2023.10095370
LibreCat
| Files available
| DOI
2023 | Conference Paper | LibreCat-ID: 47129 |

Cord-Landwehr, T., Boeddeker, C., Zorilă, C., Doddipatla, R., & Haeb-Umbach, R. (2023). A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures. INTERSPEECH 2023. https://doi.org/10.21437/interspeech.2023-1379
LibreCat
| Files available
| DOI
2023 | Conference Paper | LibreCat-ID: 54439 |

Boeddeker, C., Cord-Landwehr, T., von Neumann, T., & Haeb-Umbach, R. (2023). Multi-stage diarization refinement for the CHiME-7 DASR scenario. 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023). https://doi.org/10.21437/chime.2023-10
LibreCat
| DOI
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48390 |

Berger, S., Vieting, P., Boeddeker, C., Schlüter, R., & Haeb-Umbach, R. (2023). Mixture Encoder for Joint Speech Separation and Recognition. INTERSPEECH 2023. https://doi.org/10.21437/interspeech.2023-1815
LibreCat
| DOI
| Download (ext.)
2022 | Conference Paper | LibreCat-ID: 33471
Heitkämper, J., Schmalenstroeer, J., & Haeb-Umbach, R. (n.d.). Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels. Proceedings of the 30th European Signal Processing Conference (EUSIPCO). 30th European Signal Processing Conference (EUSIPCO), Belgrad.
LibreCat
| Files available
2022 | Conference Paper | LibreCat-ID: 33847 |

Cord-Landwehr, T., von Neumann, T., Boeddeker, C., & Haeb-Umbach, R. (2022). MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg.
LibreCat
| Files available
| arXiv
2022 | Conference Paper | LibreCat-ID: 33807 |

Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2022). On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp43922.2022.9746284
LibreCat
| Files available
| DOI
2022 | Journal Article | LibreCat-ID: 33451 |

Grimm, C., Fei, T., Warsitz, E., Farhoud, R., Breddermann, T., & Haeb-Umbach, R. (2022). Warping of Radar Data Into Camera Image for Cross-Modal Supervision in Automotive Applications. IEEE Transactions on Vehicular Technology, 71(9), 9435–9449. https://doi.org/10.1109/TVT.2022.3182411
LibreCat
| Files available
| DOI
2022 | Conference Paper | LibreCat-ID: 33696 |

Wiechmann, J., Glarner, T., Rautenberg, F., Wagner, P., & Haeb-Umbach, R. (2022). Technically enabled explaining of voice characteristics. 18. Phonetik Und Phonologie Im Deutschsprachigen Raum (P&P).
LibreCat
| Files available
2022 | Conference Paper | LibreCat-ID: 33857 |

Kuhlmann, M., Seebauer, F., Ebbers, J., Wagner, P., & Haeb-Umbach, R. (2022). Investigation into Target Speaking Rate Adaptation for Voice Conversion. Interspeech 2022. https://doi.org/10.21437/interspeech.2022-10740
LibreCat
| Files available
| DOI
| Download (ext.)
2022 | Conference Paper | LibreCat-ID: 33808 |

Gburrek, T., Schmalenstroeer, J., Heitkaemper, J., & Haeb-Umbach, R. (2022). Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022), Bamberg, Germany . https://doi.org/10.1109/IWAENC53105.2022.9914772
LibreCat
| Files available
| DOI
2022 | Conference Paper | LibreCat-ID: 34072 |

Ebbers, J., Haeb-Umbach, R., & Serizel, R. (2022). Threshold Independent Evaluation of Sound Event Detection Scores. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
LibreCat
| Files available
2022 | Report | LibreCat-ID: 49113
Ebbers, J., & Haeb-Umbach, R. (2022). Pre-Training And Self-Training For Sound Event Detection In Domestic Environments.
LibreCat
| Files available
2022 | Conference Paper | LibreCat-ID: 33848 |

Cord-Landwehr, T., Boeddeker, C., von Neumann, T., Zorila, C., Doddipatla, R., & Haeb-Umbach, R. (2022). Monaural source separation: From anechoic to reverberant environments. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 2022 International Workshop on Acoustic Signal Enhancement (IWAENC).
LibreCat
| Files available
| arXiv
2022 | Conference Paper | LibreCat-ID: 33819 |

von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2022). SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp43922.2022.9746757
LibreCat
| Files available
| DOI
2022 | Misc | LibreCat-ID: 33816 |

Gburrek, T., Boeddeker, C., von Neumann, T., Cord-Landwehr, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2022). A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv. https://doi.org/10.48550/ARXIV.2205.00944
LibreCat
| Files available
| DOI
2022 | Conference Paper | LibreCat-ID: 33954 |

Boeddeker, C., Cord-Landwehr, T., von Neumann, T., & Haeb-Umbach, R. (2022). An Initialization Scheme for Meeting Separation with Spatial Mixture Models. Interspeech 2022. https://doi.org/10.21437/interspeech.2022-10929
LibreCat
| DOI
| Download (ext.)
2022 | Conference Paper | LibreCat-ID: 33958
Kinoshita, K., von Neumann, T., Delcroix, M., Boeddeker, C., & Haeb-Umbach, R. (2022). Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. Proc. Interspeech 2022, 1486–1490. https://doi.org/10.21437/Interspeech.2022-11408
LibreCat
| DOI
| Download (ext.)
2021 | Journal Article | LibreCat-ID: 21065 |

Haeb-Umbach, R., Heymann, J., Drude, L., Watanabe, S., Delcroix, M., & Nakatani, T. (2021). Far-Field Automatic Speech Recognition. Proceedings of the IEEE, 109(2), 124–148. https://doi.org/10.1109/JPROC.2020.3018668
LibreCat
| Files available
| DOI
2021 | Conference Paper | LibreCat-ID: 28256
Zhang, W., Boeddeker, C., Watanabe, S., Nakatani, T., Delcroix, M., Kinoshita, K., Ochiai, T., Kamo, N., Haeb-Umbach, R., & Qian, Y. (2021). End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp39728.2021.9414464
LibreCat
| DOI
2021 | Conference Paper | LibreCat-ID: 24000
Heitkaemper, J., Schmalenstroeer, J., Ion, V., & Haeb-Umbach, R. (2021). A Database for Research on Detection and Enhancement of Speech Transmitted over HF links. Speech Communication; 14th ITG-Symposium, 1–5.
LibreCat
2021 | Conference Paper | LibreCat-ID: 44843 |

Boeddeker, C., Rautenberg, F., & Haeb-Umbach, R. (2021). A Comparison and Combination of Unsupervised Blind Source Separation Techniques. ITG Conference on Speech Communication. ITG Conference on Speech Communication, Kiel.
LibreCat
| Files available
| Download (ext.)
| arXiv
2021 | Conference Paper | LibreCat-ID: 28259 |

Boeddeker, C., Zhang, W., Nakatani, T., Kinoshita, K., Ochiai, T., Delcroix, M., Kamo, N., Qian, Y., & Haeb-Umbach, R. (2021). Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp39728.2021.9414661
LibreCat
| Files available
| DOI
2021 | Conference Paper | LibreCat-ID: 23998 |

Schmalenstroeer, J., Heitkaemper, J., Ullmann, J., & Haeb-Umbach, R. (2021). Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech. 29th European Signal Processing Conference (EUSIPCO), 1–5.
LibreCat
| Download (ext.)
2021 | Journal Article | LibreCat-ID: 22528 |

Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2021). Geometry calibration in wireless acoustic sensor networks utilizing DoA and distance information. EURASIP Journal on Audio, Speech, and Music Processing. https://doi.org/10.1186/s13636-021-00210-x
LibreCat
| DOI
| Download (ext.)
2021 | Conference Paper | LibreCat-ID: 23994 |

Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2021). Iterative Geometry Calibration from Distance Estimates for Wireless Acoustic Sensor Networks. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp39728.2021.9413831
LibreCat
| Files available
| DOI
2021 | Conference Paper | LibreCat-ID: 23999 |

Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2021). On Source-Microphone Distance Estimation Using Convolutional Recurrent Neural Networks. Speech Communication; 14th ITG-Symposium, 1–5.
LibreCat
| Files available
2021 | Conference Paper | LibreCat-ID: 29304 |

Ebbers, J., Kuhlmann, M., Cord-Landwehr, T., & Haeb-Umbach, R. (2021). Contrastive Predictive Coding Supported Factorized Variational Autoencoder for Unsupervised Learning of Disentangled Speech Representations. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 3860–3864.
LibreCat
| Files available
2021 | Conference Paper | LibreCat-ID: 26770 |

von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2021). Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. Interspeech 2021. Interspeech. https://doi.org/10.21437/interspeech.2021-1177
LibreCat
| Files available
| DOI
2021 | Conference Paper | LibreCat-ID: 29173 |

von Neumann, T., Boeddeker, C., Kinoshita, K., Delcroix, M., & Haeb-Umbach, R. (2021). Speeding Up Permutation Invariant Training for Source Separation. Speech Communication; 14th ITG Conference. Speech Communication; 14th ITG Conference, Kiel.
LibreCat
| Files available
2021 | Conference Paper | LibreCat-ID: 29308 |

Ebbers, J., & Haeb-Umbach, R. (2021). Self-Trained Audio Tagging and Sound Event Detection in Domestic Environments. Proceedings of the 6th Detection and Classification of Acoustic Scenes and Events 2021 Workshop (DCASE2021), 226–230.
LibreCat
| Files available
2021 | Conference Paper | LibreCat-ID: 29306 |

Ebbers, J., Keyser, M. C., & Haeb-Umbach, R. (2021). Adapting Sound Recognition to A New Environment Via Self-Training. Proceedings of the 29th European Signal Processing Conference (EUSIPCO), 1135–1139.
LibreCat
| Files available
2021 | Journal Article | LibreCat-ID: 24456 |

Rohlfing, K. J., Cimiano, P., Scharlau, I., Matzner, T., Buhl, H. M., Buschmeier, H., Esposito, E., Grimminger, A., Hammer, B., Haeb-Umbach, R., Horwath, I., Hüllermeier, E., Kern, F., Kopp, S., Thommes, K., Ngonga Ngomo, A.-C., Schulte, C., Wachsmuth, H., Wagner, P., & Wrede, B. (2021). Explanation as a Social Practice: Toward a Conceptual Framework for the Social Design of AI Systems. IEEE Transactions on Cognitive and Developmental Systems, 13(3), 717–728. https://doi.org/10.1109/tcds.2020.3044366
LibreCat
| Files available
| DOI
2020 | Conference Paper | LibreCat-ID: 17763 |

Haeb-Umbach, R. (2020). Sprachtechnologien für Digitale Assistenten. In R. Böck, I. Siegert, & A. Wendemuth (Eds.), Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2020 (pp. 227–234). TUDpress, Dresden.
LibreCat
| Download (ext.)
2020 | Conference Paper | LibreCat-ID: 20700 |

Boeddeker, C., Cord-Landwehr, T., Heitkaemper, J., Zorila, C., Hayakawa, D., Li, M., … Haeb-Umbach, R. (2020). Towards a speaker diarization system for the CHiME 2020 dinner party transcription. In Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments.
LibreCat
| Files available
2020 | Journal Article | LibreCat-ID: 17598 |

Nakatani, T., Boeddeker, C., Kinoshita, K., Ikeshita, R., Delcroix, M., & Haeb-Umbach, R. (2020). Jointly optimal denoising, dereverberation, and source separation. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 1–1. https://doi.org/10.1109/TASLP.2020.3013118
LibreCat
| DOI
| Download (ext.)
2020 | Conference Paper | LibreCat-ID: 20504
Heitkaemper, J., Jakobeit, D., Boeddeker, C., Drude, L., & Haeb-Umbach, R. (2020). Demystifying TasNet: A Dissecting Approach. ICASSP 2020 Virtual Barcelona Spain.
LibreCat
| Files available
2020 | Conference Paper | LibreCat-ID: 20505
Heitkaemper, J., Schmalenstroeer, J., & Haeb-Umbach, R. (2020). Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments. INTERSPEECH 2020 Virtual Shanghai China.
LibreCat
| Files available
2020 | Conference Paper | LibreCat-ID: 20762 |

von Neumann, T., Kinoshita, K., Drude, L., Boeddeker, C., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). End-to-End Training of Time Domain Audio Separation and Recognition. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 7004–7008. https://doi.org/10.1109/ICASSP40776.2020.9053461
LibreCat
| Files available
| DOI
2020 | Conference Paper | LibreCat-ID: 20764 |

von Neumann, T., Boeddeker, C., Drude, L., Kinoshita, K., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. Proc. Interspeech 2020, 3097–3101. https://doi.org/10.21437/Interspeech.2020-2519
LibreCat
| Files available
| DOI
2020 | Conference Paper | LibreCat-ID: 18651 |

Gburrek, T., Schmalenstroeer, J., Brendel, A., Kellermann, W., & Haeb-Umbach, R. (2020). Deep Neural Network based Distance Estimation for Geometry Calibration in Acoustic Sensor Network. European Signal Processing Conference (EUSIPCO).
LibreCat
| Files available
2020 | Conference Paper | LibreCat-ID: 20766 |

Kinoshita, K., von Neumann, T., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation. Proc. Interspeech 2020, 2652–2656. https://doi.org/10.21437/Interspeech.2020-2388
LibreCat
| Files available
| DOI
2020 | Conference Paper | LibreCat-ID: 20753 |

Ebbers, J., & Haeb-Umbach, R. (2020). Forward-Backward Convolutional Recurrent Neural Networks and Tag-Conditioned Convolutional Neural Networks for Weakly Labeled Semi-Supervised Sound Event Detection. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2020 Workshop (DCASE2020).
LibreCat
| Files available
2020 | Conference Paper | LibreCat-ID: 20695 |

Boeddeker, C., Nakatani, T., Kinoshita, K., & Haeb-Umbach, R. (2020). Jointly Optimal Dereverberation and Beamforming. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp40776.2020.9054393
LibreCat
| Files available
| DOI
2019 | Journal Article | LibreCat-ID: 17762
Haeb-Umbach, R. (2019). Lektionen für Alexa \& Co?! Forschung, 44(1), 12–15. https://doi.org/10.1002/fors.201970104
LibreCat
| DOI
2019 | Journal Article | LibreCat-ID: 19446 |

Drude, L., Heitkaemper, J., Boeddeker, C., & Haeb-Umbach, R. (2019). SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition. ArXiv E-Prints.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 11965 |

Drude, L., Heymann, J., & Haeb-Umbach, R. (2019). Unsupervised training of neural mask-based beamforming. In INTERSPEECH 2019, Graz, Austria.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 12874 |

Drude, L., Hasenklever, D., & Haeb-Umbach, R. (2019). Unsupervised Training of a Deep Clustering Model for Multichannel Blind Source Separation. In ICASSP 2019, Brighton, UK.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 12875 |

Heymann, J., Drude, L., Haeb-Umbach, R., Kinoshita, K., & Nakatani, T. (2019). Joint Optimization of Neural Network-based WPE Dereverberation and Acoustic Model for Robust Online ASR. In ICASSP 2019, Brighton, UK.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 12876 |

Kurz, G., Gilitschenski, I., Pfaff, F., Drude, L., Hanebeck, U. D., Haeb-Umbach, R., & Siegwart, R. Y. (2019). Directional Statistics and Filtering Using libDirectional. In Journal of Statistical Software 89(4).
LibreCat
| Files available
2019 | Journal Article | LibreCat-ID: 12890 |

Drude, L., & Haeb-Umbach, R. (2019). Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation. IEEE Journal of Selected Topics in Signal Processing. https://doi.org/10.1109/JSTSP.2019.2912565
LibreCat
| Files available
| DOI
2019 | Conference Paper | LibreCat-ID: 15816 |

Zorila, C., Boeddeker, C., Doddipatla, R., & Haeb-Umbach, R. (2019). An Investigation Into the Effectiveness of Enhancement in ASR Training and Test for Chime-5 Dinner Party Transcription. In ASRU 2019, Sentosa, Singapore.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 14822 |

Heitkaemper, J., Feher, T., Freitag, M., & Haeb-Umbach, R. (2019). A Study on Online Source Extraction in the Presence of Changing Speaker Positions. In International Conference on Statistical Language and Speech Processing 2019, Ljubljana, Slovenia.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 14824 |

Martin-Donas, J. M., Heitkaemper, J., Haeb-Umbach, R., Gomez, A. M., & Peinado, A. M. (2019). Multi-Channel Block-Online Source Extraction based on Utterance Adaptation. In INTERSPEECH 2019, Graz, Austria.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 14826 |

Kanda, N., Boeddeker, C., Heitkaemper, J., Fujita, Y., Horiguchi, S., & Haeb-Umbach, R. (2019). Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR. In INTERSPEECH 2019, Graz, Austria.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 13271 |

von Neumann, T., Kinoshita, K., Delcroix, M., Araki, S., Nakatani, T., & Haeb-Umbach, R. (2019). All-neural Online Source Separation, Counting, and Diarization for Meeting Analysis. In ICASSP 2019, Brighton, UK.
LibreCat
| Files available
2019 | Journal Article | LibreCat-ID: 15814 |

Haeb-Umbach, R., Watanabe, S., Nakatani, T., Bacchiani, M., Hoffmeister, B., Seltzer, M. L., Zen, H., & Souden, M. (2019). Speech Processing for Digital Home Assistance: Combining Signal Processing With Deep-Learning Techniques. IEEE Signal Processing Magazine, 36(6), 111–124. https://doi.org/10.1109/MSP.2019.2918706
LibreCat
| Files available
| DOI
2019 | Journal Article | LibreCat-ID: 19450 |

Haeb-Umbach, R. (2019). Lektionen für Alexa & Co?! DFG Forschung 1/2019, 12–15. https://doi.org/10.1002/fors.201970104
LibreCat
| Files available
| DOI
2019 | Conference Paper | LibreCat-ID: 15237 |

Gburrek, T., Glarner, T., Ebbers, J., Haeb-Umbach, R., & Wagner, P. (2019). Unsupervised Learning of a Disentangled Speech Representation for Voice Conversion. Proc. 10th ISCA Speech Synthesis Workshop, 81–86. https://doi.org/10.21437/SSW.2019-15
LibreCat
| Files available
| DOI
| Download (ext.)
2019 | Conference Paper | LibreCat-ID: 15794 |

Ebbers, J., & Haeb-Umbach, R. (2019). Convolutional Recurrent Neural Network and Data Augmentation for Audio Tagging with Noisy Labels and Minimal Supervision. DCASE2019 Workshop, New York, USA.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 15796 |

Ebbers, J., Drude, L., Haeb-Umbach, R., Brendel, A., & Kellermann, W. (2019). Weakly Supervised Sound Activity Detection and Event Classification in Acoustic Sensor Networks. CAMSAP 2019, Guadeloupe, West Indies.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 15792 |

Nelus, A., Ebbers, J., Haeb-Umbach, R., & Martin, R. (2019). Privacy-preserving Variational Information Feature Extraction for Domestic Activity Monitoring Versus Speaker Identification. INTERSPEECH 2019, Graz, Austria.
LibreCat
| Files available
2018 | Conference Paper | LibreCat-ID: 11760 |

Ebbers, J., Nelus, A., Martin, R., & Haeb-Umbach, R. (2018). Evaluation of Modulation-MFCC Features and DNN Classification for Acoustic Event Detection. In DAGA 2018, München.
LibreCat
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 11835 |

Heymann, J., Drude, L., Haeb-Umbach, R., Kinoshita, K., & Nakatani, T. (2018). Frame-Online DNN-WPE Dereverberation. In IWAENC 2018, Tokio, Japan.
LibreCat
| Files available
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 11837 |

Heitkaemper, J., Heymann, J., & Haeb-Umbach, R. (2018). Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming. In ITG 2018, Oldenburg, Germany.
LibreCat
| Files available
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 11872 |

Drude, L., Boeddeker, C., Heymann, J., Kinoshita, K., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2018). Integration neural network based beamforming and weighted prediction error dereverberation. In INTERSPEECH 2018, Hyderabad, India.
LibreCat
| Files available
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 11873 |

Drude, L., Heymann, J., Boeddeker, C., & Haeb-Umbach, R. (2018). NARA-WPE: A Python package for weighted prediction error dereverberation in Numpy and Tensorflow for online and offline processing. In ITG 2018, Oldenburg, Germany.
LibreCat
| Files available
| Download (ext.)
2018 | Journal Article | LibreCat-ID: 11916 |

Despotovic, V., Walter, O., & Haeb-Umbach, R. (2018). Machine learning techniques for semantic analysis of dysarthric speech: An experimental study. Speech Communication 99 (2018) 242-251 (Elsevier B.V.).
LibreCat
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 12898 |

Drude, L., von Neumann, T., & Haeb-Umbach, R. (2018). Deep Attractor Networks for Speaker Re-Identifikation and Blind Source Separation. In ICASSP 2018, Calgary, Canada.
LibreCat
| Files available
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 12900 |

Drude, L., Higuchi, Takuya , Kinoshita, K., Nakatani, T., & Haeb-Umbach, R. (2018). Dual Frequency- and Block-Permutation Alignment for Deep Learning Based Block-Online Blind Source Separation. In ICASSP 2018, Calgary, Canada.
LibreCat
| Files available
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 12901 |

Boeddeker, C., Erdogan, H., Yoshioka, T., & Haeb-Umbach, R. (2018). Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition. In ICASSP 2018, Calgary, Canada.
LibreCat
| Files available
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 12899 |

Boeddeker, C., Heitkaemper, J., Schmalenstroeer, J., Drude, L., Heymann, J., & Haeb-Umbach, R. (2018). Front-End Processing for the CHiME-5 Dinner Party Scenario. Proc. CHiME 2018 Workshop on Speech Processing in Everyday Environments, Hyderabad, India.
LibreCat
| Files available
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 6859
Afifi, H., Schmalenstroeer, J., Ullmann, J., Haeb-Umbach, R., & Karl, H. (2018). MARVELO - A Framework for Signal Processing in Wireless Acoustic Sensor Networks. Speech Communication; 13th ITG-Symposium, 1–5.
LibreCat
2018 | Conference Paper | LibreCat-ID: 11747 |

Grimm, C., Breddermann, T., Farhoud, R., Fei, T., Warsitz, E., & Haeb-Umbach, R. (2018). Discrimination of Stationary from Moving Targets with Recurrent Neural Networks in Automotive Radar. International Conference on Microwaves for Intelligent Mobility (ICMIM) 2018.
LibreCat
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 11907 |

Glarner, T., Hanebrink, P., Ebbers, J., & Haeb-Umbach, R. (2018). Full Bayesian Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery. INTERSPEECH 2018, Hyderabad, India.
LibreCat
| Files available
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 11838 |

Schmalenstroeer, J., & Haeb-Umbach, R. (2018). Efficient Sampling Rate Offset Compensation - An Overlap-Save Based Approach. 26th European Signal Processing Conference (EUSIPCO 2018).
LibreCat
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 11876 |

Kitza, M., Michel, W., Boeddeker, C., Heitkaemper, J., Menne, T., Schlüter, R., Ney, H., Schmalenstroeer, J., Drude, L., Heymann, J., & Haeb-Umbach, R. (2018). The RWTH/UPB System Combination for the CHiME 2018 Workshop. Proc. CHiME 2018 Workshop on Speech Processing in Everyday Environments, Hyderabad, India.
LibreCat
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 11836 |

Ebbers, J., Heitkaemper, J., Schmalenstroeer, J., & Haeb-Umbach, R. (2018). Benchmarking Neural Network Architectures for Acoustic Sensor Networks. ITG 2018, Oldenburg, Germany.
LibreCat
| Files available
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 11839 |

Schmalenstroeer, J., & Haeb-Umbach, R. (2018). Insights into the Interplay of Sampling Rate Offsets and MVDR Beamforming. ITG 2018, Oldenburg, Germany.
LibreCat
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11717 |

Arora, P., & Haeb-Umbach, R. (2017). A Study on Transfer Learning for Acoustic Event Detection in a Real Life Scenario. In IEEE 19th International Workshop on Multimedia Signal Processing (MMSP).
LibreCat
| Files available
| Download (ext.)
2017 | Report | LibreCat-ID: 11735 |

Boeddeker, C., Hanebrink, P., Drude, L., Heymann, J., & Haeb-Umbach, R. (2017). On the Computation of Complex-valued Gradients with Application to Statistically Optimum Beamforming.
LibreCat
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11736 |

Boeddeker, C., Hanebrink, P., Drude, L., Heymann, J., & Haeb-Umbach, R. (2017). Optimizing Neural-Network Supported Acoustic Beamforming by Algorithmic Differentiation. In Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP).
LibreCat
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11737 |

Chinaev, A., & Haeb-Umbach, R. (2017). A Generalized Log-Spectral Amplitude Estimator for Single-Channel Speech Enhancement. In Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP).
LibreCat
| Files available
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11754 |

Drude, L., & Haeb-Umbach, R. (2017). Tight integration of spatial and spectral features for BSS with Deep Clustering embeddings. In INTERSPEECH 2017, Stockholm, Schweden.
LibreCat
| Files available
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11770 |

Glarner, T., Boenninghoff, B., Walter, O., & Haeb-Umbach, R. (2017). Leveraging Text Data for Word Segmentation for Underresourced Languages. In INTERSPEECH 2017, Stockholm, Schweden.
LibreCat
| Files available
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11809 |

Heymann, J., Drude, L., Boeddeker, C., Hanebrink, P., & Haeb-Umbach, R. (2017). BEAMNET: End-to-End Training of a Beamformer-Supported Multi-Channel ASR System. In Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP).
LibreCat
| Files available
| Download (ext.)
2017 | Journal Article | LibreCat-ID: 11811 |

Heymann, J., Drude, L., & Haeb-Umbach, R. (2017). A Generic Neural Acoustic Beamforming Architecture for Robust Multi-Channel Speech Processing. Computer Speech and Language.
LibreCat
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11763 |

Fei, T., Grimm, C., Farhoud, R., Breddermann, T., Warsitz, E., & Haeb-Umbach, R. (2017). A Novel Target Separation Algorithm Applied to The Two-Dimensional Spectrum for FMCW Automotive Radar Systems. IEEE International Conference on Microwave, Communications, Anthenas and Electronic Systems.
LibreCat
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11772 |

Grimm, C., Breddermann, T., Farhoud, R., Fei, T., Warsitz, E., & Haeb-Umbach, R. (2017). Hypothesis Test for the Detection of Moving Targets in Automotive Radar. IEEE International Conference on Microwave, Communications, Anthenas and Electronic Systems (COMCAS).
LibreCat
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11759 |

Ebbers, J., Heymann, J., Drude, L., Glarner, T., Haeb-Umbach, R., & Raj, B. (2017). Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery. INTERSPEECH 2017, Stockholm, Schweden.
LibreCat
| Files available
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11895 |

Schmalenstroeer, J., Heymann, J., Drude, L., Boeddeker, C., & Haeb-Umbach, R. (2017). Multi-Stage Coherence Drift Based Sampling Rate Synchronization for Acoustic Beamforming. IEEE 19th International Workshop on Multimedia Signal Processing (MMSP).
LibreCat
| Files available
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11773 |

Grimm, C., Farhoud, R., Fei, T., Warsitz, E., & Haeb-Umbach, R. (2017). Detection of Moving Targets in Automotive Radar with Distorted Ego-Velocity Information. IEEE Microwaves, Radar and Remote Sensing Symposium (MRRS).
LibreCat
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11738 |

Chinaev, A., & Haeb-Umbach, R. (2016). A Priori SNR Estimation Using a Generalized Decision Directed Approach. In INTERSPEECH 2016, San Francisco, USA.
LibreCat
| Files available
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11743 |

Chinaev, A., Heitkaemper, J., & Haeb-Umbach, R. (2016). A Priori SNR Estimation Using Weibull Mixture Model. In 12. ITG Fachtagung Sprachkommunikation (ITG 2016).
LibreCat
| Files available
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11744 |

Chinaev, A., Heymann, J., Drude, L., & Haeb-Umbach, R. (2016). Noise-Presence-Probability-Based Noise PSD Estimation by Using DNNs. In 12. ITG Fachtagung Sprachkommunikation (ITG 2016).
LibreCat
| Files available
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11751 |

Drude, L., Boeddeker, C., & Haeb-Umbach, R. (2016). Blind Speech Separation based on Complex Spherical k-Mode Clustering. In Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP).
LibreCat
| Files available
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11756 |

Drude, L., Raj, B., & Haeb-Umbach, R. (2016). On the appropriateness of complex-valued neural networks for speech enhancement. In INTERSPEECH 2016, San Francisco, USA.
LibreCat
| Files available
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11771 |

Glarner, T., Mahdi Momenzadeh, M., Drude, L., & Haeb-Umbach, R. (2016). Factor Graph Decoding for Speech Presence Probability Estimation. In 12. ITG Fachtagung Sprachkommunikation (ITG 2016).
LibreCat
| Files available
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11812 |

Heymann, J., Drude, L., & Haeb-Umbach, R. (2016). Neural Network Based Spectral Mask Estimation for Acoustic Beamforming. In Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP).
LibreCat
| Files available
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11829 |

Jacob, F., & Haeb-Umbach, R. (2016). On the Bias of Direction of Arrival Estimation Using Linear Microphone Arrays. In 12. ITG Fachtagung Sprachkommunikation (ITG 2016).
LibreCat
| Files available
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11834 |

Heymann, J., Drude, L., & Haeb-Umbach, R. (2016). Wide Residual BLSTM Network with Discriminative Speaker Adaptation for Robust Speech Recognition. In Computer Speech and Language.
LibreCat
| Files available
| Download (ext.)
2016 | Journal Article | LibreCat-ID: 11840 |

Kinoshita, K., Delcroix, M., Gannot, S., Habets, E. A. P., Haeb-Umbach, R., Kellermann, W., … Yoshioka, T. (2016). A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research. EURASIP Journal on Advances in Signal Processing.
LibreCat
| Download (ext.)
2016 | Journal Article | LibreCat-ID: 11886
Plinge, A., Jacob, F., Haeb-Umbach, R., & Fink, G. A. (2016). Acoustic Microphone Geometry Calibration: An overview and experimental evaluation of state-of-the-art algorithms. IEEE Signal Processing Magazine, 33(4), 14–29. https://doi.org/10.1109/MSP.2016.2555198
LibreCat
| DOI
2016 | Conference Paper | LibreCat-ID: 11908 |

Menne, T., Heymann, J., Alexandridis, A., Irie, K., Zeyer, A., Kitza, M., … Mouchtaris, A. (2016). The RWTH/UPB/FORTH System Combination for the 4th CHiME Challenge Evaluation. In Computer Speech and Language.
LibreCat
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11920 |

Walter, O., & Haeb-Umbach, R. (2016). Unsupervised Word Discovery from Speech using Bayesian Hierarchical Models. In 38th German Conference on Pattern Recognition (GCPR 2016).
LibreCat
| Files available
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11890 |

Schmalenstroeer, J., & Haeb-Umbach, R. (2016). Investigations into Bluetooth Low Energy Localization Precision Limits. 24th European Signal Processing Conference (EUSIPCO 2016).
LibreCat
| Files available
| Download (ext.)
2015 | Conference Paper | LibreCat-ID: 11739 |

Chinaev, A., & Haeb-Umbach, R. (2015). On Optimal Smoothing in Minimum Statistics Based Noise Tracking. In Interspeech 2015 (pp. 1785–1789).
LibreCat
| Files available
| Download (ext.)
2015 | Conference Paper | LibreCat-ID: 11748 |

Despotovic, V., Walter, O., & Haeb-Umbach, R. (2015). Semantic Analysis of Spoken Input using Markov Logic Networks. In INTERSPEECH 2015.
LibreCat
| Files available
| Download (ext.)
2015 | Conference Paper | LibreCat-ID: 11755 |

Drude, L., Jacob, F., & Haeb-Umbach, R. (2015). DOA-Estimation based on a Complex Watson Kernel Method. In 23th European Signal Processing Conference (EUSIPCO 2015).
LibreCat
| Files available
| Download (ext.)
2015 | Conference Paper | LibreCat-ID: 11810
Heymann, J., Drude, L., Chinaev, A., & Haeb-Umbach, R. (2015). BLSTM supported GEV Beamformer Front-End for the 3RD CHiME Challenge. In Automatic Speech Recognition and Understanding Workshop (ASRU 2015).
LibreCat
2015 | Conference Paper | LibreCat-ID: 11813 |

Heymann, J., Haeb-Umbach, R., Golik, P., & Schlueter, R. (2015). Unsupervised adaptation of a denoising autoencoder by Bayesian Feature Enhancement for reverberant asr under mismatch conditions. In Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on (pp. 5053–5057). https://doi.org/10.1109/ICASSP.2015.7178933
LibreCat
| DOI
| Download (ext.)
2015 | Journal Article | LibreCat-ID: 11830 |

Jacob, F., & Haeb-Umbach, R. (2015). Absolute Geometry Calibration of Distributed Microphone Arrays in an Audio-Visual Sensor Network. ArXiv E-Prints.
LibreCat
| Download (ext.)
2015 | Book | LibreCat-ID: 11868 |

Li, J., Deng, L., Haeb-Umbach, R., & Gong, Y. (2015). Robust Automatic Speech Recognition. Elsevier.
LibreCat
| Files available
| Download (ext.)
2015 | Conference Paper | LibreCat-ID: 11875 |

Marchi, E., Schuller, B., Baron-Cohen, S., Golan, O., Boelte, S., Arora, P., & Haeb-Umbach, R. (2015). Typicality and Emotion in the Voice of Children with Autism Spectrum Condition: Evidence Across Three Languages. In INTERSPEECH 2015.
LibreCat
| Download (ext.)
2015 | Conference Paper | LibreCat-ID: 11919 |

Walter, O., Drude, L., & Haeb-Umbach, R. (2015). Source Counting in Speech Mixtures by Nonparametric Bayesian Estimation of an infinite Gaussian Mixture Model. In 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2015).
LibreCat
| Files available
| Download (ext.)
2015 | Journal Article | LibreCat-ID: 11922 |

Walter, O., Haeb-Umbach, R., Mokbel, B., Paassen, B., & Hammer, B. (2015). Autonomous Learning of Representations. KI - Kuenstliche Intelligenz, 1–13. http://dx.doi.org/10.1007/s13218-015-0372-1
LibreCat
| DOI
| Download (ext.)
2015 | Report | LibreCat-ID: 11923 |

Walter, O., Haeb-Umbach, R., Strunk, J., & P. Himmelmann, N. (2015). Lexicon Discovery for Language Preservation using Unsupervised Word Segmentation with Pitman-Yor Language Models (FGNT-2015-01).
LibreCat
| Download (ext.)
2015 | Conference Paper | LibreCat-ID: 11874 |

Hoang, M. K., Schmalenstroeer, J., & Haeb-Umbach, R. (2015). Aligning training models with smartphone properties in WiFi fingerprinting based indoor localization. 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2015).
LibreCat
| Download (ext.)
2014 | Conference Paper | LibreCat-ID: 11746 |

Chinaev, A., Puels, M., & Haeb-Umbach, R. (2014). Spectral Noise Tracking for Improved Nonstationary Noise Robust ASR. In 11. ITG Fachtagung Sprachkommunikation (ITG 2014).
LibreCat
| Files available
| Download (ext.)
2014 | Conference Paper | LibreCat-ID: 11752 |

Drude, L., Chinaev, A., Tran Vu, D. H., & Haeb-Umbach, R. (2014). Source Counting in Speech Mixtures Using a Variational EM Approach for Complexwatson Mixture Models. In 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014).
LibreCat
| Files available
| Download (ext.)
2014 | Conference Paper | LibreCat-ID: 11753 |

Drude, L., Chinaev, A., Tran Vu, D. H., & Haeb-Umbach, R. (2014). Towards Online Source Counting in Speech Mixtures Applying a Variational EM for Complex Watson Mixture Models. In 14th International Workshop on Acoustic Signal Enhancement (IWAENC 2014) (pp. 213–217).
LibreCat
| Files available
| Download (ext.)
2014 | Conference Paper | LibreCat-ID: 11814 |

Heymann, J., Walter, O., Haeb-Umbach, R., & Raj, B. (2014). Iterative Bayesian Word Segmentation for Unspuervised Vocabulary Discovery from Phoneme Lattices. In 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014).
LibreCat
| Files available
| Download (ext.)
2014 | Conference Paper | LibreCat-ID: 11831 |

Jacob, F., & Haeb-Umbach, R. (2014). Coordinate Mapping Between an Acoustic and Visual Sensor Network in the Shape Domain for a Joint Self-Calibrating Speaker Tracking. In 11. ITG Fachtagung Sprachkommunikation (ITG 2014).
LibreCat
| Files available
| Download (ext.)
2014 | Journal Article | LibreCat-ID: 11861
Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2014). A New Observation Model in the Logarithmic Mel Power Spectral Domain for the Automatic Recognition of Noisy Reverberant Speech. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 22(1), 95–109. https://doi.org/10.1109/TASLP.2013.2285480
LibreCat
| DOI
2014 | Journal Article | LibreCat-ID: 11867 |

Li, J., Deng, L., Gong, Y., & Haeb-Umbach, R. (2014). An Overview of Noise-Robust Automatic Speech Recognition. IEEE Transactions on Audio, Speech and Language Processing, 22(4), 745–777. https://doi.org/10.1109/TASLP.2014.2304637
LibreCat
| DOI
| Download (ext.)
2014 | Conference Paper | LibreCat-ID: 11918 |

Walter, O., Despotovic, V., Haeb-Umbach, R., Gemmeke, J., Ons, B., & Van hamme, H. (2014). An Evaluation of Unsupervised Acoustic Model Training for a Dysarthric Speech Interface. In INTERSPEECH 2014.
LibreCat
| Files available
| Download (ext.)
2014 | Journal Article | LibreCat-ID: 11898 |

Schmalenstroeer, J., Jebramcik, P., & Haeb-Umbach, R. (2014). A combined hardware-software approach for acoustic sensor network synchronization . Signal Processing, 0. http://dx.doi.org/10.1016/j.sigpro.2014.06.030
LibreCat
| DOI
| Download (ext.)
2014 | Conference Paper | LibreCat-ID: 11897 |

Schmalenstroeer, J., Jebramcik, P., & Haeb-Umbach, R. (2014). A Gossiping Approach to Sampling Clock Synchronization in Wireless Acoustic Sensor Networks. 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014).
LibreCat
| Files available
| Download (ext.)
2014 | Conference Paper | LibreCat-ID: 11903 |

Schmalenstroeer, J., Zhao, W., & Haeb-Umbach, R. (2014). Online Observation Error Model Estimation for Acoustic Sensor Network Synchronization. 11. ITG Fachtagung Sprachkommunikation (ITG 2014).
LibreCat
| Files available
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11716
Abdelaziz, A. H., Zeiler, S., Kolossa, D., Leutnant, V., & Haeb-Umbach, R. (2013). GMM-based significance decoding. In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on (pp. 6827–6831). https://doi.org/10.1109/ICASSP.2013.6638984
LibreCat
| DOI
2013 | Conference Paper | LibreCat-ID: 11740 |

Chinaev, A., & Haeb-Umbach, R. (2013). MAP-based Estimation of the Parameters of a Gaussian Mixture Model in the Presence of Noisy Observations. In 38th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013) (pp. 3352–3356). https://doi.org/10.1109/ICASSP.2013.6638279
LibreCat
| Files available
| DOI
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11742 |

Chinaev, A., Haeb-Umbach, R., Taghia, J., & Martin, R. (2013). Improved Single-Channel Nonstationary Noise Tracking by an Optimized MAP-based Postprocessor. In 38th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013) (pp. 7477–7481). https://doi.org/10.1109/ICASSP.2013.6639116
LibreCat
| Files available
| DOI
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11762 |

Enzner, G., Schmid, D., & Haeb-Umbach, R. (2013). On the Acoustic Channel Identification in Multi-Microphone Systems via Adaptive Blind Signal Enhancement Techniques. In 21th European Signal Processing Conference (EUSIPCO 2013).
LibreCat
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11815 |

Heymann, J., Walter, O., Haeb-Umbach, R., & Raj, B. (2013). Unsupervised Word Segmentation from Noisy Input. In Automatic Speech Recognition and Understanding Workshop (ASRU 2013).
LibreCat
| Files available
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11816 |

Hoang, M. K., & Haeb-Umbach, R. (2013). Parameter estimation and classification of censored Gaussian data with application to WiFi indoor positioning. In 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013) (pp. 3721–3725). https://doi.org/10.1109/ICASSP.2013.6638353
LibreCat
| Files available
| DOI
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11841 |

Kinoshita, K., Delcroix, M., Yoshioka, T., Nakatani, T., Habets, E., Haeb-Umbach, R., … Raj, B. (2013). The reverb challenge: a common evaluation framework for dereverberation and recognition of reverberant speech. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (pp. 22–23).
LibreCat
| Download (ext.)
2013 | Journal Article | LibreCat-ID: 11862
Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2013). Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 21(8), 1640–1652. https://doi.org/10.1109/TASL.2013.2258013
LibreCat
| DOI
2013 | Conference Paper | LibreCat-ID: 11909 |

Tran Vu, D. H., & Haeb-Umbach, R. (2013). Blind Speech Separation Exploiting Temporal and Spectral Correlations Using Turbo Decoding of 2D-HMMs. In 21th European Signal Processing Conference (EUSIPCO 2013).
LibreCat
| Files available
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11917
Vu, D. H. T., & Haeb-Umbach, R. (2013). Using the turbo principle for exploiting temporal and spectral correlations in speech presence probability estimation. In 38th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013) (pp. 863–867). https://doi.org/10.1109/ICASSP.2013.6637771
LibreCat
| DOI
2013 | Conference Paper | LibreCat-ID: 11921 |

Walter, O., Haeb-Umbach, R., Chaudhuri, S., & Raj, B. (2013). Unsupervised Word Discovery from Phonetic Input Using Nested Pitman-Yor Language Modeling. In IEEE International Conference on Robotics and Automation (ICRA 2013).
LibreCat
| Files available
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11924 |

Walter, O., Korthals, T., Haeb-Umbach, R., & Raj, B. (2013). Hierarchical System for Word Discovery Exploiting DTW-Based Initialization. In Automatic Speech Recognition and Understanding Workshop (ASRU 2013).
LibreCat
| Files available
| Download (ext.)
2013 | Report | LibreCat-ID: 11926 |

Walter, O., Schmalenstroeer, J., & Haeb-Umbach, R. (2013). A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01).
LibreCat
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11832 |

Jacob, F., Schmalenstroeer, J., & Haeb-Umbach, R. (2013). DoA-Based Microphone Array Position Self-Calibration Using Circular Statistic. 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013), 116–120. https://doi.org/10.1109/ICASSP.2013.6637620
LibreCat
| Files available
| DOI
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11891 |

Schmalenstroeer, J., & Haeb-Umbach, R. (2013). Sampling Rate Synchronisation in Acoustic Sensor Networks with a Pre-Trained Clock Skew Error Model. 21th European Signal Processing Conference (EUSIPCO 2013).
LibreCat
| Files available
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11818 |

Hoang, M. K., Schmitz, S., Drueke, C., Vu, D. H. T., Schmalenstroeer, J., & Haeb-Umbach, R. (2013). Server based indoor navigation using RSSI and inertial sensor information. Positioning Navigation and Communication (WPNC), 2013 10th Workshop On, 1–6. https://doi.org/10.1109/WPNC.2013.6533263
LibreCat
| Files available
| DOI
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11817 |

Hoang, M. K., Schmalenstroeer, J., Drueke, C., Tran Vu, D. H., & Haeb-Umbach, R. (2013). A Hidden Markov Model for Indoor User Tracking Based on WiFi Fingerprinting and Step Detection. 21th European Signal Processing Conference (EUSIPCO 2013).
LibreCat
| Files available
| Download (ext.)
2012 | Conference Paper | LibreCat-ID: 11741 |

Chinaev, A., & Haeb-Umbach, R. (2012). Quality Analysis and Optimization of the MAP-based Noise Power Spectral Density Tracker. In Speech Communication; 10. ITG Symposium; Proceedings.
LibreCat
| Files available
| Download (ext.)
2012 | Conference Paper | LibreCat-ID: 11745 |

Chinaev, A., Krueger, A., Tran Vu, D. H., & Haeb-Umbach, R. (2012). Improved Noise Power Spectral Density Tracking by a MAP-based Postprocessor. In 37th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2012).
LibreCat
| Files available
| Download (ext.)
2012 | Book Chapter | LibreCat-ID: 11844
Krueger, A., & Haeb-Umbach, R. (2012). Reverberant Speech Recognition. In Techniques for Noise Robustness in Automatic Speech Recognition. Wiley.
LibreCat
2012 | Conference Paper | LibreCat-ID: 11849 |

Krueger, A., Walter, O., Leutnant, V., & Haeb-Umbach, R. (2012). Bayesian Feature Enhancement for ASR of Noisy Reverberant Real-World Data. In Proc. Interspeech. Portland, USA.
LibreCat
| Download (ext.)
2012 | Journal Article | LibreCat-ID: 11863 |

Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2012). Investigations Into a Statistical Observation Model for Logarithmic Mel Power Spectral Density Features of Noisy Reverberant Speech. Speech Communication; 10. ITG Symposium; Proceedings Of, 1–4.
LibreCat
| Download (ext.)
2012 | Conference Paper | LibreCat-ID: 11864 |

Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2012). A Statistical Observation Model For Noisy Reverberant Speech Features and its Application to Robust ASR. In Signal Processing, Communications and Computing (ICSPCC), 2012 IEEE International Conference on.
LibreCat
| Download (ext.)
2012 | Report | LibreCat-ID: 11865 |

Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2012). Derivation of the Power Compensation Constant in the Observation Model for Reverberant Speech in the Logarithmic Mel Power Spectral Domain.
LibreCat
| Download (ext.)
2012 | Conference Paper | LibreCat-ID: 11910
Tran Vu, D. H., & Haeb-Umbach, R. (2012). Exploiting Temporal Correlations in Joint Multichannel Speech Separation and Noise Suppression using Hidden Markov Models. In International Workshop on Acoustic Signal Enhancement (IWAENC2012).
LibreCat
2012 | Conference Paper | LibreCat-ID: 11833 |

Jacob, F., Schmalenstroeer, J., & Haeb-Umbach, R. (2012). Microphone Array Position Self-Calibration from Reverberant Speech Input. International Workshop on Acoustic Signal Enhancement (IWAENC 2012).
LibreCat
| Files available
| Download (ext.)
2012 | Conference Paper | LibreCat-ID: 11925 |

Walter, O., Schmalenstroeer, J., Engler, A., & Haeb-Umbach, R. (2012). Smartphone-Based Sensor Fusion for Improved Vehicular Navigation. 9th Workshop on Positioning Navigation and Communication (WPNC 2012).
LibreCat
| Download (ext.)
2011 | Conference Paper | LibreCat-ID: 11721 |

Bevermeier, M., Flanke, S., Haeb-Umbach, R., & Stehr, J. (2011). A Platform for efficient Supply Chain Management Support in Logistics. In International Workshop on Intelligent Transportation (WIT 2011).
LibreCat
| Download (ext.)
2011 | Book Chapter | LibreCat-ID: 11774
Haeb-Umbach, R. (2011). Uncertainty Decoding and Conditional Bayesian Estimation. In R. Haeb-Umbach & D. Kolossa (Eds.), Robust Speech Recognition of Uncertain or Missing Data. Springer.
LibreCat
2011 | Book Chapter | LibreCat-ID: 11775
Haeb-Umbach, R. (2011). Können Computer sprechen und hören, sollen sie es überhaupt können? Sprachverarbeitung und ambiente Intelligenz. In Baustelle Informationsgesellschaft und Universität heute. Ferdinand Schoeningh Verlag, Paderborn.
LibreCat
2011 | Journal Article | LibreCat-ID: 11807
Herbig, T., Gerl, F., Minker, W., & Haeb-Umbach, R. (2011). Adaptive Systems for Unsupervised Speaker Tracking and Speech Recognition. Evolving Systems, 2(3), 199–214.
LibreCat
2011 | Book Chapter | LibreCat-ID: 11843
Krueger, A., & Haeb-Umbach, R. (2011). A Model-Based Approach to Joint Compensation of Noise and Reverberation for Speech Recognition. In R. Haeb-Umbach & D. Kolossa (Eds.), Robust Speech Recognition of Uncertain or Missing Data. Springer.
LibreCat
2011 | Conference Paper | LibreCat-ID: 11845 |

Krueger, A., & Haeb-Umbach, R. (2011). MAP-based estimation of the parameters of non-stationary Gaussian processes from noisy observations. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2011) (pp. 3596–3599). https://doi.org/10.1109/ICASSP.2011.5946256
LibreCat
| DOI
| Download (ext.)
2011 | Journal Article | LibreCat-ID: 11850 |

Krueger, A., Warsitz, E., & Haeb-Umbach, R. (2011). Speech Enhancement With a GSC-Like Structure Employing Eigenvector-Based Transfer Function Ratios Estimation. IEEE Transactions on Audio, Speech, and Language Processing, 19(1), 206–219. https://doi.org/10.1109/TASL.2010.2047324
LibreCat
| DOI
| Download (ext.)
2011 | Book Chapter | LibreCat-ID: 11856
Leutnant, V., & Haeb-Umbach, R. (2011). Conditional Bayesian Estimation Employing a Phase-Sensitive Observation Model for Noise Robust Speech Recognition. In R. Haeb-Umbach & D. Kolossa (Eds.), Robust Speech Recognition of Uncertain or Missing Data. Springer.
LibreCat
2011 | Conference Paper | LibreCat-ID: 11866 |

Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2011). A versatile Gaussian splitting approach to non-linear state estimation and its application to noise-robust ASR. In Interspeech 2011.
LibreCat
| Download (ext.)
2011 | Conference Paper | LibreCat-ID: 11911 |

Tran Vu, D. H., & Haeb-Umbach, R. (2011). On Initial Seed Selection for Frequency Domain Blind Speech Separation. In Interspeech 2011.
LibreCat
| Download (ext.)
2011 | Book (Editor) | LibreCat-ID: 11945 |

Kolossa, D., & Haeb-Umbach, R. (Eds.). (2011). Robust Speech Recognition of Uncertain or Missing Data --- Theory and Applications. Springer.
LibreCat
| Download (ext.)
2011 | Conference Paper | LibreCat-ID: 11889 |

Schmalenstroeer, J., Bartek, M., & Haeb-Umbach, R. (2011). Unsupervised learning of acoustic events using dynamic time warping and hierarchical K-means++ clustering. Interspeech 2011.
LibreCat
| Download (ext.)
2011 | Conference Paper | LibreCat-ID: 11896 |

Schmalenstroeer, J., Jacob, F., Haeb-Umbach, R., Hennecke, M., & Fink, G. A. (2011). Unsupervised Geometry Calibration of Acoustic Sensor Networks Using Source Correspondences. Interspeech 2011.
LibreCat
| Download (ext.)
2011 | Conference Paper | LibreCat-ID: 9456 |

Schmalenstroeer, J., Bartek, M., & Haeb-Umbach, R. (2011). Investigations into Features for Robust Classification into Broad Acoustic Categories. 37. Deutsche Jahrestagung Fuer Akustik (DAGA 2011).
LibreCat
| Download (ext.)
2010 | Conference Paper | LibreCat-ID: 11726 |

Bevermeier, M., Walter, O., Peschke, S., & Haeb-Umbach, R. (2010). Barometric height estimation combined with map-matching in a loosely-coupled Kalman-filter. In 7th Workshop on Positioning Navigation and Communication (WPNC 2010) (pp. 128–134). https://doi.org/10.1109/WPNC.2010.5650745
LibreCat
| DOI
| Download (ext.)
2010 | Journal Article | LibreCat-ID: 11846 |

Krueger, A., & Haeb-Umbach, R. (2010). Model-Based Feature Enhancement for Reverberant Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 18(7), 1692–1707. https://doi.org/10.1109/TASL.2010.2049684
LibreCat
| DOI
| Download (ext.)
2010 | Conference Paper | LibreCat-ID: 11857 |

Leutnant, V., & Haeb-Umbach, R. (2010). Options for Modelling Temporal Statistical Dependencies in an Acoustic Model for ASR. In 36. Deutsche Jahrestagung fuer Akustik (DAGA 2010).
LibreCat
| Download (ext.)
2010 | Conference Paper | LibreCat-ID: 11858 |

Leutnant, V., & Haeb-Umbach, R. (2010). On the Exploitation of Hidden Markov Models and Linear Dynamic Models in a Hybrid Decoder Architecture for Continuous Speech Recognition. In Interspeech 2010.
LibreCat
| Download (ext.)
2010 | Conference Paper | LibreCat-ID: 11887 |

Raj, B., Wilson, K. W., Krueger, A., & Haeb-Umbach, R. (2010). Ungrounded Independent Non-Negative Factor Analysis. In Interspeech 2010.
LibreCat
| Download (ext.)
2010 | Conference Paper | LibreCat-ID: 11912 |

Tran Vu, D. H., & Haeb-Umbach, R. (2010). An EM Approach to Integrated Multichannel Speech Separation and Noise Suppression. In International Workshop on Acoustic Echo and Noise Control (IWAENC 2010).
LibreCat
| Download (ext.)
2010 | Conference Paper | LibreCat-ID: 11913 |

Tran Vu, D. H., & Haeb-Umbach, R. (2010). Blind speech separation employing directional statistics in an Expectation Maximization framework. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010) (pp. 241–244). https://doi.org/10.1109/ICASSP.2010.5495994
LibreCat
| DOI
| Download (ext.)
2010 | Journal Article | LibreCat-ID: 11892 |

Schmalenstroeer, J., & Haeb-Umbach, R. (2010). Online Diarization of Streaming Audio-Visual Data for Smart Environments. IEEE Journal of Selected Topics in Signal Processing, 4(5), 845–856. https://doi.org/10.1109/JSTSP.2010.2050519
LibreCat
| DOI
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11723 |

Bevermeier, M., Peschke, S., & Haeb-Umbach, R. (2009). Robust vehicle localization based on multi-level sensor fusion and online parameter estimation. In 6th Workshop on Positioning Navigation and Communication (WPNC 2009) (pp. 235–242). https://doi.org/10.1109/WPNC.2009.4907833
LibreCat
| DOI
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11724 |

Bevermeier, M., Peschke, S., & Haeb-Umbach, R. (2009). Joint Parameter Estimation and Tracking in a Multi-Stage Kalman Filter for Vehicle Positioning. In IEEE 69th Vehicular Technology Conference (VTC 2009 Spring) (pp. 1–5). https://doi.org/10.1109/VETECS.2009.5073634
LibreCat
| DOI
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11725 |

Bevermeier, M., Peschke, S., & Haeb-Umbach, R. (2009). Eine Plattform fuer Mehrwertdienste im Bereich Logistik - Drahtlose Fahrzeug- und Laderaumueberwachung fuer LKW mit Hilfe einer Maut-On-Board Unit. In DGON Navigationskonvent 2009.
LibreCat
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11847 |

Krueger, A., & Haeb-Umbach, R. (2009). Model based feature enhancement for automatic speech recognition in reverberant environments. In Interspeech 2009.
LibreCat
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11859 |

Leutnant, V., & Haeb-Umbach, R. (2009). On the Estimation and Use of Feature Reliability Information for Noise Robust Speech Recognition. In International Conference on Acoustics (NAG/DAGA 2009).
LibreCat
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11860 |

Leutnant, V., & Haeb-Umbach, R. (2009). An analytic derivation of a phase-sensitive observation model for noise robust speech recognition. In Interspeech 2009.
LibreCat
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11881 |

Peschke, S., Bevermeier, M., & Haeb-Umbach, R. (2009). A GPS positioning approach exploiting GSM velocity estimates. In 6th Workshop on Positioning Navigation and Communication (WPNC 2009) (pp. 195–202). https://doi.org/10.1109/WPNC.2009.4907827
LibreCat
| DOI
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11882 |

Peschke, S., Bevermeier, M., & Haeb-Umbach, R. (2009). Verbesserung von GPS-basierter Ortung durch GSM-Geschwindigkeitsschaetzungen. In DGON Navigationskonvent 2009.
LibreCat
| Download (ext.)
2009 | Journal Article | LibreCat-ID: 11937 |

Windmann, S., & Haeb-Umbach, R. (2009). Approaches to Iterative Speech Feature Enhancement and Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 17(5), 974–984. https://doi.org/10.1109/TASL.2009.2014894
LibreCat
| DOI
| Download (ext.)
2009 | Journal Article | LibreCat-ID: 11938 |

Windmann, S., & Haeb-Umbach, R. (2009). Parameter Estimation of a State-Space Model of Noise for Robust Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 17(8), 1577–1590. https://doi.org/10.1109/TASL.2009.2023172
LibreCat
| DOI
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11900 |

Schmalenstroeer, J., Leutnant, V., & Haeb-Umbach, R. (2009). Audio-Visual Data Processing for Ambient Communication. 1st International Workshop on Distributed Computing in Ambient Environments within 32nd Annual Conference on Artificial Intelligence.
LibreCat
| Files available
2009 | Conference Paper | LibreCat-ID: 11806 |

Hennecke, M., Ploetz, T., Fink, G. A., Schmalenstroeer, J., & Haeb-Umbach, R. (2009). A hierarchical approach to unsupervised shape calibration of microphone array networks. IEEE/SP 15th Workshop on Statistical Signal Processing (SSP 2009), 257–260. https://doi.org/10.1109/SSP.2009.5278589
LibreCat
| DOI
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11899 |

Schmalenstroeer, J., Kelling, M., Leutnant, V., & Haeb-Umbach, R. (2009). Fusing Audio and Video Information for Online Speaker Diarization. Interspeech 2009.
LibreCat
| Download (ext.)
2008 | Journal Article | LibreCat-ID: 11776 |

Haeb-Umbach, R. (2008). Uncertainty Decoding in Automatic Speech Recognition. 2008 ITG Conference on Voice Communication (SprachKommunikation), 1–7.
LibreCat
| Download (ext.)
2008 | Book Chapter | LibreCat-ID: 11789 |

Haeb-Umbach, R., & Ion, V. (2008). Error Concealement. In B. Lindenberg & Z.-H. Tan (Eds.), Automatic Speech Recognition on Mobile Devices and over Communication Networks (Vol. Advances in Computer Vision and Pattern Recognition, pp. 187–210). Springer.
LibreCat
| Download (ext.)
2008 | Journal Article | LibreCat-ID: 11820 |

Ion, V., & Haeb-Umbach, R. (2008). A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 16(5), 1047–1060. https://doi.org/10.1109/TASL.2008.925879
LibreCat
| DOI
| Download (ext.)
2008 | Journal Article | LibreCat-ID: 11821 |

Ion, V., & Haeb-Umbach, R. (2008). Investigations into Uncertainty Decoding Employing a Discrete Feature Space for Noise Robust Automatic Speech Recognition. 2008 ITG Conference on Voice Communication (SprachKommunikation), 1–4.
LibreCat
| Download (ext.)
2008 | Conference Paper | LibreCat-ID: 11851 |

Krueger, A., Warsitz, E., & Haeb-Umbach, R. (2008). Blinde Akustische Strahlformung fuer Anwendungen im KFZ. In 34. Deutsche Jahrestagung fuer Akustik (DAGA 2008).
LibreCat
| Download (ext.)
2008 | Journal Article | LibreCat-ID: 11914 |

Tran Vu, D. H., & Haeb-Umbach, R. (2008). Blind Speech Separation in Presence of Correlated Noise with Generalized Eigenvector Beamforming. 2008 ITG Conference on Voice Communication (SprachKommunikation), 1–4.
LibreCat
| Download (ext.)
2008 | Conference Paper | LibreCat-ID: 11915 |

Tran Vu, D. H., Krueger, A., & Haeb-Umbach, R. (2008). Generalized Eigenvector Blind Speech Separation Under Coherent Noise In A GSC Configuration. In International Workshop on Acoustic Echo and Noise Control (IWAENC 2008).
LibreCat
| Download (ext.)
2008 | Conference Paper | LibreCat-ID: 11935 |

Warsitz, E., Krueger, A., & Haeb-Umbach, R. (2008). Speech enhancement with a new generalized eigenvector blocking matrix for application in a generalized sidelobe canceller. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008) (pp. 73–76). https://doi.org/10.1109/ICASSP.2008.4517549
LibreCat
| DOI
| Download (ext.)
2008 | Conference Paper | LibreCat-ID: 11939 |

Windmann, S., & Haeb-Umbach, R. (2008). Modeling the dynamics of speech and noise for speech feature enhancement in ASR. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008) (pp. 4409–4412). https://doi.org/10.1109/ICASSP.2008.4518633
LibreCat
| DOI
| Download (ext.)
2008 | Journal Article | LibreCat-ID: 11940 |

Windmann, S., & Haeb-Umbach, R. (2008). A novel approach to noise estimation in model-based speech feature enhancement. 2008 ITG Conference on Voice Communication (SprachKommunikation), 1–4.
LibreCat
| Download (ext.)
2008 | Journal Article | LibreCat-ID: 11944 |

Windmann, S., Haeb-Umbach, R., & Leutnant, V. (2008). A segmental HMM based on a modified emission probability. 2008 ITG Conference on Voice Communication (SprachKommunikation), 1–4.
LibreCat
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11720 |

Bevermeier, M., Ebel, T., & Haeb-Umbach, R. (2007). Channel Estimation by Exploiting Sublayer Information in OFDM Systems. In Multi-Carrier Spread Spectrum 2007.
LibreCat
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11722 |

Bevermeier, M., & Haeb-Umbach, R. (2007). Combined Time and Frequency Domain OFDM Channel Estimation. In Multi-Carrier Spread Spectrum 2007.
LibreCat
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11785 |

Haeb-Umbach, R., & Bevermeier, M. (2007). OFDM Channel Estimation Based on Combined Estimation in Time and Frequency Domain. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007) (Vol. 3, pp. III-277-III–280). https://doi.org/10.1109/ICASSP.2007.366526
LibreCat
| DOI
| Download (ext.)
2007 | Journal Article | LibreCat-ID: 11799 |

Haeb-Umbach, R., & Peschke, S. (2007). A Novel Similarity Measure for Positioning Cellular Phones by a Comparison With a Database of Signal Power Levels. IEEE Transactions on Vehicular Technology, 56(1), 368–372. https://doi.org/10.1109/TVT.2006.889563
LibreCat
| DOI
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11822 |

Ion, V., & Haeb-Umbach, R. (2007). Multi-Resolution Soft Features for Channel-Robust Distributed Speech Recognition. In Interspeech 2007.
LibreCat
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11883 |

Peschke, S., & Haeb-Umbach, R. (2007). Velocity Estimation of Mobile Terminals by Exploiting GSM Downlink Signalling. In 4th Workshop on Positioning Navigation and Communication (WPNC 2007) (pp. 217–222). https://doi.org/10.1109/WPNC.2007.353637
LibreCat
| DOI
| Download (ext.)
2007 | Journal Article | LibreCat-ID: 11927 |

Warsitz, E., & Haeb-Umbach, R. (2007). Blind Acoustic Beamforming Based on Generalized Eigenvalue Decomposition. IEEE Transactions on Audio, Speech, and Language Processing, 15(5), 1529–1539. https://doi.org/10.1109/TASL.2007.898454
LibreCat
| DOI
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11934 |

Warsitz, E., Haeb-Umbach, R., & Tran Vu, D. H. (2007). Blind Adaptive Principal Eigenvector Beamforming for Acoustical Source Separation. In Interspeech 2007.
LibreCat
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11941 |

Windmann, S., & Haeb-Umbach, R. (2007). An Approach to Iterative Speech Feature Enhancement and Recognition. In Interspeech 2007.
LibreCat
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11893 |

Schmalenstroeer, J., & Haeb-Umbach, R. (2007). Joint Speaker Segmentation, Localization and Identification for Streaming Audio. Interspeech 2007.
LibreCat
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11901 |

Schmalenstroeer, J., Leutnant, V., & Haeb-Umbach, R. (2007). Amigo Context Management Service with Applications in Ambient Communication Scenarios. AMI-07 - European Conference on Ambient Intelligence.
LibreCat
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11933 |

Warsitz, E., Haeb-Umbach, R., & Schmalenstroeer, J. (2007). Zweistufige Sprache/Pause-Detektion in stark gestoerter Umgebung. 33. Deutsche Jahrestagung Fuer Akustik (DAGA 2007).
LibreCat
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11902 |

Schmalenstroeer, J., Warsitz, E., & Haeb-Umbach, R. (2007). Projekt Amigo - Sprachsignalverarbeitung im vernetzten Haus. 33. Deutsche Jahrestagung Fuer Akustik (DAGA 2007).
LibreCat
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11823 |

Ion, V., & Haeb-Umbach, R. (2006). Comparison of Decoder-based Transmission Error Compensation Techniques for Distributed Speech Recognition. In 7. ITG-Fachtagung Sprachkommunikation.
LibreCat
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11824 |

Ion, V., & Haeb-Umbach, R. (2006). An Inexpensive Packet Loss Compensation Scheme for Distributed Speech Recognition Based on Soft-Features. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2006) (Vol. 1, p. I). https://doi.org/10.1109/ICASSP.2006.1659984
LibreCat
| DOI
| Download (ext.)
2006 | Journal Article | LibreCat-ID: 11825 |

Ion, V., & Haeb-Umbach, R. (2006). Uncertainty decoding for distributed speech recognition over error-prone networks. Speech Communication, 48(11), 1435–1446. https://doi.org/10.1016/j.specom.2006.03.007
LibreCat
| DOI
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11826 |

Ion, V., & Haeb-Umbach, R. (2006). Improved Source Modeling and Predictive Classification for Channel Robust Speech Recognition. In Interspeech 2006.
LibreCat
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11884 |

Peschke, S., & Haeb-Umbach, R. (2006). A Probabilistic Similarity Measure and a Non-Linear Post-Filter for Mobile Phone Positioning using GSM Signal Power Measurements. In European Navigation Conference \& Exhibition (ENC 2006).
LibreCat
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11885 |

Peschke, S., & Haeb-Umbach, R. (2006). Particle Filtering of Database assisted Positioning Estimates using a novel Similarity Measure for GSM Signal Power Level Measurements. In 3rd Workshop on Positioning Navigation and Communication (WPNC 2006).
LibreCat
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11928 |

Warsitz, E., & Haeb-Umbach, R. (2006). Mehrkanalige Sprachsignalverarbeitung durch adaptives Eigenbeamforming fuer Freisprecheinrichtungen im Kraftfahrzeug. In 32. Deutsche Jahrestagung fuer Akustik (DAGA 2006).
LibreCat
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11929 |

Warsitz, E., & Haeb-Umbach, R. (2006). Controlling Speech Distortion in Adaptive Frequency-Domain Principal Eigenvector Beamforming. In International Workshop on Acoustic Echo and Noise Control (IWAENC 2006).
LibreCat
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11942 |

Windmann, S., & Haeb-Umbach, R. (2006). Einkanalige Sprachsignalverbesserung mit Hilfe eines marginalisierten Partikelfilters. In 7. ITG-Fachtagung Sprachkommunikation.
LibreCat
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11943 |

Windmann, S., & Haeb-Umbach, R. (2006). Iterative Speech Enhancement using a Non-Linear Dynamic State Model of Speech and its Parameters. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2006) (Vol. 1, p. I). https://doi.org/10.1109/ICASSP.2006.1660058
LibreCat
| DOI
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11894 |

Schmalenstroeer, J., & Haeb-Umbach, R. (2006). Online Speaker Change Detection by Combining BIC with Microphone Array Beamforming. Interspeech 2006.
LibreCat
| Download (ext.)
2005 | Conference Paper | LibreCat-ID: 11803 |

Haeb-Umbach, R., & Warsitz, E. (2005). Adaptive Filter-and-Sum Beamforming in Spatially Correlated Noise. In International Workshop on Acoustic Echo and Noise Control (IWAENC 2005).
LibreCat
| Download (ext.)
2005 | Conference Paper | LibreCat-ID: 11827 |

Ion, V., & Haeb-Umbach, R. (2005). A Unified Probabilistic Approach to Error Concealment for Distributed Speech Recognition. In Interspeech 2005.
LibreCat
| Download (ext.)
2005 | Conference Paper | LibreCat-ID: 11828 |

Ion, V., & Haeb-Umbach, R. (2005). A Comparison of Soft-Feature Distributed Speech Recognition with Candidate Codecs for Speech Enabled Mobile Services. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2005) (Vol. 1, pp. 333–336). https://doi.org/10.1109/ICASSP.2005.1415118
LibreCat
| DOI
| Download (ext.)
2005 | Conference Paper | LibreCat-ID: 11930 |

Warsitz, E., & Haeb-Umbach, R. (2005). Acoustic filter-and-sum beamforming by adaptive principal component analysis. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2005) (Vol. 4, p. iv/797-iv/800 Vol. 4). https://doi.org/10.1109/ICASSP.2005.1416129
LibreCat
| DOI
| Download (ext.)
2005 | Conference Paper | LibreCat-ID: 11802 |

Haeb-Umbach, R., & Schmalenstroeer, J. (2005). Speech Processing in the Networked Home Environment - A View on the Amigo Project. Interspeech 2005.
LibreCat
| Download (ext.)
2005 | Conference Paper | LibreCat-ID: 11801 |

Haeb-Umbach, R., & Schmalenstroeer, J. (2005). A Comparison of Particle Filtering Variants for Speech Feature Enhancement. Interspeech 2005.
LibreCat
| Download (ext.)
2004 | Journal Article | LibreCat-ID: 11732 |

Bischoff, R., Haeb-Umbach, R., & Nammi, S. R. (2004). Multipath-Resistant Time of Arrival Estimation for Satellite Positioning. AEUe, Int. Journal on Electronics and Communications, 58(1).
LibreCat
| Download (ext.)
2004 | Conference Paper | LibreCat-ID: 11790 |

Haeb-Umbach, R., & Ion, V. (2004). Soft Features for Improved Distributed Speech Recognition over Wireless Networks. In International Conference on Spoken Language Processing (ICSLP 2004).
LibreCat
| Download (ext.)
2004 | Conference Paper | LibreCat-ID: 11931 |

Warsitz, E., & Haeb-Umbach, R. (2004). Robust speaker direction estimation with particle filtering. In IEEE Workshop on Multimedia Signal Processing (MMSP 2004) (pp. 367–370). https://doi.org/10.1109/MMSP.2004.1436569
LibreCat
| DOI
| Download (ext.)
2004 | Conference Paper | LibreCat-ID: 11932 |

Warsitz, E., Haeb-Umbach, R., & Peschke, S. (2004). Adaptive Beamforming Combined with Particle Filtering for Acoustic Source Localization. In International Conference on Spoken Language Processing (ICSLP 2004).
LibreCat
| Download (ext.)
2003 | Journal Article | LibreCat-ID: 11777 |

Haeb-Umbach, R. (2003). Auf ein Wort - Moeglichkeiten und Grenzen der automatischen Spracherkennung. Forschungsforum Paderborn, 68–71.
LibreCat
| Download (ext.)
2002 | Journal Article | LibreCat-ID: 11727 |

Beyerlein, P., Aubert, X., Haeb-Umbach, R., Harris, M., Klakow, D., Wendemuth, A., … Sixtus, A. (2002). Large Vocabulary Continuous Speech Recognition of Broadcast News - The Philips/RWTH Approach. Speech Communication, (37), 109–131.
LibreCat
| Download (ext.)
2002 | Conference Paper | LibreCat-ID: 11731 |

Bischoff, R., Haeb-Umbach, R., & Heinrichs, G. (2002). A Joint Time Multiplex Receiver for UMTS and Galileo. In ION-GPS 2002.
LibreCat
| Download (ext.)
2002 | Conference Paper | LibreCat-ID: 11733 |

Bischoff, R., Haeb-Umbach, R., Schulz, W., & Heinrichs, G. (2002). Employment of a multipath receiver structure in a combined GALILEO/UMTS receiver. In IEEE 55th Vehicular Technology Conference (VTC 2002 Spring) (Vol. 4, pp. 1844–1848 vol.4). https://doi.org/10.1109/VTC.2002.1002940
LibreCat
| DOI
| Download (ext.)
2002 | Conference Paper | LibreCat-ID: 11808 |

Hesse, T., Bischoff, R., Schulz, W., & Haeb-Umbach, R. (2002). Estimation of Bias Location Error due to Absence of the LOS-Signal in a UMTS-System. In International Symposium on Location Based Services for Cellular Users (LOCELLUS 2002).
LibreCat
| Download (ext.)
2001 | Conference Paper | LibreCat-ID: 11734 |

Bischoff, R., Haeb-Umbach, R., Schulz, W., & Heinrichs, G. (2001). Implementation of a Rake Receiver Architecture into a Galileo Receiver. In 1st ESA Workshop on Satellite Navigation User Equipment Technology (Navitec 2001).
LibreCat
| Download (ext.)
2001 | Journal Article | LibreCat-ID: 11778 |

Haeb-Umbach, R. (2001). Automatic generation of phonetic regression class trees for MLLR adaptation. IEEE Transactions on Speech and Audio Processing, 9(3), 299–302. https://doi.org/10.1109/89.906003
LibreCat
| DOI
| Download (ext.)
2001 | Journal Article | LibreCat-ID: 11870 |

Loog, M., Duin, R. P. W., & Haeb-Umbach, R. (2001). Multiclass linear dimension reduction by weighted pairwise Fisher criteria. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(7), 762–766. https://doi.org/10.1109/34.935849
LibreCat
| DOI
| Download (ext.)
2000 | Conference Paper | LibreCat-ID: 11758 |

Duin, R. P. W., Loog, M., & Haeb-Umbach, R. (2000). Multi-class Linear Feature Extraction by Nonlinear PCA. In International Conference on Pattern Recognition (ICPR 2000).
LibreCat
| Download (ext.)
2000 | Conference Paper | LibreCat-ID: 11779 |

Haeb-Umbach, R. (2000). Data-driven Phonetic Regression Class Tree Estimation for MLLR Adaptation. In International Conference on Spoken Language Processing (ICSLP 2000).
LibreCat
| Download (ext.)
2000 | Conference Paper | LibreCat-ID: 11869 |

Lieb, M., & Haeb-Umbach, R. (2000). LDA derived cepstral trajectory filters in adverse environmental conditions. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000) (Vol. 2, pp. II1105-II1108 vol.2). https://doi.org/10.1109/ICASSP.2000.859157
LibreCat
| DOI
| Download (ext.)
2000 | Conference Paper | LibreCat-ID: 11871 |

Loog, M., & Haeb-Umbach, R. (2000). Multi-class Linear Dimension Reduction by Generalized Fisher Criteria. In International Conference on Spoken Language Processing (ICSLP 2000).
LibreCat
| Download (ext.)
1999 | Conference Paper | LibreCat-ID: 11728 |

Beyerlein, P., Aubert, X. L., Haeb-Umbach, R., Harris, M. J., Klakow, D., Wendemuth, A., … Sixtus, A. (1999). The Philips/RWTH system for transcription of broadcast news. In Eurospeech.
LibreCat
| Download (ext.)
1999 | Conference Paper | LibreCat-ID: 11729 |

Beyerlein, P., Aubert, X. L., Haeb-Umbach, R., Harris, M. J., Klakow, D., Wendemuth, A., … Sixtus, A. (1999). The Philips/RWTH System for Transcription of Broadcast News. In Broadcast News Transcription and Understanding Workshop, Washington.
LibreCat
| Download (ext.)
1999 | Conference Paper | LibreCat-ID: 11780 |

Haeb-Umbach, R. (1999). Investigations on inter-speaker variability in the feature space. In ICASSP99 Phoenix, AZ.
LibreCat
| Download (ext.)
1999 | Conference Paper | LibreCat-ID: 11791 |

Haeb-Umbach, R., & Loog, M. (1999). An Investigation of Cepstral Parameterisations for Large Vocabulary Speech Recognition. In Eurospeech.
LibreCat
| Download (ext.)
1999 | Conference Paper | LibreCat-ID: 11805 |

Harris, M. J., Aubert, X. L., Haeb-Umbach, R., & Beyerlein, P. (1999). A study of broadcast news audio stream segmentation and segment clustering. In Eurospeech.
LibreCat
| Download (ext.)
1998 | Conference Paper | LibreCat-ID: 11730 |

Beyerlein, P., Aubert, X. L., Haeb-Umbach, R., Klakow, D., Ullrich, M., Wendemuth, A., & Wilcox, P. (1998). Automatic Transcription of English Broadcast News. In DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne.
LibreCat
| Download (ext.)
1998 | Conference Paper | LibreCat-ID: 11784 |

Haeb-Umbach, R., Aubert, X. L., Beyerlein, P., Klakow, D., Ullrich, M., Wendemuth, A., & Wilcox, P. (1998). Acoustic Modeling in the Philips Hub-4 Continuous-Speech Recognition System. In DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne.
LibreCat
| Download (ext.)
1998 | Conference Paper | LibreCat-ID: 11842 |

Klakow, D., Aubert, X. L., Haeb-Umbach, R., Beyerlein, P., Ullrich, M., Wendemuth, A., & Wilcox, P. (1998). Language-Model Investigations related to Broadcast News. In DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne.
LibreCat
| Download (ext.)
1998 | Conference Paper | LibreCat-ID: 11936 |

Welling, L., Haeb-Umbach, R., Aubert, X., & Haberland, N. (1998). A Study on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive Training. In ICASSP 1998, Seattle.
LibreCat
| Download (ext.)
1997 | Conference Paper | LibreCat-ID: 11750 |

Dolfing, J. G. A., & Haeb-Umbach, R. (1997). Signal Representations for Hidden Markov Model Based On-Line Handwriting Recognition. In ICASSP, Munich.
LibreCat
| Download (ext.)
1997 | Journal Article | LibreCat-ID: 11766
Gamm, S., Haeb-Umbach, R., & Langmann, D. (1997). The development of a command-based speech interface for a telephone answering machine. Speech Communication.
LibreCat
1997 | Conference Paper | LibreCat-ID: 11781 |

Haeb-Umbach, R. (1997). Robust Speech Recognition for Wireless Networks and Mobile Telephony. In Eurospeech.
LibreCat
| Download (ext.)
1997 | Conference Paper | LibreCat-ID: 11819 |

Hoege, H., Tropf, H. S., Winsky, R., van den Heuvel, H., Haeb-Umbach, R., & Choukri, K. (1997). European Speech Databases for Telephone Applications. In ICASSP, Munich.
LibreCat
| Download (ext.)
1997 | Conference Paper | LibreCat-ID: 11852 |

Langmann, D., Fischer, A., Wuppermann, F., Haeb-Umbach, R., & Eisele, T. (1997). Acoustic Front Ends for Speaker-Independent Digit Recognition in Car Environments. In Eurospeech.
LibreCat
| Download (ext.)
1997 | Conference Paper | LibreCat-ID: 11855
Langmann, D., Wuppermann, F., Haeb-Umbach, R., Fischer, A., & Eisele, T. (1997). Investigation of Acoustic Front Ends for Speaker-Independent Speech Recognition in the Car. In Aachener Kolloquium on Signal Theory.
LibreCat
1996 | Conference Paper | LibreCat-ID: 11761 |

Eisele, T., Haeb-Umbach, R., & Langmann, D. (1996). A Comparative Study of Linear Feature Transformation Techniques for Automatic Speech Recognition. In ICSLP , Philadelphia.
LibreCat
| Download (ext.)
1996 | Conference Paper | LibreCat-ID: 11767 |

Gamm, S., Haeb-Umbach, R., & Langmann, D. (1996). Findings with the Design of a Command-Based Speech Interface for a Voice Mail System. In IEEE Workshop on Interactive Voice Technology for Telecommunications Applications.
LibreCat
| Download (ext.)
1996 | Conference Paper | LibreCat-ID: 11853 |

Langmann, D., & Haeb-Umbach, R. (1996). FRESCO: The French Telephone Speech Data Collection - Part of the European SpeechDat(M) Project. In ICSLP, Philadelphia.
LibreCat
| Download (ext.)
1996 | Conference Paper | LibreCat-ID: 11854
Langmann, D., Haeb-Umbach, R., & Eisele, T. (1996). Robust Rejection Modeling for a Small-Vocabulary Application. In ITG Fachtagung Sprachkommunikation, Frankfurt.
LibreCat
1995 | Conference Paper | LibreCat-ID: 11757 |

Dugast, C., Beyerlein, P., & Haeb-Umbach, R. (1995). Application of Clustering Techniques to Mixture Density Modelling for Continuous-Speech Recognition. In ICASSP, Detroit.
LibreCat
| Download (ext.)
1995 | Journal Article | LibreCat-ID: 11764
Gamm, S., & Haeb-Umbach, R. (1995). User interface design of voice controlled consumer electronics. Philips Journal of Research.
LibreCat
1995 | Conference Paper | LibreCat-ID: 11765
Gamm, S., & Haeb-Umbach, R. (1995). Human Factors of a Voice-Controlled Car Stereo. In Eurospeech, Madrid.
LibreCat
1995 | Conference Paper | LibreCat-ID: 11768
Gamm, S., Haeb-Umbach, R., & Langmann, D. (1995). The Usability Engineering of a Voice-Controlled Answering Machine. In International Symposium on Human Factors in Telecommunications, Melbourne.
LibreCat
1995 | Journal Article | LibreCat-ID: 11786
Haeb-Umbach, R., Beyerlein, P., & Geller, D. (1995). Speech recognition algorithms for voice control interfaces. Philips Journal of Research.
LibreCat
1995 | Conference Paper | LibreCat-ID: 11787 |

Haeb-Umbach, R., Beyerlein, P., & Thelen, E. (1995). Automatic Transcription of Unknown Words in a Speech Recognition System. In ICASSP, Detroit.
LibreCat
| Download (ext.)
1995 | Journal Article | LibreCat-ID: 11905
Steinbiss, V., Ney, H. J., Aubert, X. L., Besling, S., Dugast, C., Essen, U., … Tran, B. H. (1995). The Philips Research system for continuous-speech dictation. Philips Journal of Research.
LibreCat
1995 | Journal Article | LibreCat-ID: 11948
Steinbiss, V., Ney, H. J., Essen, U., Tran, B. H., Aubert, X. L., Dugast, C., … Bartosik, H. (1995). Continuous speech dictation - From theory to practice. Speech Communication.
LibreCat
1994 | Journal Article | LibreCat-ID: 11796
Haeb-Umbach, R., & Ney, H. (1994). Improvements in beam search for 10000-word continuous-speech recognition. IEEE Transactions on Speech and Audio Processing.
LibreCat
1994 | Conference Paper | LibreCat-ID: 11878
Ney, H., Steinbeiss, V., Aubert, X. L., & Haeb-Umbach, R. (1994). Progress in Large-Vocabulary, Continuous Speech Recognition. In Artifical Intelligence, Progress and Prospects of Speech Research and Technology, Munich.
LibreCat
318 Publications
2024 | Conference Paper | LibreCat-ID: 57031 |

Gburrek, T., Meise, A., Schmalenstroeer, J., & Haeb-Umbach, R. (2024). Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation Models. 2024 18th International Workshop on Acoustic Signal Enhancement (IWAENC). https://doi.org/10.1109/iwaenc61483.2024.10694103
LibreCat
| Files available
| DOI
2024 | Report | LibreCat-ID: 57161
Werning, A., & Haeb-Umbach, R. (2024). UPB-NT submission to DCASE24: Dataset pruning for targeted knowledge distillation.
LibreCat
2024 | Conference Paper | LibreCat-ID: 57160
Werning, A., & Haeb-Umbach, R. (2024). Target-Specific Dataset Pruning for Compression of Audio Tagging Models. 32nd European Signal Processing Conference (EUSIPCO 2024). 32nd European Signal Processing Conference, Lyon.
LibreCat
| Files available
2024 | Conference Paper | LibreCat-ID: 57099
Xie, Y., Kuhlmann, M., Rautenberg, F., Tan, Z.-H., & Häb-Umbach, R. (2024). Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder. 2024 32nd European Signal Processing Conference (EUSIPCO), 436–440.
LibreCat
2024 | Conference Paper | LibreCat-ID: 56004 |

von Neumann, T., Boeddeker, C., Cord-Landwehr, T., Delcroix, M., & Haeb-Umbach, R. (2024). Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization. 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW). https://doi.org/10.1109/icasspw62465.2024.10625894
LibreCat
| Files available
| DOI
2024 | Journal Article | LibreCat-ID: 52958 |

Boeddeker, C., Subramanian, A. S., Wichern, G., Haeb-Umbach, R., & Le Roux, J. (2024). TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 32, 1185–1197. https://doi.org/10.1109/taslp.2024.3350887
LibreCat
| DOI
| Download (ext.)
2024 | Conference Paper | LibreCat-ID: 53659
Cord-Landwehr, T., Boeddeker, C., Zorilă, C., Doddipatla, R., & Haeb-Umbach, R. (2024). Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul. https://doi.org/10.1109/icassp48485.2024.10445911
LibreCat
| DOI
2024 | Preprint | LibreCat-ID: 57085 |

Cord-Landwehr, T., Boeddeker, C., & Haeb-Umbach, R. (2024). Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models.
LibreCat
| Download (ext.)
2024 | Conference Paper | LibreCat-ID: 56272 |

Boeddeker, C., Cord-Landwehr, T., & Haeb-Umbach, R. (2024). Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment. Interspeech 2024. https://doi.org/10.21437/interspeech.2024-1286
LibreCat
| DOI
| Download (ext.)
2024 | Conference Paper | LibreCat-ID: 57659 |

Vieting, P., Berger, S., von Neumann, T., Boeddeker, C., Schlüter, R., & Haeb-Umbach, R. (2024). Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription. 2024 IEEE Spoken Language Technology Workshop (SLT).
LibreCat
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48269 |

Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2023). On the Integration of Sampling Rate Synchronization and Acoustic Beamforming. European Signal Processing Conference (EUSIPCO). European Signal Processing Conference (EUSIPCO), Helsinki.
LibreCat
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48270 |

Schmalenstroeer, J., Gburrek, T., & Haeb-Umbach, R. (2023). LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices. ITG Conference on Speech Communication. ITG Conference on Speech Communication, Aachen.
LibreCat
| Files available
2023 | Conference Paper | LibreCat-ID: 48355 |

Rautenberg, F., Kuhlmann, M., Wiechmann, J., Seebauer, F., Wagner, P., & Haeb-Umbach, R. (2023). On Feature Importance and Interpretability of Speaker Representations. ITG Conference on Speech Communication. ITG Conference on Speech Communication, Aachen.
LibreCat
| Files available
| Download (ext.)
| arXiv
2023 | Conference Paper | LibreCat-ID: 48410 |

Wiechmann, J., Rautenberg, F., Wagner, P., & Haeb-Umbach, R. (2023). Explaining voice characteristics to novice voice practitioners-How successful is it? 20th International Congress of the Phonetic Sciences (ICPhS) .
LibreCat
| Files available
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 46069
Seebauer, F., Kuhlmann, M., Haeb-Umbach, R., & Wagner, P. (2023). Re-examining the quality dimensions of synthetic speech. 12th Speech Synthesis Workshop (SSW) 2023.
LibreCat
2023 | Journal Article | LibreCat-ID: 35602 |

von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31, 576–589. https://doi.org/10.1109/taslp.2022.3228629
LibreCat
| Files available
| DOI
2023 | Conference Paper | LibreCat-ID: 49109 |

Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2023). Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks. Proc. Asilomar Conference on Signals, Systems, and Computers. 57th Asilomar Conference on Signals, Systems, and Computers.
LibreCat
| Files available
2023 | Conference Paper | LibreCat-ID: 44849 |

Rautenberg, F., Kuhlmann, M., Ebbers, J., Wiechmann, J., Seebauer, F., Wagner, P., & Haeb-Umbach, R. (2023). Speech Disentanglement for Analysis and Modification of Acoustic and Perceptual Speaker Characteristics. Fortschritte Der Akustik - DAGA 2023, 1409–1412.
LibreCat
| Files available
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 49111
Ebbers, J., Haeb-Umbach, R., & Serizel, R. (2023). Post-Processing Independent Evaluation of Sound Event Detection Systems. Proceedings of the 8th Detection and Classification of Acoustic Scenes and Events 2023 Workshop (DCASE2023), 36–40.
LibreCat
| Files available
2023 | Conference Paper | LibreCat-ID: 57098
Seebauer, F., Kuhlmann, M., Häb-Umbach, R., & Wagner, P. (2023). DISCERNING DIMENSIONS OF QUALITY FOR STATE OF THE ART SYNTHETIC SPEECH. Proceedings of the 20th International Congress of Phonetic Sciences. International Congress of Phonetic Sciences (ICPhS), Prague.
LibreCat
2023 | Conference Paper | LibreCat-ID: 57086
Kuhlmann, M., Meise, A., Seebauer, F., Wagner, P., & Häb-Umbach, R. (2023). Investigating Speaker Embedding Disentanglement on Natural Read Speech. Speech Communication; 15th ITG Conference, 121–125.
LibreCat
2023 | Conference Paper | LibreCat-ID: 48281 |

von Neumann, T., Boeddeker, C., Kinoshita, K., Delcroix, M., & Haeb-Umbach, R. (2023). On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp49357.2023.10094784
LibreCat
| Files available
| DOI
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48275 |

von Neumann, T., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments. CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin.
LibreCat
| Files available
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 47128 |

Cord-Landwehr, T., Boeddeker, C., Zorilă, C., Doddipatla, R., & Haeb-Umbach, R. (2023). Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes. https://doi.org/10.1109/icassp49357.2023.10095370
LibreCat
| Files available
| DOI
2023 | Conference Paper | LibreCat-ID: 47129 |

Cord-Landwehr, T., Boeddeker, C., Zorilă, C., Doddipatla, R., & Haeb-Umbach, R. (2023). A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures. INTERSPEECH 2023. https://doi.org/10.21437/interspeech.2023-1379
LibreCat
| Files available
| DOI
2023 | Conference Paper | LibreCat-ID: 54439 |

Boeddeker, C., Cord-Landwehr, T., von Neumann, T., & Haeb-Umbach, R. (2023). Multi-stage diarization refinement for the CHiME-7 DASR scenario. 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023). https://doi.org/10.21437/chime.2023-10
LibreCat
| DOI
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48390 |

Berger, S., Vieting, P., Boeddeker, C., Schlüter, R., & Haeb-Umbach, R. (2023). Mixture Encoder for Joint Speech Separation and Recognition. INTERSPEECH 2023. https://doi.org/10.21437/interspeech.2023-1815
LibreCat
| DOI
| Download (ext.)
2022 | Conference Paper | LibreCat-ID: 33471
Heitkämper, J., Schmalenstroeer, J., & Haeb-Umbach, R. (n.d.). Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels. Proceedings of the 30th European Signal Processing Conference (EUSIPCO). 30th European Signal Processing Conference (EUSIPCO), Belgrad.
LibreCat
| Files available
2022 | Conference Paper | LibreCat-ID: 33847 |

Cord-Landwehr, T., von Neumann, T., Boeddeker, C., & Haeb-Umbach, R. (2022). MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg.
LibreCat
| Files available
| arXiv
2022 | Conference Paper | LibreCat-ID: 33807 |

Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2022). On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp43922.2022.9746284
LibreCat
| Files available
| DOI
2022 | Journal Article | LibreCat-ID: 33451 |

Grimm, C., Fei, T., Warsitz, E., Farhoud, R., Breddermann, T., & Haeb-Umbach, R. (2022). Warping of Radar Data Into Camera Image for Cross-Modal Supervision in Automotive Applications. IEEE Transactions on Vehicular Technology, 71(9), 9435–9449. https://doi.org/10.1109/TVT.2022.3182411
LibreCat
| Files available
| DOI
2022 | Conference Paper | LibreCat-ID: 33696 |

Wiechmann, J., Glarner, T., Rautenberg, F., Wagner, P., & Haeb-Umbach, R. (2022). Technically enabled explaining of voice characteristics. 18. Phonetik Und Phonologie Im Deutschsprachigen Raum (P&P).
LibreCat
| Files available
2022 | Conference Paper | LibreCat-ID: 33857 |

Kuhlmann, M., Seebauer, F., Ebbers, J., Wagner, P., & Haeb-Umbach, R. (2022). Investigation into Target Speaking Rate Adaptation for Voice Conversion. Interspeech 2022. https://doi.org/10.21437/interspeech.2022-10740
LibreCat
| Files available
| DOI
| Download (ext.)
2022 | Conference Paper | LibreCat-ID: 33808 |

Gburrek, T., Schmalenstroeer, J., Heitkaemper, J., & Haeb-Umbach, R. (2022). Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022), Bamberg, Germany . https://doi.org/10.1109/IWAENC53105.2022.9914772
LibreCat
| Files available
| DOI
2022 | Conference Paper | LibreCat-ID: 34072 |

Ebbers, J., Haeb-Umbach, R., & Serizel, R. (2022). Threshold Independent Evaluation of Sound Event Detection Scores. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
LibreCat
| Files available
2022 | Report | LibreCat-ID: 49113
Ebbers, J., & Haeb-Umbach, R. (2022). Pre-Training And Self-Training For Sound Event Detection In Domestic Environments.
LibreCat
| Files available
2022 | Conference Paper | LibreCat-ID: 33848 |

Cord-Landwehr, T., Boeddeker, C., von Neumann, T., Zorila, C., Doddipatla, R., & Haeb-Umbach, R. (2022). Monaural source separation: From anechoic to reverberant environments. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 2022 International Workshop on Acoustic Signal Enhancement (IWAENC).
LibreCat
| Files available
| arXiv
2022 | Conference Paper | LibreCat-ID: 33819 |

von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2022). SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp43922.2022.9746757
LibreCat
| Files available
| DOI
2022 | Misc | LibreCat-ID: 33816 |

Gburrek, T., Boeddeker, C., von Neumann, T., Cord-Landwehr, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2022). A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv. https://doi.org/10.48550/ARXIV.2205.00944
LibreCat
| Files available
| DOI
2022 | Conference Paper | LibreCat-ID: 33954 |

Boeddeker, C., Cord-Landwehr, T., von Neumann, T., & Haeb-Umbach, R. (2022). An Initialization Scheme for Meeting Separation with Spatial Mixture Models. Interspeech 2022. https://doi.org/10.21437/interspeech.2022-10929
LibreCat
| DOI
| Download (ext.)
2022 | Conference Paper | LibreCat-ID: 33958
Kinoshita, K., von Neumann, T., Delcroix, M., Boeddeker, C., & Haeb-Umbach, R. (2022). Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. Proc. Interspeech 2022, 1486–1490. https://doi.org/10.21437/Interspeech.2022-11408
LibreCat
| DOI
| Download (ext.)
2021 | Journal Article | LibreCat-ID: 21065 |

Haeb-Umbach, R., Heymann, J., Drude, L., Watanabe, S., Delcroix, M., & Nakatani, T. (2021). Far-Field Automatic Speech Recognition. Proceedings of the IEEE, 109(2), 124–148. https://doi.org/10.1109/JPROC.2020.3018668
LibreCat
| Files available
| DOI
2021 | Conference Paper | LibreCat-ID: 28256
Zhang, W., Boeddeker, C., Watanabe, S., Nakatani, T., Delcroix, M., Kinoshita, K., Ochiai, T., Kamo, N., Haeb-Umbach, R., & Qian, Y. (2021). End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp39728.2021.9414464
LibreCat
| DOI
2021 | Conference Paper | LibreCat-ID: 24000
Heitkaemper, J., Schmalenstroeer, J., Ion, V., & Haeb-Umbach, R. (2021). A Database for Research on Detection and Enhancement of Speech Transmitted over HF links. Speech Communication; 14th ITG-Symposium, 1–5.
LibreCat
2021 | Conference Paper | LibreCat-ID: 44843 |

Boeddeker, C., Rautenberg, F., & Haeb-Umbach, R. (2021). A Comparison and Combination of Unsupervised Blind Source Separation Techniques. ITG Conference on Speech Communication. ITG Conference on Speech Communication, Kiel.
LibreCat
| Files available
| Download (ext.)
| arXiv
2021 | Conference Paper | LibreCat-ID: 28259 |

Boeddeker, C., Zhang, W., Nakatani, T., Kinoshita, K., Ochiai, T., Delcroix, M., Kamo, N., Qian, Y., & Haeb-Umbach, R. (2021). Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp39728.2021.9414661
LibreCat
| Files available
| DOI
2021 | Conference Paper | LibreCat-ID: 23998 |

Schmalenstroeer, J., Heitkaemper, J., Ullmann, J., & Haeb-Umbach, R. (2021). Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech. 29th European Signal Processing Conference (EUSIPCO), 1–5.
LibreCat
| Download (ext.)
2021 | Journal Article | LibreCat-ID: 22528 |

Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2021). Geometry calibration in wireless acoustic sensor networks utilizing DoA and distance information. EURASIP Journal on Audio, Speech, and Music Processing. https://doi.org/10.1186/s13636-021-00210-x
LibreCat
| DOI
| Download (ext.)
2021 | Conference Paper | LibreCat-ID: 23994 |

Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2021). Iterative Geometry Calibration from Distance Estimates for Wireless Acoustic Sensor Networks. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp39728.2021.9413831
LibreCat
| Files available
| DOI
2021 | Conference Paper | LibreCat-ID: 23999 |

Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2021). On Source-Microphone Distance Estimation Using Convolutional Recurrent Neural Networks. Speech Communication; 14th ITG-Symposium, 1–5.
LibreCat
| Files available
2021 | Conference Paper | LibreCat-ID: 29304 |

Ebbers, J., Kuhlmann, M., Cord-Landwehr, T., & Haeb-Umbach, R. (2021). Contrastive Predictive Coding Supported Factorized Variational Autoencoder for Unsupervised Learning of Disentangled Speech Representations. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 3860–3864.
LibreCat
| Files available
2021 | Conference Paper | LibreCat-ID: 26770 |

von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2021). Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. Interspeech 2021. Interspeech. https://doi.org/10.21437/interspeech.2021-1177
LibreCat
| Files available
| DOI
2021 | Conference Paper | LibreCat-ID: 29173 |

von Neumann, T., Boeddeker, C., Kinoshita, K., Delcroix, M., & Haeb-Umbach, R. (2021). Speeding Up Permutation Invariant Training for Source Separation. Speech Communication; 14th ITG Conference. Speech Communication; 14th ITG Conference, Kiel.
LibreCat
| Files available
2021 | Conference Paper | LibreCat-ID: 29308 |

Ebbers, J., & Haeb-Umbach, R. (2021). Self-Trained Audio Tagging and Sound Event Detection in Domestic Environments. Proceedings of the 6th Detection and Classification of Acoustic Scenes and Events 2021 Workshop (DCASE2021), 226–230.
LibreCat
| Files available
2021 | Conference Paper | LibreCat-ID: 29306 |

Ebbers, J., Keyser, M. C., & Haeb-Umbach, R. (2021). Adapting Sound Recognition to A New Environment Via Self-Training. Proceedings of the 29th European Signal Processing Conference (EUSIPCO), 1135–1139.
LibreCat
| Files available
2021 | Journal Article | LibreCat-ID: 24456 |

Rohlfing, K. J., Cimiano, P., Scharlau, I., Matzner, T., Buhl, H. M., Buschmeier, H., Esposito, E., Grimminger, A., Hammer, B., Haeb-Umbach, R., Horwath, I., Hüllermeier, E., Kern, F., Kopp, S., Thommes, K., Ngonga Ngomo, A.-C., Schulte, C., Wachsmuth, H., Wagner, P., & Wrede, B. (2021). Explanation as a Social Practice: Toward a Conceptual Framework for the Social Design of AI Systems. IEEE Transactions on Cognitive and Developmental Systems, 13(3), 717–728. https://doi.org/10.1109/tcds.2020.3044366
LibreCat
| Files available
| DOI
2020 | Conference Paper | LibreCat-ID: 17763 |

Haeb-Umbach, R. (2020). Sprachtechnologien für Digitale Assistenten. In R. Böck, I. Siegert, & A. Wendemuth (Eds.), Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2020 (pp. 227–234). TUDpress, Dresden.
LibreCat
| Download (ext.)
2020 | Conference Paper | LibreCat-ID: 20700 |

Boeddeker, C., Cord-Landwehr, T., Heitkaemper, J., Zorila, C., Hayakawa, D., Li, M., … Haeb-Umbach, R. (2020). Towards a speaker diarization system for the CHiME 2020 dinner party transcription. In Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments.
LibreCat
| Files available
2020 | Journal Article | LibreCat-ID: 17598 |

Nakatani, T., Boeddeker, C., Kinoshita, K., Ikeshita, R., Delcroix, M., & Haeb-Umbach, R. (2020). Jointly optimal denoising, dereverberation, and source separation. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 1–1. https://doi.org/10.1109/TASLP.2020.3013118
LibreCat
| DOI
| Download (ext.)
2020 | Conference Paper | LibreCat-ID: 20504
Heitkaemper, J., Jakobeit, D., Boeddeker, C., Drude, L., & Haeb-Umbach, R. (2020). Demystifying TasNet: A Dissecting Approach. ICASSP 2020 Virtual Barcelona Spain.
LibreCat
| Files available
2020 | Conference Paper | LibreCat-ID: 20505
Heitkaemper, J., Schmalenstroeer, J., & Haeb-Umbach, R. (2020). Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments. INTERSPEECH 2020 Virtual Shanghai China.
LibreCat
| Files available
2020 | Conference Paper | LibreCat-ID: 20762 |

von Neumann, T., Kinoshita, K., Drude, L., Boeddeker, C., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). End-to-End Training of Time Domain Audio Separation and Recognition. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 7004–7008. https://doi.org/10.1109/ICASSP40776.2020.9053461
LibreCat
| Files available
| DOI
2020 | Conference Paper | LibreCat-ID: 20764 |

von Neumann, T., Boeddeker, C., Drude, L., Kinoshita, K., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. Proc. Interspeech 2020, 3097–3101. https://doi.org/10.21437/Interspeech.2020-2519
LibreCat
| Files available
| DOI
2020 | Conference Paper | LibreCat-ID: 18651 |

Gburrek, T., Schmalenstroeer, J., Brendel, A., Kellermann, W., & Haeb-Umbach, R. (2020). Deep Neural Network based Distance Estimation for Geometry Calibration in Acoustic Sensor Network. European Signal Processing Conference (EUSIPCO).
LibreCat
| Files available
2020 | Conference Paper | LibreCat-ID: 20766 |

Kinoshita, K., von Neumann, T., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation. Proc. Interspeech 2020, 2652–2656. https://doi.org/10.21437/Interspeech.2020-2388
LibreCat
| Files available
| DOI
2020 | Conference Paper | LibreCat-ID: 20753 |

Ebbers, J., & Haeb-Umbach, R. (2020). Forward-Backward Convolutional Recurrent Neural Networks and Tag-Conditioned Convolutional Neural Networks for Weakly Labeled Semi-Supervised Sound Event Detection. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2020 Workshop (DCASE2020).
LibreCat
| Files available
2020 | Conference Paper | LibreCat-ID: 20695 |

Boeddeker, C., Nakatani, T., Kinoshita, K., & Haeb-Umbach, R. (2020). Jointly Optimal Dereverberation and Beamforming. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp40776.2020.9054393
LibreCat
| Files available
| DOI
2019 | Journal Article | LibreCat-ID: 17762
Haeb-Umbach, R. (2019). Lektionen für Alexa \& Co?! Forschung, 44(1), 12–15. https://doi.org/10.1002/fors.201970104
LibreCat
| DOI
2019 | Journal Article | LibreCat-ID: 19446 |

Drude, L., Heitkaemper, J., Boeddeker, C., & Haeb-Umbach, R. (2019). SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition. ArXiv E-Prints.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 11965 |

Drude, L., Heymann, J., & Haeb-Umbach, R. (2019). Unsupervised training of neural mask-based beamforming. In INTERSPEECH 2019, Graz, Austria.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 12874 |

Drude, L., Hasenklever, D., & Haeb-Umbach, R. (2019). Unsupervised Training of a Deep Clustering Model for Multichannel Blind Source Separation. In ICASSP 2019, Brighton, UK.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 12875 |

Heymann, J., Drude, L., Haeb-Umbach, R., Kinoshita, K., & Nakatani, T. (2019). Joint Optimization of Neural Network-based WPE Dereverberation and Acoustic Model for Robust Online ASR. In ICASSP 2019, Brighton, UK.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 12876 |

Kurz, G., Gilitschenski, I., Pfaff, F., Drude, L., Hanebeck, U. D., Haeb-Umbach, R., & Siegwart, R. Y. (2019). Directional Statistics and Filtering Using libDirectional. In Journal of Statistical Software 89(4).
LibreCat
| Files available
2019 | Journal Article | LibreCat-ID: 12890 |

Drude, L., & Haeb-Umbach, R. (2019). Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation. IEEE Journal of Selected Topics in Signal Processing. https://doi.org/10.1109/JSTSP.2019.2912565
LibreCat
| Files available
| DOI
2019 | Conference Paper | LibreCat-ID: 15816 |

Zorila, C., Boeddeker, C., Doddipatla, R., & Haeb-Umbach, R. (2019). An Investigation Into the Effectiveness of Enhancement in ASR Training and Test for Chime-5 Dinner Party Transcription. In ASRU 2019, Sentosa, Singapore.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 14822 |

Heitkaemper, J., Feher, T., Freitag, M., & Haeb-Umbach, R. (2019). A Study on Online Source Extraction in the Presence of Changing Speaker Positions. In International Conference on Statistical Language and Speech Processing 2019, Ljubljana, Slovenia.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 14824 |

Martin-Donas, J. M., Heitkaemper, J., Haeb-Umbach, R., Gomez, A. M., & Peinado, A. M. (2019). Multi-Channel Block-Online Source Extraction based on Utterance Adaptation. In INTERSPEECH 2019, Graz, Austria.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 14826 |

Kanda, N., Boeddeker, C., Heitkaemper, J., Fujita, Y., Horiguchi, S., & Haeb-Umbach, R. (2019). Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR. In INTERSPEECH 2019, Graz, Austria.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 13271 |

von Neumann, T., Kinoshita, K., Delcroix, M., Araki, S., Nakatani, T., & Haeb-Umbach, R. (2019). All-neural Online Source Separation, Counting, and Diarization for Meeting Analysis. In ICASSP 2019, Brighton, UK.
LibreCat
| Files available
2019 | Journal Article | LibreCat-ID: 15814 |

Haeb-Umbach, R., Watanabe, S., Nakatani, T., Bacchiani, M., Hoffmeister, B., Seltzer, M. L., Zen, H., & Souden, M. (2019). Speech Processing for Digital Home Assistance: Combining Signal Processing With Deep-Learning Techniques. IEEE Signal Processing Magazine, 36(6), 111–124. https://doi.org/10.1109/MSP.2019.2918706
LibreCat
| Files available
| DOI
2019 | Journal Article | LibreCat-ID: 19450 |

Haeb-Umbach, R. (2019). Lektionen für Alexa & Co?! DFG Forschung 1/2019, 12–15. https://doi.org/10.1002/fors.201970104
LibreCat
| Files available
| DOI
2019 | Conference Paper | LibreCat-ID: 15237 |

Gburrek, T., Glarner, T., Ebbers, J., Haeb-Umbach, R., & Wagner, P. (2019). Unsupervised Learning of a Disentangled Speech Representation for Voice Conversion. Proc. 10th ISCA Speech Synthesis Workshop, 81–86. https://doi.org/10.21437/SSW.2019-15
LibreCat
| Files available
| DOI
| Download (ext.)
2019 | Conference Paper | LibreCat-ID: 15794 |

Ebbers, J., & Haeb-Umbach, R. (2019). Convolutional Recurrent Neural Network and Data Augmentation for Audio Tagging with Noisy Labels and Minimal Supervision. DCASE2019 Workshop, New York, USA.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 15796 |

Ebbers, J., Drude, L., Haeb-Umbach, R., Brendel, A., & Kellermann, W. (2019). Weakly Supervised Sound Activity Detection and Event Classification in Acoustic Sensor Networks. CAMSAP 2019, Guadeloupe, West Indies.
LibreCat
| Files available
2019 | Conference Paper | LibreCat-ID: 15792 |

Nelus, A., Ebbers, J., Haeb-Umbach, R., & Martin, R. (2019). Privacy-preserving Variational Information Feature Extraction for Domestic Activity Monitoring Versus Speaker Identification. INTERSPEECH 2019, Graz, Austria.
LibreCat
| Files available
2018 | Conference Paper | LibreCat-ID: 11760 |

Ebbers, J., Nelus, A., Martin, R., & Haeb-Umbach, R. (2018). Evaluation of Modulation-MFCC Features and DNN Classification for Acoustic Event Detection. In DAGA 2018, München.
LibreCat
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 11835 |

Heymann, J., Drude, L., Haeb-Umbach, R., Kinoshita, K., & Nakatani, T. (2018). Frame-Online DNN-WPE Dereverberation. In IWAENC 2018, Tokio, Japan.
LibreCat
| Files available
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 11837 |

Heitkaemper, J., Heymann, J., & Haeb-Umbach, R. (2018). Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming. In ITG 2018, Oldenburg, Germany.
LibreCat
| Files available
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 11872 |

Drude, L., Boeddeker, C., Heymann, J., Kinoshita, K., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2018). Integration neural network based beamforming and weighted prediction error dereverberation. In INTERSPEECH 2018, Hyderabad, India.
LibreCat
| Files available
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 11873 |

Drude, L., Heymann, J., Boeddeker, C., & Haeb-Umbach, R. (2018). NARA-WPE: A Python package for weighted prediction error dereverberation in Numpy and Tensorflow for online and offline processing. In ITG 2018, Oldenburg, Germany.
LibreCat
| Files available
| Download (ext.)
2018 | Journal Article | LibreCat-ID: 11916 |

Despotovic, V., Walter, O., & Haeb-Umbach, R. (2018). Machine learning techniques for semantic analysis of dysarthric speech: An experimental study. Speech Communication 99 (2018) 242-251 (Elsevier B.V.).
LibreCat
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 12898 |

Drude, L., von Neumann, T., & Haeb-Umbach, R. (2018). Deep Attractor Networks for Speaker Re-Identifikation and Blind Source Separation. In ICASSP 2018, Calgary, Canada.
LibreCat
| Files available
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 12900 |

Drude, L., Higuchi, Takuya , Kinoshita, K., Nakatani, T., & Haeb-Umbach, R. (2018). Dual Frequency- and Block-Permutation Alignment for Deep Learning Based Block-Online Blind Source Separation. In ICASSP 2018, Calgary, Canada.
LibreCat
| Files available
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 12901 |

Boeddeker, C., Erdogan, H., Yoshioka, T., & Haeb-Umbach, R. (2018). Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition. In ICASSP 2018, Calgary, Canada.
LibreCat
| Files available
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 12899 |

Boeddeker, C., Heitkaemper, J., Schmalenstroeer, J., Drude, L., Heymann, J., & Haeb-Umbach, R. (2018). Front-End Processing for the CHiME-5 Dinner Party Scenario. Proc. CHiME 2018 Workshop on Speech Processing in Everyday Environments, Hyderabad, India.
LibreCat
| Files available
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 6859
Afifi, H., Schmalenstroeer, J., Ullmann, J., Haeb-Umbach, R., & Karl, H. (2018). MARVELO - A Framework for Signal Processing in Wireless Acoustic Sensor Networks. Speech Communication; 13th ITG-Symposium, 1–5.
LibreCat
2018 | Conference Paper | LibreCat-ID: 11747 |

Grimm, C., Breddermann, T., Farhoud, R., Fei, T., Warsitz, E., & Haeb-Umbach, R. (2018). Discrimination of Stationary from Moving Targets with Recurrent Neural Networks in Automotive Radar. International Conference on Microwaves for Intelligent Mobility (ICMIM) 2018.
LibreCat
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 11907 |

Glarner, T., Hanebrink, P., Ebbers, J., & Haeb-Umbach, R. (2018). Full Bayesian Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery. INTERSPEECH 2018, Hyderabad, India.
LibreCat
| Files available
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 11838 |

Schmalenstroeer, J., & Haeb-Umbach, R. (2018). Efficient Sampling Rate Offset Compensation - An Overlap-Save Based Approach. 26th European Signal Processing Conference (EUSIPCO 2018).
LibreCat
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 11876 |

Kitza, M., Michel, W., Boeddeker, C., Heitkaemper, J., Menne, T., Schlüter, R., Ney, H., Schmalenstroeer, J., Drude, L., Heymann, J., & Haeb-Umbach, R. (2018). The RWTH/UPB System Combination for the CHiME 2018 Workshop. Proc. CHiME 2018 Workshop on Speech Processing in Everyday Environments, Hyderabad, India.
LibreCat
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 11836 |

Ebbers, J., Heitkaemper, J., Schmalenstroeer, J., & Haeb-Umbach, R. (2018). Benchmarking Neural Network Architectures for Acoustic Sensor Networks. ITG 2018, Oldenburg, Germany.
LibreCat
| Files available
| Download (ext.)
2018 | Conference Paper | LibreCat-ID: 11839 |

Schmalenstroeer, J., & Haeb-Umbach, R. (2018). Insights into the Interplay of Sampling Rate Offsets and MVDR Beamforming. ITG 2018, Oldenburg, Germany.
LibreCat
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11717 |

Arora, P., & Haeb-Umbach, R. (2017). A Study on Transfer Learning for Acoustic Event Detection in a Real Life Scenario. In IEEE 19th International Workshop on Multimedia Signal Processing (MMSP).
LibreCat
| Files available
| Download (ext.)
2017 | Report | LibreCat-ID: 11735 |

Boeddeker, C., Hanebrink, P., Drude, L., Heymann, J., & Haeb-Umbach, R. (2017). On the Computation of Complex-valued Gradients with Application to Statistically Optimum Beamforming.
LibreCat
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11736 |

Boeddeker, C., Hanebrink, P., Drude, L., Heymann, J., & Haeb-Umbach, R. (2017). Optimizing Neural-Network Supported Acoustic Beamforming by Algorithmic Differentiation. In Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP).
LibreCat
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11737 |

Chinaev, A., & Haeb-Umbach, R. (2017). A Generalized Log-Spectral Amplitude Estimator for Single-Channel Speech Enhancement. In Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP).
LibreCat
| Files available
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11754 |

Drude, L., & Haeb-Umbach, R. (2017). Tight integration of spatial and spectral features for BSS with Deep Clustering embeddings. In INTERSPEECH 2017, Stockholm, Schweden.
LibreCat
| Files available
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11770 |

Glarner, T., Boenninghoff, B., Walter, O., & Haeb-Umbach, R. (2017). Leveraging Text Data for Word Segmentation for Underresourced Languages. In INTERSPEECH 2017, Stockholm, Schweden.
LibreCat
| Files available
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11809 |

Heymann, J., Drude, L., Boeddeker, C., Hanebrink, P., & Haeb-Umbach, R. (2017). BEAMNET: End-to-End Training of a Beamformer-Supported Multi-Channel ASR System. In Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP).
LibreCat
| Files available
| Download (ext.)
2017 | Journal Article | LibreCat-ID: 11811 |

Heymann, J., Drude, L., & Haeb-Umbach, R. (2017). A Generic Neural Acoustic Beamforming Architecture for Robust Multi-Channel Speech Processing. Computer Speech and Language.
LibreCat
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11763 |

Fei, T., Grimm, C., Farhoud, R., Breddermann, T., Warsitz, E., & Haeb-Umbach, R. (2017). A Novel Target Separation Algorithm Applied to The Two-Dimensional Spectrum for FMCW Automotive Radar Systems. IEEE International Conference on Microwave, Communications, Anthenas and Electronic Systems.
LibreCat
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11772 |

Grimm, C., Breddermann, T., Farhoud, R., Fei, T., Warsitz, E., & Haeb-Umbach, R. (2017). Hypothesis Test for the Detection of Moving Targets in Automotive Radar. IEEE International Conference on Microwave, Communications, Anthenas and Electronic Systems (COMCAS).
LibreCat
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11759 |

Ebbers, J., Heymann, J., Drude, L., Glarner, T., Haeb-Umbach, R., & Raj, B. (2017). Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery. INTERSPEECH 2017, Stockholm, Schweden.
LibreCat
| Files available
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11895 |

Schmalenstroeer, J., Heymann, J., Drude, L., Boeddeker, C., & Haeb-Umbach, R. (2017). Multi-Stage Coherence Drift Based Sampling Rate Synchronization for Acoustic Beamforming. IEEE 19th International Workshop on Multimedia Signal Processing (MMSP).
LibreCat
| Files available
| Download (ext.)
2017 | Conference Paper | LibreCat-ID: 11773 |

Grimm, C., Farhoud, R., Fei, T., Warsitz, E., & Haeb-Umbach, R. (2017). Detection of Moving Targets in Automotive Radar with Distorted Ego-Velocity Information. IEEE Microwaves, Radar and Remote Sensing Symposium (MRRS).
LibreCat
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11738 |

Chinaev, A., & Haeb-Umbach, R. (2016). A Priori SNR Estimation Using a Generalized Decision Directed Approach. In INTERSPEECH 2016, San Francisco, USA.
LibreCat
| Files available
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11743 |

Chinaev, A., Heitkaemper, J., & Haeb-Umbach, R. (2016). A Priori SNR Estimation Using Weibull Mixture Model. In 12. ITG Fachtagung Sprachkommunikation (ITG 2016).
LibreCat
| Files available
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11744 |

Chinaev, A., Heymann, J., Drude, L., & Haeb-Umbach, R. (2016). Noise-Presence-Probability-Based Noise PSD Estimation by Using DNNs. In 12. ITG Fachtagung Sprachkommunikation (ITG 2016).
LibreCat
| Files available
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11751 |

Drude, L., Boeddeker, C., & Haeb-Umbach, R. (2016). Blind Speech Separation based on Complex Spherical k-Mode Clustering. In Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP).
LibreCat
| Files available
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11756 |

Drude, L., Raj, B., & Haeb-Umbach, R. (2016). On the appropriateness of complex-valued neural networks for speech enhancement. In INTERSPEECH 2016, San Francisco, USA.
LibreCat
| Files available
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11771 |

Glarner, T., Mahdi Momenzadeh, M., Drude, L., & Haeb-Umbach, R. (2016). Factor Graph Decoding for Speech Presence Probability Estimation. In 12. ITG Fachtagung Sprachkommunikation (ITG 2016).
LibreCat
| Files available
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11812 |

Heymann, J., Drude, L., & Haeb-Umbach, R. (2016). Neural Network Based Spectral Mask Estimation for Acoustic Beamforming. In Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP).
LibreCat
| Files available
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11829 |

Jacob, F., & Haeb-Umbach, R. (2016). On the Bias of Direction of Arrival Estimation Using Linear Microphone Arrays. In 12. ITG Fachtagung Sprachkommunikation (ITG 2016).
LibreCat
| Files available
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11834 |

Heymann, J., Drude, L., & Haeb-Umbach, R. (2016). Wide Residual BLSTM Network with Discriminative Speaker Adaptation for Robust Speech Recognition. In Computer Speech and Language.
LibreCat
| Files available
| Download (ext.)
2016 | Journal Article | LibreCat-ID: 11840 |

Kinoshita, K., Delcroix, M., Gannot, S., Habets, E. A. P., Haeb-Umbach, R., Kellermann, W., … Yoshioka, T. (2016). A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research. EURASIP Journal on Advances in Signal Processing.
LibreCat
| Download (ext.)
2016 | Journal Article | LibreCat-ID: 11886
Plinge, A., Jacob, F., Haeb-Umbach, R., & Fink, G. A. (2016). Acoustic Microphone Geometry Calibration: An overview and experimental evaluation of state-of-the-art algorithms. IEEE Signal Processing Magazine, 33(4), 14–29. https://doi.org/10.1109/MSP.2016.2555198
LibreCat
| DOI
2016 | Conference Paper | LibreCat-ID: 11908 |

Menne, T., Heymann, J., Alexandridis, A., Irie, K., Zeyer, A., Kitza, M., … Mouchtaris, A. (2016). The RWTH/UPB/FORTH System Combination for the 4th CHiME Challenge Evaluation. In Computer Speech and Language.
LibreCat
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11920 |

Walter, O., & Haeb-Umbach, R. (2016). Unsupervised Word Discovery from Speech using Bayesian Hierarchical Models. In 38th German Conference on Pattern Recognition (GCPR 2016).
LibreCat
| Files available
| Download (ext.)
2016 | Conference Paper | LibreCat-ID: 11890 |

Schmalenstroeer, J., & Haeb-Umbach, R. (2016). Investigations into Bluetooth Low Energy Localization Precision Limits. 24th European Signal Processing Conference (EUSIPCO 2016).
LibreCat
| Files available
| Download (ext.)
2015 | Conference Paper | LibreCat-ID: 11739 |

Chinaev, A., & Haeb-Umbach, R. (2015). On Optimal Smoothing in Minimum Statistics Based Noise Tracking. In Interspeech 2015 (pp. 1785–1789).
LibreCat
| Files available
| Download (ext.)
2015 | Conference Paper | LibreCat-ID: 11748 |

Despotovic, V., Walter, O., & Haeb-Umbach, R. (2015). Semantic Analysis of Spoken Input using Markov Logic Networks. In INTERSPEECH 2015.
LibreCat
| Files available
| Download (ext.)
2015 | Conference Paper | LibreCat-ID: 11755 |

Drude, L., Jacob, F., & Haeb-Umbach, R. (2015). DOA-Estimation based on a Complex Watson Kernel Method. In 23th European Signal Processing Conference (EUSIPCO 2015).
LibreCat
| Files available
| Download (ext.)
2015 | Conference Paper | LibreCat-ID: 11810
Heymann, J., Drude, L., Chinaev, A., & Haeb-Umbach, R. (2015). BLSTM supported GEV Beamformer Front-End for the 3RD CHiME Challenge. In Automatic Speech Recognition and Understanding Workshop (ASRU 2015).
LibreCat
2015 | Conference Paper | LibreCat-ID: 11813 |

Heymann, J., Haeb-Umbach, R., Golik, P., & Schlueter, R. (2015). Unsupervised adaptation of a denoising autoencoder by Bayesian Feature Enhancement for reverberant asr under mismatch conditions. In Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on (pp. 5053–5057). https://doi.org/10.1109/ICASSP.2015.7178933
LibreCat
| DOI
| Download (ext.)
2015 | Journal Article | LibreCat-ID: 11830 |

Jacob, F., & Haeb-Umbach, R. (2015). Absolute Geometry Calibration of Distributed Microphone Arrays in an Audio-Visual Sensor Network. ArXiv E-Prints.
LibreCat
| Download (ext.)
2015 | Book | LibreCat-ID: 11868 |

Li, J., Deng, L., Haeb-Umbach, R., & Gong, Y. (2015). Robust Automatic Speech Recognition. Elsevier.
LibreCat
| Files available
| Download (ext.)
2015 | Conference Paper | LibreCat-ID: 11875 |

Marchi, E., Schuller, B., Baron-Cohen, S., Golan, O., Boelte, S., Arora, P., & Haeb-Umbach, R. (2015). Typicality and Emotion in the Voice of Children with Autism Spectrum Condition: Evidence Across Three Languages. In INTERSPEECH 2015.
LibreCat
| Download (ext.)
2015 | Conference Paper | LibreCat-ID: 11919 |

Walter, O., Drude, L., & Haeb-Umbach, R. (2015). Source Counting in Speech Mixtures by Nonparametric Bayesian Estimation of an infinite Gaussian Mixture Model. In 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2015).
LibreCat
| Files available
| Download (ext.)
2015 | Journal Article | LibreCat-ID: 11922 |

Walter, O., Haeb-Umbach, R., Mokbel, B., Paassen, B., & Hammer, B. (2015). Autonomous Learning of Representations. KI - Kuenstliche Intelligenz, 1–13. http://dx.doi.org/10.1007/s13218-015-0372-1
LibreCat
| DOI
| Download (ext.)
2015 | Report | LibreCat-ID: 11923 |

Walter, O., Haeb-Umbach, R., Strunk, J., & P. Himmelmann, N. (2015). Lexicon Discovery for Language Preservation using Unsupervised Word Segmentation with Pitman-Yor Language Models (FGNT-2015-01).
LibreCat
| Download (ext.)
2015 | Conference Paper | LibreCat-ID: 11874 |

Hoang, M. K., Schmalenstroeer, J., & Haeb-Umbach, R. (2015). Aligning training models with smartphone properties in WiFi fingerprinting based indoor localization. 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2015).
LibreCat
| Download (ext.)
2014 | Conference Paper | LibreCat-ID: 11746 |

Chinaev, A., Puels, M., & Haeb-Umbach, R. (2014). Spectral Noise Tracking for Improved Nonstationary Noise Robust ASR. In 11. ITG Fachtagung Sprachkommunikation (ITG 2014).
LibreCat
| Files available
| Download (ext.)
2014 | Conference Paper | LibreCat-ID: 11752 |

Drude, L., Chinaev, A., Tran Vu, D. H., & Haeb-Umbach, R. (2014). Source Counting in Speech Mixtures Using a Variational EM Approach for Complexwatson Mixture Models. In 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014).
LibreCat
| Files available
| Download (ext.)
2014 | Conference Paper | LibreCat-ID: 11753 |

Drude, L., Chinaev, A., Tran Vu, D. H., & Haeb-Umbach, R. (2014). Towards Online Source Counting in Speech Mixtures Applying a Variational EM for Complex Watson Mixture Models. In 14th International Workshop on Acoustic Signal Enhancement (IWAENC 2014) (pp. 213–217).
LibreCat
| Files available
| Download (ext.)
2014 | Conference Paper | LibreCat-ID: 11814 |

Heymann, J., Walter, O., Haeb-Umbach, R., & Raj, B. (2014). Iterative Bayesian Word Segmentation for Unspuervised Vocabulary Discovery from Phoneme Lattices. In 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014).
LibreCat
| Files available
| Download (ext.)
2014 | Conference Paper | LibreCat-ID: 11831 |

Jacob, F., & Haeb-Umbach, R. (2014). Coordinate Mapping Between an Acoustic and Visual Sensor Network in the Shape Domain for a Joint Self-Calibrating Speaker Tracking. In 11. ITG Fachtagung Sprachkommunikation (ITG 2014).
LibreCat
| Files available
| Download (ext.)
2014 | Journal Article | LibreCat-ID: 11861
Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2014). A New Observation Model in the Logarithmic Mel Power Spectral Domain for the Automatic Recognition of Noisy Reverberant Speech. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 22(1), 95–109. https://doi.org/10.1109/TASLP.2013.2285480
LibreCat
| DOI
2014 | Journal Article | LibreCat-ID: 11867 |

Li, J., Deng, L., Gong, Y., & Haeb-Umbach, R. (2014). An Overview of Noise-Robust Automatic Speech Recognition. IEEE Transactions on Audio, Speech and Language Processing, 22(4), 745–777. https://doi.org/10.1109/TASLP.2014.2304637
LibreCat
| DOI
| Download (ext.)
2014 | Conference Paper | LibreCat-ID: 11918 |

Walter, O., Despotovic, V., Haeb-Umbach, R., Gemmeke, J., Ons, B., & Van hamme, H. (2014). An Evaluation of Unsupervised Acoustic Model Training for a Dysarthric Speech Interface. In INTERSPEECH 2014.
LibreCat
| Files available
| Download (ext.)
2014 | Journal Article | LibreCat-ID: 11898 |

Schmalenstroeer, J., Jebramcik, P., & Haeb-Umbach, R. (2014). A combined hardware-software approach for acoustic sensor network synchronization . Signal Processing, 0. http://dx.doi.org/10.1016/j.sigpro.2014.06.030
LibreCat
| DOI
| Download (ext.)
2014 | Conference Paper | LibreCat-ID: 11897 |

Schmalenstroeer, J., Jebramcik, P., & Haeb-Umbach, R. (2014). A Gossiping Approach to Sampling Clock Synchronization in Wireless Acoustic Sensor Networks. 39th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014).
LibreCat
| Files available
| Download (ext.)
2014 | Conference Paper | LibreCat-ID: 11903 |

Schmalenstroeer, J., Zhao, W., & Haeb-Umbach, R. (2014). Online Observation Error Model Estimation for Acoustic Sensor Network Synchronization. 11. ITG Fachtagung Sprachkommunikation (ITG 2014).
LibreCat
| Files available
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11716
Abdelaziz, A. H., Zeiler, S., Kolossa, D., Leutnant, V., & Haeb-Umbach, R. (2013). GMM-based significance decoding. In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on (pp. 6827–6831). https://doi.org/10.1109/ICASSP.2013.6638984
LibreCat
| DOI
2013 | Conference Paper | LibreCat-ID: 11740 |

Chinaev, A., & Haeb-Umbach, R. (2013). MAP-based Estimation of the Parameters of a Gaussian Mixture Model in the Presence of Noisy Observations. In 38th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013) (pp. 3352–3356). https://doi.org/10.1109/ICASSP.2013.6638279
LibreCat
| Files available
| DOI
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11742 |

Chinaev, A., Haeb-Umbach, R., Taghia, J., & Martin, R. (2013). Improved Single-Channel Nonstationary Noise Tracking by an Optimized MAP-based Postprocessor. In 38th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013) (pp. 7477–7481). https://doi.org/10.1109/ICASSP.2013.6639116
LibreCat
| Files available
| DOI
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11762 |

Enzner, G., Schmid, D., & Haeb-Umbach, R. (2013). On the Acoustic Channel Identification in Multi-Microphone Systems via Adaptive Blind Signal Enhancement Techniques. In 21th European Signal Processing Conference (EUSIPCO 2013).
LibreCat
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11815 |

Heymann, J., Walter, O., Haeb-Umbach, R., & Raj, B. (2013). Unsupervised Word Segmentation from Noisy Input. In Automatic Speech Recognition and Understanding Workshop (ASRU 2013).
LibreCat
| Files available
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11816 |

Hoang, M. K., & Haeb-Umbach, R. (2013). Parameter estimation and classification of censored Gaussian data with application to WiFi indoor positioning. In 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013) (pp. 3721–3725). https://doi.org/10.1109/ICASSP.2013.6638353
LibreCat
| Files available
| DOI
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11841 |

Kinoshita, K., Delcroix, M., Yoshioka, T., Nakatani, T., Habets, E., Haeb-Umbach, R., … Raj, B. (2013). The reverb challenge: a common evaluation framework for dereverberation and recognition of reverberant speech. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (pp. 22–23).
LibreCat
| Download (ext.)
2013 | Journal Article | LibreCat-ID: 11862
Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2013). Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 21(8), 1640–1652. https://doi.org/10.1109/TASL.2013.2258013
LibreCat
| DOI
2013 | Conference Paper | LibreCat-ID: 11909 |

Tran Vu, D. H., & Haeb-Umbach, R. (2013). Blind Speech Separation Exploiting Temporal and Spectral Correlations Using Turbo Decoding of 2D-HMMs. In 21th European Signal Processing Conference (EUSIPCO 2013).
LibreCat
| Files available
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11917
Vu, D. H. T., & Haeb-Umbach, R. (2013). Using the turbo principle for exploiting temporal and spectral correlations in speech presence probability estimation. In 38th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013) (pp. 863–867). https://doi.org/10.1109/ICASSP.2013.6637771
LibreCat
| DOI
2013 | Conference Paper | LibreCat-ID: 11921 |

Walter, O., Haeb-Umbach, R., Chaudhuri, S., & Raj, B. (2013). Unsupervised Word Discovery from Phonetic Input Using Nested Pitman-Yor Language Modeling. In IEEE International Conference on Robotics and Automation (ICRA 2013).
LibreCat
| Files available
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11924 |

Walter, O., Korthals, T., Haeb-Umbach, R., & Raj, B. (2013). Hierarchical System for Word Discovery Exploiting DTW-Based Initialization. In Automatic Speech Recognition and Understanding Workshop (ASRU 2013).
LibreCat
| Files available
| Download (ext.)
2013 | Report | LibreCat-ID: 11926 |

Walter, O., Schmalenstroeer, J., & Haeb-Umbach, R. (2013). A Novel Initialization Method for Unsupervised Learning of Acoustic Patterns in Speech (FGNT-2013-01).
LibreCat
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11832 |

Jacob, F., Schmalenstroeer, J., & Haeb-Umbach, R. (2013). DoA-Based Microphone Array Position Self-Calibration Using Circular Statistic. 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013), 116–120. https://doi.org/10.1109/ICASSP.2013.6637620
LibreCat
| Files available
| DOI
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11891 |

Schmalenstroeer, J., & Haeb-Umbach, R. (2013). Sampling Rate Synchronisation in Acoustic Sensor Networks with a Pre-Trained Clock Skew Error Model. 21th European Signal Processing Conference (EUSIPCO 2013).
LibreCat
| Files available
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11818 |

Hoang, M. K., Schmitz, S., Drueke, C., Vu, D. H. T., Schmalenstroeer, J., & Haeb-Umbach, R. (2013). Server based indoor navigation using RSSI and inertial sensor information. Positioning Navigation and Communication (WPNC), 2013 10th Workshop On, 1–6. https://doi.org/10.1109/WPNC.2013.6533263
LibreCat
| Files available
| DOI
| Download (ext.)
2013 | Conference Paper | LibreCat-ID: 11817 |

Hoang, M. K., Schmalenstroeer, J., Drueke, C., Tran Vu, D. H., & Haeb-Umbach, R. (2013). A Hidden Markov Model for Indoor User Tracking Based on WiFi Fingerprinting and Step Detection. 21th European Signal Processing Conference (EUSIPCO 2013).
LibreCat
| Files available
| Download (ext.)
2012 | Conference Paper | LibreCat-ID: 11741 |

Chinaev, A., & Haeb-Umbach, R. (2012). Quality Analysis and Optimization of the MAP-based Noise Power Spectral Density Tracker. In Speech Communication; 10. ITG Symposium; Proceedings.
LibreCat
| Files available
| Download (ext.)
2012 | Conference Paper | LibreCat-ID: 11745 |

Chinaev, A., Krueger, A., Tran Vu, D. H., & Haeb-Umbach, R. (2012). Improved Noise Power Spectral Density Tracking by a MAP-based Postprocessor. In 37th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2012).
LibreCat
| Files available
| Download (ext.)
2012 | Book Chapter | LibreCat-ID: 11844
Krueger, A., & Haeb-Umbach, R. (2012). Reverberant Speech Recognition. In Techniques for Noise Robustness in Automatic Speech Recognition. Wiley.
LibreCat
2012 | Conference Paper | LibreCat-ID: 11849 |

Krueger, A., Walter, O., Leutnant, V., & Haeb-Umbach, R. (2012). Bayesian Feature Enhancement for ASR of Noisy Reverberant Real-World Data. In Proc. Interspeech. Portland, USA.
LibreCat
| Download (ext.)
2012 | Journal Article | LibreCat-ID: 11863 |

Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2012). Investigations Into a Statistical Observation Model for Logarithmic Mel Power Spectral Density Features of Noisy Reverberant Speech. Speech Communication; 10. ITG Symposium; Proceedings Of, 1–4.
LibreCat
| Download (ext.)
2012 | Conference Paper | LibreCat-ID: 11864 |

Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2012). A Statistical Observation Model For Noisy Reverberant Speech Features and its Application to Robust ASR. In Signal Processing, Communications and Computing (ICSPCC), 2012 IEEE International Conference on.
LibreCat
| Download (ext.)
2012 | Report | LibreCat-ID: 11865 |

Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2012). Derivation of the Power Compensation Constant in the Observation Model for Reverberant Speech in the Logarithmic Mel Power Spectral Domain.
LibreCat
| Download (ext.)
2012 | Conference Paper | LibreCat-ID: 11910
Tran Vu, D. H., & Haeb-Umbach, R. (2012). Exploiting Temporal Correlations in Joint Multichannel Speech Separation and Noise Suppression using Hidden Markov Models. In International Workshop on Acoustic Signal Enhancement (IWAENC2012).
LibreCat
2012 | Conference Paper | LibreCat-ID: 11833 |

Jacob, F., Schmalenstroeer, J., & Haeb-Umbach, R. (2012). Microphone Array Position Self-Calibration from Reverberant Speech Input. International Workshop on Acoustic Signal Enhancement (IWAENC 2012).
LibreCat
| Files available
| Download (ext.)
2012 | Conference Paper | LibreCat-ID: 11925 |

Walter, O., Schmalenstroeer, J., Engler, A., & Haeb-Umbach, R. (2012). Smartphone-Based Sensor Fusion for Improved Vehicular Navigation. 9th Workshop on Positioning Navigation and Communication (WPNC 2012).
LibreCat
| Download (ext.)
2011 | Conference Paper | LibreCat-ID: 11721 |

Bevermeier, M., Flanke, S., Haeb-Umbach, R., & Stehr, J. (2011). A Platform for efficient Supply Chain Management Support in Logistics. In International Workshop on Intelligent Transportation (WIT 2011).
LibreCat
| Download (ext.)
2011 | Book Chapter | LibreCat-ID: 11774
Haeb-Umbach, R. (2011). Uncertainty Decoding and Conditional Bayesian Estimation. In R. Haeb-Umbach & D. Kolossa (Eds.), Robust Speech Recognition of Uncertain or Missing Data. Springer.
LibreCat
2011 | Book Chapter | LibreCat-ID: 11775
Haeb-Umbach, R. (2011). Können Computer sprechen und hören, sollen sie es überhaupt können? Sprachverarbeitung und ambiente Intelligenz. In Baustelle Informationsgesellschaft und Universität heute. Ferdinand Schoeningh Verlag, Paderborn.
LibreCat
2011 | Journal Article | LibreCat-ID: 11807
Herbig, T., Gerl, F., Minker, W., & Haeb-Umbach, R. (2011). Adaptive Systems for Unsupervised Speaker Tracking and Speech Recognition. Evolving Systems, 2(3), 199–214.
LibreCat
2011 | Book Chapter | LibreCat-ID: 11843
Krueger, A., & Haeb-Umbach, R. (2011). A Model-Based Approach to Joint Compensation of Noise and Reverberation for Speech Recognition. In R. Haeb-Umbach & D. Kolossa (Eds.), Robust Speech Recognition of Uncertain or Missing Data. Springer.
LibreCat
2011 | Conference Paper | LibreCat-ID: 11845 |

Krueger, A., & Haeb-Umbach, R. (2011). MAP-based estimation of the parameters of non-stationary Gaussian processes from noisy observations. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2011) (pp. 3596–3599). https://doi.org/10.1109/ICASSP.2011.5946256
LibreCat
| DOI
| Download (ext.)
2011 | Journal Article | LibreCat-ID: 11850 |

Krueger, A., Warsitz, E., & Haeb-Umbach, R. (2011). Speech Enhancement With a GSC-Like Structure Employing Eigenvector-Based Transfer Function Ratios Estimation. IEEE Transactions on Audio, Speech, and Language Processing, 19(1), 206–219. https://doi.org/10.1109/TASL.2010.2047324
LibreCat
| DOI
| Download (ext.)
2011 | Book Chapter | LibreCat-ID: 11856
Leutnant, V., & Haeb-Umbach, R. (2011). Conditional Bayesian Estimation Employing a Phase-Sensitive Observation Model for Noise Robust Speech Recognition. In R. Haeb-Umbach & D. Kolossa (Eds.), Robust Speech Recognition of Uncertain or Missing Data. Springer.
LibreCat
2011 | Conference Paper | LibreCat-ID: 11866 |

Leutnant, V., Krueger, A., & Haeb-Umbach, R. (2011). A versatile Gaussian splitting approach to non-linear state estimation and its application to noise-robust ASR. In Interspeech 2011.
LibreCat
| Download (ext.)
2011 | Conference Paper | LibreCat-ID: 11911 |

Tran Vu, D. H., & Haeb-Umbach, R. (2011). On Initial Seed Selection for Frequency Domain Blind Speech Separation. In Interspeech 2011.
LibreCat
| Download (ext.)
2011 | Book (Editor) | LibreCat-ID: 11945 |

Kolossa, D., & Haeb-Umbach, R. (Eds.). (2011). Robust Speech Recognition of Uncertain or Missing Data --- Theory and Applications. Springer.
LibreCat
| Download (ext.)
2011 | Conference Paper | LibreCat-ID: 11889 |

Schmalenstroeer, J., Bartek, M., & Haeb-Umbach, R. (2011). Unsupervised learning of acoustic events using dynamic time warping and hierarchical K-means++ clustering. Interspeech 2011.
LibreCat
| Download (ext.)
2011 | Conference Paper | LibreCat-ID: 11896 |

Schmalenstroeer, J., Jacob, F., Haeb-Umbach, R., Hennecke, M., & Fink, G. A. (2011). Unsupervised Geometry Calibration of Acoustic Sensor Networks Using Source Correspondences. Interspeech 2011.
LibreCat
| Download (ext.)
2011 | Conference Paper | LibreCat-ID: 9456 |

Schmalenstroeer, J., Bartek, M., & Haeb-Umbach, R. (2011). Investigations into Features for Robust Classification into Broad Acoustic Categories. 37. Deutsche Jahrestagung Fuer Akustik (DAGA 2011).
LibreCat
| Download (ext.)
2010 | Conference Paper | LibreCat-ID: 11726 |

Bevermeier, M., Walter, O., Peschke, S., & Haeb-Umbach, R. (2010). Barometric height estimation combined with map-matching in a loosely-coupled Kalman-filter. In 7th Workshop on Positioning Navigation and Communication (WPNC 2010) (pp. 128–134). https://doi.org/10.1109/WPNC.2010.5650745
LibreCat
| DOI
| Download (ext.)
2010 | Journal Article | LibreCat-ID: 11846 |

Krueger, A., & Haeb-Umbach, R. (2010). Model-Based Feature Enhancement for Reverberant Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 18(7), 1692–1707. https://doi.org/10.1109/TASL.2010.2049684
LibreCat
| DOI
| Download (ext.)
2010 | Conference Paper | LibreCat-ID: 11857 |

Leutnant, V., & Haeb-Umbach, R. (2010). Options for Modelling Temporal Statistical Dependencies in an Acoustic Model for ASR. In 36. Deutsche Jahrestagung fuer Akustik (DAGA 2010).
LibreCat
| Download (ext.)
2010 | Conference Paper | LibreCat-ID: 11858 |

Leutnant, V., & Haeb-Umbach, R. (2010). On the Exploitation of Hidden Markov Models and Linear Dynamic Models in a Hybrid Decoder Architecture for Continuous Speech Recognition. In Interspeech 2010.
LibreCat
| Download (ext.)
2010 | Conference Paper | LibreCat-ID: 11887 |

Raj, B., Wilson, K. W., Krueger, A., & Haeb-Umbach, R. (2010). Ungrounded Independent Non-Negative Factor Analysis. In Interspeech 2010.
LibreCat
| Download (ext.)
2010 | Conference Paper | LibreCat-ID: 11912 |

Tran Vu, D. H., & Haeb-Umbach, R. (2010). An EM Approach to Integrated Multichannel Speech Separation and Noise Suppression. In International Workshop on Acoustic Echo and Noise Control (IWAENC 2010).
LibreCat
| Download (ext.)
2010 | Conference Paper | LibreCat-ID: 11913 |

Tran Vu, D. H., & Haeb-Umbach, R. (2010). Blind speech separation employing directional statistics in an Expectation Maximization framework. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010) (pp. 241–244). https://doi.org/10.1109/ICASSP.2010.5495994
LibreCat
| DOI
| Download (ext.)
2010 | Journal Article | LibreCat-ID: 11892 |

Schmalenstroeer, J., & Haeb-Umbach, R. (2010). Online Diarization of Streaming Audio-Visual Data for Smart Environments. IEEE Journal of Selected Topics in Signal Processing, 4(5), 845–856. https://doi.org/10.1109/JSTSP.2010.2050519
LibreCat
| DOI
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11723 |

Bevermeier, M., Peschke, S., & Haeb-Umbach, R. (2009). Robust vehicle localization based on multi-level sensor fusion and online parameter estimation. In 6th Workshop on Positioning Navigation and Communication (WPNC 2009) (pp. 235–242). https://doi.org/10.1109/WPNC.2009.4907833
LibreCat
| DOI
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11724 |

Bevermeier, M., Peschke, S., & Haeb-Umbach, R. (2009). Joint Parameter Estimation and Tracking in a Multi-Stage Kalman Filter for Vehicle Positioning. In IEEE 69th Vehicular Technology Conference (VTC 2009 Spring) (pp. 1–5). https://doi.org/10.1109/VETECS.2009.5073634
LibreCat
| DOI
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11725 |

Bevermeier, M., Peschke, S., & Haeb-Umbach, R. (2009). Eine Plattform fuer Mehrwertdienste im Bereich Logistik - Drahtlose Fahrzeug- und Laderaumueberwachung fuer LKW mit Hilfe einer Maut-On-Board Unit. In DGON Navigationskonvent 2009.
LibreCat
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11847 |

Krueger, A., & Haeb-Umbach, R. (2009). Model based feature enhancement for automatic speech recognition in reverberant environments. In Interspeech 2009.
LibreCat
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11859 |

Leutnant, V., & Haeb-Umbach, R. (2009). On the Estimation and Use of Feature Reliability Information for Noise Robust Speech Recognition. In International Conference on Acoustics (NAG/DAGA 2009).
LibreCat
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11860 |

Leutnant, V., & Haeb-Umbach, R. (2009). An analytic derivation of a phase-sensitive observation model for noise robust speech recognition. In Interspeech 2009.
LibreCat
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11881 |

Peschke, S., Bevermeier, M., & Haeb-Umbach, R. (2009). A GPS positioning approach exploiting GSM velocity estimates. In 6th Workshop on Positioning Navigation and Communication (WPNC 2009) (pp. 195–202). https://doi.org/10.1109/WPNC.2009.4907827
LibreCat
| DOI
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11882 |

Peschke, S., Bevermeier, M., & Haeb-Umbach, R. (2009). Verbesserung von GPS-basierter Ortung durch GSM-Geschwindigkeitsschaetzungen. In DGON Navigationskonvent 2009.
LibreCat
| Download (ext.)
2009 | Journal Article | LibreCat-ID: 11937 |

Windmann, S., & Haeb-Umbach, R. (2009). Approaches to Iterative Speech Feature Enhancement and Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 17(5), 974–984. https://doi.org/10.1109/TASL.2009.2014894
LibreCat
| DOI
| Download (ext.)
2009 | Journal Article | LibreCat-ID: 11938 |

Windmann, S., & Haeb-Umbach, R. (2009). Parameter Estimation of a State-Space Model of Noise for Robust Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 17(8), 1577–1590. https://doi.org/10.1109/TASL.2009.2023172
LibreCat
| DOI
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11900 |

Schmalenstroeer, J., Leutnant, V., & Haeb-Umbach, R. (2009). Audio-Visual Data Processing for Ambient Communication. 1st International Workshop on Distributed Computing in Ambient Environments within 32nd Annual Conference on Artificial Intelligence.
LibreCat
| Files available
2009 | Conference Paper | LibreCat-ID: 11806 |

Hennecke, M., Ploetz, T., Fink, G. A., Schmalenstroeer, J., & Haeb-Umbach, R. (2009). A hierarchical approach to unsupervised shape calibration of microphone array networks. IEEE/SP 15th Workshop on Statistical Signal Processing (SSP 2009), 257–260. https://doi.org/10.1109/SSP.2009.5278589
LibreCat
| DOI
| Download (ext.)
2009 | Conference Paper | LibreCat-ID: 11899 |

Schmalenstroeer, J., Kelling, M., Leutnant, V., & Haeb-Umbach, R. (2009). Fusing Audio and Video Information for Online Speaker Diarization. Interspeech 2009.
LibreCat
| Download (ext.)
2008 | Journal Article | LibreCat-ID: 11776 |

Haeb-Umbach, R. (2008). Uncertainty Decoding in Automatic Speech Recognition. 2008 ITG Conference on Voice Communication (SprachKommunikation), 1–7.
LibreCat
| Download (ext.)
2008 | Book Chapter | LibreCat-ID: 11789 |

Haeb-Umbach, R., & Ion, V. (2008). Error Concealement. In B. Lindenberg & Z.-H. Tan (Eds.), Automatic Speech Recognition on Mobile Devices and over Communication Networks (Vol. Advances in Computer Vision and Pattern Recognition, pp. 187–210). Springer.
LibreCat
| Download (ext.)
2008 | Journal Article | LibreCat-ID: 11820 |

Ion, V., & Haeb-Umbach, R. (2008). A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 16(5), 1047–1060. https://doi.org/10.1109/TASL.2008.925879
LibreCat
| DOI
| Download (ext.)
2008 | Journal Article | LibreCat-ID: 11821 |

Ion, V., & Haeb-Umbach, R. (2008). Investigations into Uncertainty Decoding Employing a Discrete Feature Space for Noise Robust Automatic Speech Recognition. 2008 ITG Conference on Voice Communication (SprachKommunikation), 1–4.
LibreCat
| Download (ext.)
2008 | Conference Paper | LibreCat-ID: 11851 |

Krueger, A., Warsitz, E., & Haeb-Umbach, R. (2008). Blinde Akustische Strahlformung fuer Anwendungen im KFZ. In 34. Deutsche Jahrestagung fuer Akustik (DAGA 2008).
LibreCat
| Download (ext.)
2008 | Journal Article | LibreCat-ID: 11914 |

Tran Vu, D. H., & Haeb-Umbach, R. (2008). Blind Speech Separation in Presence of Correlated Noise with Generalized Eigenvector Beamforming. 2008 ITG Conference on Voice Communication (SprachKommunikation), 1–4.
LibreCat
| Download (ext.)
2008 | Conference Paper | LibreCat-ID: 11915 |

Tran Vu, D. H., Krueger, A., & Haeb-Umbach, R. (2008). Generalized Eigenvector Blind Speech Separation Under Coherent Noise In A GSC Configuration. In International Workshop on Acoustic Echo and Noise Control (IWAENC 2008).
LibreCat
| Download (ext.)
2008 | Conference Paper | LibreCat-ID: 11935 |

Warsitz, E., Krueger, A., & Haeb-Umbach, R. (2008). Speech enhancement with a new generalized eigenvector blocking matrix for application in a generalized sidelobe canceller. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008) (pp. 73–76). https://doi.org/10.1109/ICASSP.2008.4517549
LibreCat
| DOI
| Download (ext.)
2008 | Conference Paper | LibreCat-ID: 11939 |

Windmann, S., & Haeb-Umbach, R. (2008). Modeling the dynamics of speech and noise for speech feature enhancement in ASR. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008) (pp. 4409–4412). https://doi.org/10.1109/ICASSP.2008.4518633
LibreCat
| DOI
| Download (ext.)
2008 | Journal Article | LibreCat-ID: 11940 |

Windmann, S., & Haeb-Umbach, R. (2008). A novel approach to noise estimation in model-based speech feature enhancement. 2008 ITG Conference on Voice Communication (SprachKommunikation), 1–4.
LibreCat
| Download (ext.)
2008 | Journal Article | LibreCat-ID: 11944 |

Windmann, S., Haeb-Umbach, R., & Leutnant, V. (2008). A segmental HMM based on a modified emission probability. 2008 ITG Conference on Voice Communication (SprachKommunikation), 1–4.
LibreCat
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11720 |

Bevermeier, M., Ebel, T., & Haeb-Umbach, R. (2007). Channel Estimation by Exploiting Sublayer Information in OFDM Systems. In Multi-Carrier Spread Spectrum 2007.
LibreCat
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11722 |

Bevermeier, M., & Haeb-Umbach, R. (2007). Combined Time and Frequency Domain OFDM Channel Estimation. In Multi-Carrier Spread Spectrum 2007.
LibreCat
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11785 |

Haeb-Umbach, R., & Bevermeier, M. (2007). OFDM Channel Estimation Based on Combined Estimation in Time and Frequency Domain. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007) (Vol. 3, pp. III-277-III–280). https://doi.org/10.1109/ICASSP.2007.366526
LibreCat
| DOI
| Download (ext.)
2007 | Journal Article | LibreCat-ID: 11799 |

Haeb-Umbach, R., & Peschke, S. (2007). A Novel Similarity Measure for Positioning Cellular Phones by a Comparison With a Database of Signal Power Levels. IEEE Transactions on Vehicular Technology, 56(1), 368–372. https://doi.org/10.1109/TVT.2006.889563
LibreCat
| DOI
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11822 |

Ion, V., & Haeb-Umbach, R. (2007). Multi-Resolution Soft Features for Channel-Robust Distributed Speech Recognition. In Interspeech 2007.
LibreCat
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11883 |

Peschke, S., & Haeb-Umbach, R. (2007). Velocity Estimation of Mobile Terminals by Exploiting GSM Downlink Signalling. In 4th Workshop on Positioning Navigation and Communication (WPNC 2007) (pp. 217–222). https://doi.org/10.1109/WPNC.2007.353637
LibreCat
| DOI
| Download (ext.)
2007 | Journal Article | LibreCat-ID: 11927 |

Warsitz, E., & Haeb-Umbach, R. (2007). Blind Acoustic Beamforming Based on Generalized Eigenvalue Decomposition. IEEE Transactions on Audio, Speech, and Language Processing, 15(5), 1529–1539. https://doi.org/10.1109/TASL.2007.898454
LibreCat
| DOI
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11934 |

Warsitz, E., Haeb-Umbach, R., & Tran Vu, D. H. (2007). Blind Adaptive Principal Eigenvector Beamforming for Acoustical Source Separation. In Interspeech 2007.
LibreCat
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11941 |

Windmann, S., & Haeb-Umbach, R. (2007). An Approach to Iterative Speech Feature Enhancement and Recognition. In Interspeech 2007.
LibreCat
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11893 |

Schmalenstroeer, J., & Haeb-Umbach, R. (2007). Joint Speaker Segmentation, Localization and Identification for Streaming Audio. Interspeech 2007.
LibreCat
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11901 |

Schmalenstroeer, J., Leutnant, V., & Haeb-Umbach, R. (2007). Amigo Context Management Service with Applications in Ambient Communication Scenarios. AMI-07 - European Conference on Ambient Intelligence.
LibreCat
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11933 |

Warsitz, E., Haeb-Umbach, R., & Schmalenstroeer, J. (2007). Zweistufige Sprache/Pause-Detektion in stark gestoerter Umgebung. 33. Deutsche Jahrestagung Fuer Akustik (DAGA 2007).
LibreCat
| Download (ext.)
2007 | Conference Paper | LibreCat-ID: 11902 |

Schmalenstroeer, J., Warsitz, E., & Haeb-Umbach, R. (2007). Projekt Amigo - Sprachsignalverarbeitung im vernetzten Haus. 33. Deutsche Jahrestagung Fuer Akustik (DAGA 2007).
LibreCat
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11823 |

Ion, V., & Haeb-Umbach, R. (2006). Comparison of Decoder-based Transmission Error Compensation Techniques for Distributed Speech Recognition. In 7. ITG-Fachtagung Sprachkommunikation.
LibreCat
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11824 |

Ion, V., & Haeb-Umbach, R. (2006). An Inexpensive Packet Loss Compensation Scheme for Distributed Speech Recognition Based on Soft-Features. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2006) (Vol. 1, p. I). https://doi.org/10.1109/ICASSP.2006.1659984
LibreCat
| DOI
| Download (ext.)
2006 | Journal Article | LibreCat-ID: 11825 |

Ion, V., & Haeb-Umbach, R. (2006). Uncertainty decoding for distributed speech recognition over error-prone networks. Speech Communication, 48(11), 1435–1446. https://doi.org/10.1016/j.specom.2006.03.007
LibreCat
| DOI
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11826 |

Ion, V., & Haeb-Umbach, R. (2006). Improved Source Modeling and Predictive Classification for Channel Robust Speech Recognition. In Interspeech 2006.
LibreCat
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11884 |

Peschke, S., & Haeb-Umbach, R. (2006). A Probabilistic Similarity Measure and a Non-Linear Post-Filter for Mobile Phone Positioning using GSM Signal Power Measurements. In European Navigation Conference \& Exhibition (ENC 2006).
LibreCat
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11885 |

Peschke, S., & Haeb-Umbach, R. (2006). Particle Filtering of Database assisted Positioning Estimates using a novel Similarity Measure for GSM Signal Power Level Measurements. In 3rd Workshop on Positioning Navigation and Communication (WPNC 2006).
LibreCat
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11928 |

Warsitz, E., & Haeb-Umbach, R. (2006). Mehrkanalige Sprachsignalverarbeitung durch adaptives Eigenbeamforming fuer Freisprecheinrichtungen im Kraftfahrzeug. In 32. Deutsche Jahrestagung fuer Akustik (DAGA 2006).
LibreCat
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11929 |

Warsitz, E., & Haeb-Umbach, R. (2006). Controlling Speech Distortion in Adaptive Frequency-Domain Principal Eigenvector Beamforming. In International Workshop on Acoustic Echo and Noise Control (IWAENC 2006).
LibreCat
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11942 |

Windmann, S., & Haeb-Umbach, R. (2006). Einkanalige Sprachsignalverbesserung mit Hilfe eines marginalisierten Partikelfilters. In 7. ITG-Fachtagung Sprachkommunikation.
LibreCat
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11943 |

Windmann, S., & Haeb-Umbach, R. (2006). Iterative Speech Enhancement using a Non-Linear Dynamic State Model of Speech and its Parameters. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2006) (Vol. 1, p. I). https://doi.org/10.1109/ICASSP.2006.1660058
LibreCat
| DOI
| Download (ext.)
2006 | Conference Paper | LibreCat-ID: 11894 |

Schmalenstroeer, J., & Haeb-Umbach, R. (2006). Online Speaker Change Detection by Combining BIC with Microphone Array Beamforming. Interspeech 2006.
LibreCat
| Download (ext.)
2005 | Conference Paper | LibreCat-ID: 11803 |

Haeb-Umbach, R., & Warsitz, E. (2005). Adaptive Filter-and-Sum Beamforming in Spatially Correlated Noise. In International Workshop on Acoustic Echo and Noise Control (IWAENC 2005).
LibreCat
| Download (ext.)
2005 | Conference Paper | LibreCat-ID: 11827 |

Ion, V., & Haeb-Umbach, R. (2005). A Unified Probabilistic Approach to Error Concealment for Distributed Speech Recognition. In Interspeech 2005.
LibreCat
| Download (ext.)
2005 | Conference Paper | LibreCat-ID: 11828 |

Ion, V., & Haeb-Umbach, R. (2005). A Comparison of Soft-Feature Distributed Speech Recognition with Candidate Codecs for Speech Enabled Mobile Services. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2005) (Vol. 1, pp. 333–336). https://doi.org/10.1109/ICASSP.2005.1415118
LibreCat
| DOI
| Download (ext.)
2005 | Conference Paper | LibreCat-ID: 11930 |

Warsitz, E., & Haeb-Umbach, R. (2005). Acoustic filter-and-sum beamforming by adaptive principal component analysis. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2005) (Vol. 4, p. iv/797-iv/800 Vol. 4). https://doi.org/10.1109/ICASSP.2005.1416129
LibreCat
| DOI
| Download (ext.)
2005 | Conference Paper | LibreCat-ID: 11802 |

Haeb-Umbach, R., & Schmalenstroeer, J. (2005). Speech Processing in the Networked Home Environment - A View on the Amigo Project. Interspeech 2005.
LibreCat
| Download (ext.)
2005 | Conference Paper | LibreCat-ID: 11801 |

Haeb-Umbach, R., & Schmalenstroeer, J. (2005). A Comparison of Particle Filtering Variants for Speech Feature Enhancement. Interspeech 2005.
LibreCat
| Download (ext.)
2004 | Journal Article | LibreCat-ID: 11732 |

Bischoff, R., Haeb-Umbach, R., & Nammi, S. R. (2004). Multipath-Resistant Time of Arrival Estimation for Satellite Positioning. AEUe, Int. Journal on Electronics and Communications, 58(1).
LibreCat
| Download (ext.)
2004 | Conference Paper | LibreCat-ID: 11790 |

Haeb-Umbach, R., & Ion, V. (2004). Soft Features for Improved Distributed Speech Recognition over Wireless Networks. In International Conference on Spoken Language Processing (ICSLP 2004).
LibreCat
| Download (ext.)
2004 | Conference Paper | LibreCat-ID: 11931 |

Warsitz, E., & Haeb-Umbach, R. (2004). Robust speaker direction estimation with particle filtering. In IEEE Workshop on Multimedia Signal Processing (MMSP 2004) (pp. 367–370). https://doi.org/10.1109/MMSP.2004.1436569
LibreCat
| DOI
| Download (ext.)
2004 | Conference Paper | LibreCat-ID: 11932 |

Warsitz, E., Haeb-Umbach, R., & Peschke, S. (2004). Adaptive Beamforming Combined with Particle Filtering for Acoustic Source Localization. In International Conference on Spoken Language Processing (ICSLP 2004).
LibreCat
| Download (ext.)
2003 | Journal Article | LibreCat-ID: 11777 |

Haeb-Umbach, R. (2003). Auf ein Wort - Moeglichkeiten und Grenzen der automatischen Spracherkennung. Forschungsforum Paderborn, 68–71.
LibreCat
| Download (ext.)
2002 | Journal Article | LibreCat-ID: 11727 |

Beyerlein, P., Aubert, X., Haeb-Umbach, R., Harris, M., Klakow, D., Wendemuth, A., … Sixtus, A. (2002). Large Vocabulary Continuous Speech Recognition of Broadcast News - The Philips/RWTH Approach. Speech Communication, (37), 109–131.
LibreCat
| Download (ext.)
2002 | Conference Paper | LibreCat-ID: 11731 |

Bischoff, R., Haeb-Umbach, R., & Heinrichs, G. (2002). A Joint Time Multiplex Receiver for UMTS and Galileo. In ION-GPS 2002.
LibreCat
| Download (ext.)
2002 | Conference Paper | LibreCat-ID: 11733 |

Bischoff, R., Haeb-Umbach, R., Schulz, W., & Heinrichs, G. (2002). Employment of a multipath receiver structure in a combined GALILEO/UMTS receiver. In IEEE 55th Vehicular Technology Conference (VTC 2002 Spring) (Vol. 4, pp. 1844–1848 vol.4). https://doi.org/10.1109/VTC.2002.1002940
LibreCat
| DOI
| Download (ext.)
2002 | Conference Paper | LibreCat-ID: 11808 |

Hesse, T., Bischoff, R., Schulz, W., & Haeb-Umbach, R. (2002). Estimation of Bias Location Error due to Absence of the LOS-Signal in a UMTS-System. In International Symposium on Location Based Services for Cellular Users (LOCELLUS 2002).
LibreCat
| Download (ext.)
2001 | Conference Paper | LibreCat-ID: 11734 |

Bischoff, R., Haeb-Umbach, R., Schulz, W., & Heinrichs, G. (2001). Implementation of a Rake Receiver Architecture into a Galileo Receiver. In 1st ESA Workshop on Satellite Navigation User Equipment Technology (Navitec 2001).
LibreCat
| Download (ext.)
2001 | Journal Article | LibreCat-ID: 11778 |

Haeb-Umbach, R. (2001). Automatic generation of phonetic regression class trees for MLLR adaptation. IEEE Transactions on Speech and Audio Processing, 9(3), 299–302. https://doi.org/10.1109/89.906003
LibreCat
| DOI
| Download (ext.)
2001 | Journal Article | LibreCat-ID: 11870 |

Loog, M., Duin, R. P. W., & Haeb-Umbach, R. (2001). Multiclass linear dimension reduction by weighted pairwise Fisher criteria. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(7), 762–766. https://doi.org/10.1109/34.935849
LibreCat
| DOI
| Download (ext.)
2000 | Conference Paper | LibreCat-ID: 11758 |

Duin, R. P. W., Loog, M., & Haeb-Umbach, R. (2000). Multi-class Linear Feature Extraction by Nonlinear PCA. In International Conference on Pattern Recognition (ICPR 2000).
LibreCat
| Download (ext.)
2000 | Conference Paper | LibreCat-ID: 11779 |

Haeb-Umbach, R. (2000). Data-driven Phonetic Regression Class Tree Estimation for MLLR Adaptation. In International Conference on Spoken Language Processing (ICSLP 2000).
LibreCat
| Download (ext.)
2000 | Conference Paper | LibreCat-ID: 11869 |

Lieb, M., & Haeb-Umbach, R. (2000). LDA derived cepstral trajectory filters in adverse environmental conditions. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000) (Vol. 2, pp. II1105-II1108 vol.2). https://doi.org/10.1109/ICASSP.2000.859157
LibreCat
| DOI
| Download (ext.)
2000 | Conference Paper | LibreCat-ID: 11871 |

Loog, M., & Haeb-Umbach, R. (2000). Multi-class Linear Dimension Reduction by Generalized Fisher Criteria. In International Conference on Spoken Language Processing (ICSLP 2000).
LibreCat
| Download (ext.)
1999 | Conference Paper | LibreCat-ID: 11728 |

Beyerlein, P., Aubert, X. L., Haeb-Umbach, R., Harris, M. J., Klakow, D., Wendemuth, A., … Sixtus, A. (1999). The Philips/RWTH system for transcription of broadcast news. In Eurospeech.
LibreCat
| Download (ext.)
1999 | Conference Paper | LibreCat-ID: 11729 |

Beyerlein, P., Aubert, X. L., Haeb-Umbach, R., Harris, M. J., Klakow, D., Wendemuth, A., … Sixtus, A. (1999). The Philips/RWTH System for Transcription of Broadcast News. In Broadcast News Transcription and Understanding Workshop, Washington.
LibreCat
| Download (ext.)
1999 | Conference Paper | LibreCat-ID: 11780 |

Haeb-Umbach, R. (1999). Investigations on inter-speaker variability in the feature space. In ICASSP99 Phoenix, AZ.
LibreCat
| Download (ext.)
1999 | Conference Paper | LibreCat-ID: 11791 |

Haeb-Umbach, R., & Loog, M. (1999). An Investigation of Cepstral Parameterisations for Large Vocabulary Speech Recognition. In Eurospeech.
LibreCat
| Download (ext.)
1999 | Conference Paper | LibreCat-ID: 11805 |

Harris, M. J., Aubert, X. L., Haeb-Umbach, R., & Beyerlein, P. (1999). A study of broadcast news audio stream segmentation and segment clustering. In Eurospeech.
LibreCat
| Download (ext.)
1998 | Conference Paper | LibreCat-ID: 11730 |

Beyerlein, P., Aubert, X. L., Haeb-Umbach, R., Klakow, D., Ullrich, M., Wendemuth, A., & Wilcox, P. (1998). Automatic Transcription of English Broadcast News. In DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne.
LibreCat
| Download (ext.)
1998 | Conference Paper | LibreCat-ID: 11784 |

Haeb-Umbach, R., Aubert, X. L., Beyerlein, P., Klakow, D., Ullrich, M., Wendemuth, A., & Wilcox, P. (1998). Acoustic Modeling in the Philips Hub-4 Continuous-Speech Recognition System. In DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne.
LibreCat
| Download (ext.)
1998 | Conference Paper | LibreCat-ID: 11842 |

Klakow, D., Aubert, X. L., Haeb-Umbach, R., Beyerlein, P., Ullrich, M., Wendemuth, A., & Wilcox, P. (1998). Language-Model Investigations related to Broadcast News. In DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne.
LibreCat
| Download (ext.)
1998 | Conference Paper | LibreCat-ID: 11936 |

Welling, L., Haeb-Umbach, R., Aubert, X., & Haberland, N. (1998). A Study on Speaker Normalization Using Vocal Tract Normalization and Speaker Adaptive Training. In ICASSP 1998, Seattle.
LibreCat
| Download (ext.)
1997 | Conference Paper | LibreCat-ID: 11750 |

Dolfing, J. G. A., & Haeb-Umbach, R. (1997). Signal Representations for Hidden Markov Model Based On-Line Handwriting Recognition. In ICASSP, Munich.
LibreCat
| Download (ext.)
1997 | Journal Article | LibreCat-ID: 11766
Gamm, S., Haeb-Umbach, R., & Langmann, D. (1997). The development of a command-based speech interface for a telephone answering machine. Speech Communication.
LibreCat
1997 | Conference Paper | LibreCat-ID: 11781 |

Haeb-Umbach, R. (1997). Robust Speech Recognition for Wireless Networks and Mobile Telephony. In Eurospeech.
LibreCat
| Download (ext.)
1997 | Conference Paper | LibreCat-ID: 11819 |

Hoege, H., Tropf, H. S., Winsky, R., van den Heuvel, H., Haeb-Umbach, R., & Choukri, K. (1997). European Speech Databases for Telephone Applications. In ICASSP, Munich.
LibreCat
| Download (ext.)
1997 | Conference Paper | LibreCat-ID: 11852 |

Langmann, D., Fischer, A., Wuppermann, F., Haeb-Umbach, R., & Eisele, T. (1997). Acoustic Front Ends for Speaker-Independent Digit Recognition in Car Environments. In Eurospeech.
LibreCat
| Download (ext.)
1997 | Conference Paper | LibreCat-ID: 11855
Langmann, D., Wuppermann, F., Haeb-Umbach, R., Fischer, A., & Eisele, T. (1997). Investigation of Acoustic Front Ends for Speaker-Independent Speech Recognition in the Car. In Aachener Kolloquium on Signal Theory.
LibreCat
1996 | Conference Paper | LibreCat-ID: 11761 |

Eisele, T., Haeb-Umbach, R., & Langmann, D. (1996). A Comparative Study of Linear Feature Transformation Techniques for Automatic Speech Recognition. In ICSLP , Philadelphia.
LibreCat
| Download (ext.)
1996 | Conference Paper | LibreCat-ID: 11767 |

Gamm, S., Haeb-Umbach, R., & Langmann, D. (1996). Findings with the Design of a Command-Based Speech Interface for a Voice Mail System. In IEEE Workshop on Interactive Voice Technology for Telecommunications Applications.
LibreCat
| Download (ext.)
1996 | Conference Paper | LibreCat-ID: 11853 |

Langmann, D., & Haeb-Umbach, R. (1996). FRESCO: The French Telephone Speech Data Collection - Part of the European SpeechDat(M) Project. In ICSLP, Philadelphia.
LibreCat
| Download (ext.)
1996 | Conference Paper | LibreCat-ID: 11854
Langmann, D., Haeb-Umbach, R., & Eisele, T. (1996). Robust Rejection Modeling for a Small-Vocabulary Application. In ITG Fachtagung Sprachkommunikation, Frankfurt.
LibreCat
1995 | Conference Paper | LibreCat-ID: 11757 |

Dugast, C., Beyerlein, P., & Haeb-Umbach, R. (1995). Application of Clustering Techniques to Mixture Density Modelling for Continuous-Speech Recognition. In ICASSP, Detroit.
LibreCat
| Download (ext.)
1995 | Journal Article | LibreCat-ID: 11764
Gamm, S., & Haeb-Umbach, R. (1995). User interface design of voice controlled consumer electronics. Philips Journal of Research.
LibreCat
1995 | Conference Paper | LibreCat-ID: 11765
Gamm, S., & Haeb-Umbach, R. (1995). Human Factors of a Voice-Controlled Car Stereo. In Eurospeech, Madrid.
LibreCat
1995 | Conference Paper | LibreCat-ID: 11768
Gamm, S., Haeb-Umbach, R., & Langmann, D. (1995). The Usability Engineering of a Voice-Controlled Answering Machine. In International Symposium on Human Factors in Telecommunications, Melbourne.
LibreCat
1995 | Journal Article | LibreCat-ID: 11786
Haeb-Umbach, R., Beyerlein, P., & Geller, D. (1995). Speech recognition algorithms for voice control interfaces. Philips Journal of Research.
LibreCat
1995 | Conference Paper | LibreCat-ID: 11787 |

Haeb-Umbach, R., Beyerlein, P., & Thelen, E. (1995). Automatic Transcription of Unknown Words in a Speech Recognition System. In ICASSP, Detroit.
LibreCat
| Download (ext.)
1995 | Journal Article | LibreCat-ID: 11905
Steinbiss, V., Ney, H. J., Aubert, X. L., Besling, S., Dugast, C., Essen, U., … Tran, B. H. (1995). The Philips Research system for continuous-speech dictation. Philips Journal of Research.
LibreCat
1995 | Journal Article | LibreCat-ID: 11948
Steinbiss, V., Ney, H. J., Essen, U., Tran, B. H., Aubert, X. L., Dugast, C., … Bartosik, H. (1995). Continuous speech dictation - From theory to practice. Speech Communication.
LibreCat
1994 | Journal Article | LibreCat-ID: 11796
Haeb-Umbach, R., & Ney, H. (1994). Improvements in beam search for 10000-word continuous-speech recognition. IEEE Transactions on Speech and Audio Processing.
LibreCat
1994 | Conference Paper | LibreCat-ID: 11878
Ney, H., Steinbeiss, V., Aubert, X. L., & Haeb-Umbach, R. (1994). Progress in Large-Vocabulary, Continuous Speech Recognition. In Artifical Intelligence, Progress and Prospects of Speech Research and Technology, Munich.
LibreCat