LibreCat – Publication List Manager

317 Publications

2020 | Conference Paper | LibreCat-ID: 17763 |

Haeb-Umbach, R. (2020). Sprachtechnologien für Digitale Assistenten. In R. Böck, I. Siegert, & A. Wendemuth (Eds.), Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2020 (pp. 227–234). TUDpress, Dresden.

LibreCat | Download (ext.)

2020 | Conference Paper | LibreCat-ID: 20695 |

Boeddeker, C., Nakatani, T., Kinoshita, K., & Haeb-Umbach, R. (2020). Jointly Optimal Dereverberation and Beamforming. In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp40776.2020.9054393

LibreCat | Files available | DOI

2020 | Conference Paper | LibreCat-ID: 20700 |

Boeddeker, C., Cord-Landwehr, T., Heitkaemper, J., Zorila, C., Hayakawa, D., Li, M., … Haeb-Umbach, R. (2020). Towards a speaker diarization system for the CHiME 2020 dinner party transcription. In Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments.

LibreCat | Files available

2020 | Journal Article | LibreCat-ID: 17598 |

Nakatani, T., Boeddeker, C., Kinoshita, K., Ikeshita, R., Delcroix, M., & Haeb-Umbach, R. (2020). Jointly optimal denoising, dereverberation, and source separation. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 1–1. https://doi.org/10.1109/TASLP.2020.3013118

LibreCat | DOI | Download (ext.)

2020 | Conference Paper | LibreCat-ID: 20504

Heitkaemper, J., Jakobeit, D., Boeddeker, C., Drude, L., & Haeb-Umbach, R. (2020). Demystifying TasNet: A Dissecting Approach. ICASSP 2020, Virtual, Barcelona, Spain.

LibreCat | Files available

2020 | Preprint | LibreCat-ID: 28263

Watanabe, S., Mandel, M., Barker, J., Vincent, E., Arora, A., Chang, X., Khudanpur, S., Manohar, V., Povey, D., Raj, D., Snyder, D., Subramanian, A. S., Trmal, J., Yair, B. B., Boeddeker, C., Ni, Z., Fujita, Y., Horiguchi, S., Kanda, N., … Ryant, N. (2020). CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings. In arXiv:2004.09249.

LibreCat

2020 | Conference Paper | LibreCat-ID: 20505

Heitkaemper, J., Schmalenstroeer, J., & Haeb-Umbach, R. (2020). Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments. INTERSPEECH 2020, Virtual, Shanghai, China.

LibreCat | Files available

2020 | Conference Paper | LibreCat-ID: 20762 |

von Neumann, T., Kinoshita, K., Drude, L., Boeddeker, C., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). End-to-End Training of Time Domain Audio Separation and Recognition. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 7004–7008. https://doi.org/10.1109/ICASSP40776.2020.9053461

LibreCat | Files available | DOI

2020 | Conference Paper | LibreCat-ID: 20764 |

von Neumann, T., Boeddeker, C., Drude, L., Kinoshita, K., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. Proc. Interspeech 2020, 3097–3101. https://doi.org/10.21437/Interspeech.2020-2519

LibreCat | Files available | DOI

2020 | Conference Paper | LibreCat-ID: 18651 |

Gburrek, T., Schmalenstroeer, J., Brendel, A., Kellermann, W., & Haeb-Umbach, R. (2020). Deep Neural Network based Distance Estimation for Geometry Calibration in Acoustic Sensor Network. European Signal Processing Conference (EUSIPCO).

LibreCat | Files available

2020 | Conference Paper | LibreCat-ID: 20766 |

Kinoshita, K., von Neumann, T., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2020). Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation. Proc. Interspeech 2020, 2652–2656. https://doi.org/10.21437/Interspeech.2020-2388

LibreCat | Files available | DOI

2020 | Conference Paper | LibreCat-ID: 20753 |

Ebbers, J., & Haeb-Umbach, R. (2020). Forward-Backward Convolutional Recurrent Neural Networks and Tag-Conditioned Convolutional Neural Networks for Weakly Labeled Semi-Supervised Sound Event Detection. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2020 Workshop (DCASE2020).

LibreCat | Files available

2019 | Journal Article | LibreCat-ID: 17762

Haeb-Umbach, R. (2019). Lektionen für Alexa & Co?! Forschung, 44(1), 12–15. https://doi.org/10.1002/fors.201970104

LibreCat | DOI

2019 | Journal Article | LibreCat-ID: 19446 |

Drude, L., Heitkaemper, J., Boeddeker, C., & Haeb-Umbach, R. (2019). SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition. ArXiv E-Prints.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 11965 |

Drude, L., Heymann, J., & Haeb-Umbach, R. (2019). Unsupervised training of neural mask-based beamforming. In INTERSPEECH 2019, Graz, Austria.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 12874 |

Drude, L., Hasenklever, D., & Haeb-Umbach, R. (2019). Unsupervised Training of a Deep Clustering Model for Multichannel Blind Source Separation. In ICASSP 2019, Brighton, UK.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 12875 |

Heymann, J., Drude, L., Haeb-Umbach, R., Kinoshita, K., & Nakatani, T. (2019). Joint Optimization of Neural Network-based WPE Dereverberation and Acoustic Model for Robust Online ASR. In ICASSP 2019, Brighton, UK.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 12876 |

Kurz, G., Gilitschenski, I., Pfaff, F., Drude, L., Hanebeck, U. D., Haeb-Umbach, R., & Siegwart, R. Y. (2019). Directional Statistics and Filtering Using libDirectional. In Journal of Statistical Software 89(4).

LibreCat | Files available

2019 | Journal Article | LibreCat-ID: 12890 |

Drude, L., & Haeb-Umbach, R. (2019). Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation. IEEE Journal of Selected Topics in Signal Processing. https://doi.org/10.1109/JSTSP.2019.2912565

LibreCat | Files available | DOI

2019 | Conference Paper | LibreCat-ID: 15812 |

Heymann, J., Sim, K. C., & Li, B. (2019). Improving CTC Using Stimulated Learning for Sequence Modeling. In ICASSP 2019, Brighton, UK.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 15816 |

Zorila, C., Boeddeker, C., Doddipatla, R., & Haeb-Umbach, R. (2019). An Investigation Into the Effectiveness of Enhancement in ASR Training and Test for CHiME-5 Dinner Party Transcription. In ASRU 2019, Sentosa, Singapore.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 14822 |

Heitkaemper, J., Feher, T., Freitag, M., & Haeb-Umbach, R. (2019). A Study on Online Source Extraction in the Presence of Changing Speaker Positions. In International Conference on Statistical Language and Speech Processing 2019, Ljubljana, Slovenia.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 14824 |

Martin-Donas, J. M., Heitkaemper, J., Haeb-Umbach, R., Gomez, A. M., & Peinado, A. M. (2019). Multi-Channel Block-Online Source Extraction based on Utterance Adaptation. In INTERSPEECH 2019, Graz, Austria.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 14826 |

Kanda, N., Boeddeker, C., Heitkaemper, J., Fujita, Y., Horiguchi, S., & Haeb-Umbach, R. (2019). Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR. In INTERSPEECH 2019, Graz, Austria.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 13271 |

von Neumann, T., Kinoshita, K., Delcroix, M., Araki, S., Nakatani, T., & Haeb-Umbach, R. (2019). All-neural Online Source Separation, Counting, and Diarization for Meeting Analysis. In ICASSP 2019, Brighton, UK.

LibreCat | Files available

2019 | Journal Article | LibreCat-ID: 15814 |

Haeb-Umbach, R., Watanabe, S., Nakatani, T., Bacchiani, M., Hoffmeister, B., Seltzer, M. L., Zen, H., & Souden, M. (2019). Speech Processing for Digital Home Assistance: Combining Signal Processing With Deep-Learning Techniques. IEEE Signal Processing Magazine, 36(6), 111–124. https://doi.org/10.1109/MSP.2019.2918706

LibreCat | Files available | DOI

2019 | Journal Article | LibreCat-ID: 19450 |

Haeb-Umbach, R. (2019). Lektionen für Alexa & Co?! DFG Forschung 1/2019, 12–15. https://doi.org/10.1002/fors.201970104

LibreCat | Files available | DOI

2019 | Conference Paper | LibreCat-ID: 15237 |

Gburrek, T., Glarner, T., Ebbers, J., Haeb-Umbach, R., & Wagner, P. (2019). Unsupervised Learning of a Disentangled Speech Representation for Voice Conversion. Proc. 10th ISCA Speech Synthesis Workshop, 81–86. https://doi.org/10.21437/SSW.2019-15

LibreCat | Files available | DOI | Download (ext.)

2019 | Conference Paper | LibreCat-ID: 15794 |

Ebbers, J., & Haeb-Umbach, R. (2019). Convolutional Recurrent Neural Network and Data Augmentation for Audio Tagging with Noisy Labels and Minimal Supervision. DCASE2019 Workshop, New York, USA.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 15796 |

Ebbers, J., Drude, L., Haeb-Umbach, R., Brendel, A., & Kellermann, W. (2019). Weakly Supervised Sound Activity Detection and Event Classification in Acoustic Sensor Networks. CAMSAP 2019, Guadeloupe, West Indies.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 15792 |

Nelus, A., Ebbers, J., Haeb-Umbach, R., & Martin, R. (2019). Privacy-preserving Variational Information Feature Extraction for Domestic Activity Monitoring Versus Speaker Identification. INTERSPEECH 2019, Graz, Austria.

LibreCat | Files available

2018 | Conference Paper | LibreCat-ID: 18107

Heymann, J., Bacchiani, M., & Sainath, T. N. (2018). Performance of Mask Based Statistical Beamforming in a Smart Home Scenario. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 6722–6726). https://doi.org/10.1109/ICASSP.2018.8462372

LibreCat | DOI

2018 | Conference Paper | LibreCat-ID: 11760 |

Ebbers, J., Nelus, A., Martin, R., & Haeb-Umbach, R. (2018). Evaluation of Modulation-MFCC Features and DNN Classification for Acoustic Event Detection. In DAGA 2018, München.

LibreCat | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11835 |

Heymann, J., Drude, L., Haeb-Umbach, R., Kinoshita, K., & Nakatani, T. (2018). Frame-Online DNN-WPE Dereverberation. In IWAENC 2018, Tokyo, Japan.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11837 |

Heitkaemper, J., Heymann, J., & Haeb-Umbach, R. (2018). Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming. In ITG 2018, Oldenburg, Germany.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11872 |

Drude, L., Boeddeker, C., Heymann, J., Kinoshita, K., Delcroix, M., Nakatani, T., & Haeb-Umbach, R. (2018). Integrating neural network based beamforming and weighted prediction error dereverberation. In INTERSPEECH 2018, Hyderabad, India.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11873 |

Drude, L., Heymann, J., Boeddeker, C., & Haeb-Umbach, R. (2018). NARA-WPE: A Python package for weighted prediction error dereverberation in Numpy and Tensorflow for online and offline processing. In ITG 2018, Oldenburg, Germany.

LibreCat | Files available | Download (ext.)

2018 | Journal Article | LibreCat-ID: 11916 |

Despotovic, V., Walter, O., & Haeb-Umbach, R. (2018). Machine learning techniques for semantic analysis of dysarthric speech: An experimental study. Speech Communication, 99, 242–251.

LibreCat | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 12898 |

Drude, L., von Neumann, T., & Haeb-Umbach, R. (2018). Deep Attractor Networks for Speaker Re-Identification and Blind Source Separation. In ICASSP 2018, Calgary, Canada.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 12900 |

Drude, L., Higuchi, T., Kinoshita, K., Nakatani, T., & Haeb-Umbach, R. (2018). Dual Frequency- and Block-Permutation Alignment for Deep Learning Based Block-Online Blind Source Separation. In ICASSP 2018, Calgary, Canada.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 12901 |

Boeddeker, C., Erdogan, H., Yoshioka, T., & Haeb-Umbach, R. (2018). Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition. In ICASSP 2018, Calgary, Canada.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 29923 |

Watanabe, S., Hori, T., Karita, S., Hayashi, T., Nishitoba, J., Unno, Y., Enrique Yalta Soplin, N., Heymann, J., Wiesner, M., Chen, N., Renduchintala, A., & Ochiai, T. (2018). ESPnet: End-to-End Speech Processing Toolkit. INTERSPEECH 2018, Hyderabad, India, 2207–2211. https://doi.org/10.21437/Interspeech.2018-1456

LibreCat | Files available | DOI

2018 | Conference Paper | LibreCat-ID: 12899 |

Boeddeker, C., Heitkaemper, J., Schmalenstroeer, J., Drude, L., Heymann, J., & Haeb-Umbach, R. (2018). Front-End Processing for the CHiME-5 Dinner Party Scenario. Proc. CHiME 2018 Workshop on Speech Processing in Everyday Environments, Hyderabad, India.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 6859

Afifi, H., Schmalenstroeer, J., Ullmann, J., Haeb-Umbach, R., & Karl, H. (2018). MARVELO - A Framework for Signal Processing in Wireless Acoustic Sensor Networks. Speech Communication; 13th ITG-Symposium, 1–5.

LibreCat

2018 | Conference Paper | LibreCat-ID: 11747 |

Grimm, C., Breddermann, T., Farhoud, R., Fei, T., Warsitz, E., & Haeb-Umbach, R. (2018). Discrimination of Stationary from Moving Targets with Recurrent Neural Networks in Automotive Radar. International Conference on Microwaves for Intelligent Mobility (ICMIM) 2018.

LibreCat | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11907 |

Glarner, T., Hanebrink, P., Ebbers, J., & Haeb-Umbach, R. (2018). Full Bayesian Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery. INTERSPEECH 2018, Hyderabad, India.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11838 |

Schmalenstroeer, J., & Haeb-Umbach, R. (2018). Efficient Sampling Rate Offset Compensation - An Overlap-Save Based Approach. 26th European Signal Processing Conference (EUSIPCO 2018).

LibreCat | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11876 |

Kitza, M., Michel, W., Boeddeker, C., Heitkaemper, J., Menne, T., Schlüter, R., Ney, H., Schmalenstroeer, J., Drude, L., Heymann, J., & Haeb-Umbach, R. (2018). The RWTH/UPB System Combination for the CHiME 2018 Workshop. Proc. CHiME 2018 Workshop on Speech Processing in Everyday Environments, Hyderabad, India.

LibreCat | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11836 |

Ebbers, J., Heitkaemper, J., Schmalenstroeer, J., & Haeb-Umbach, R. (2018). Benchmarking Neural Network Architectures for Acoustic Sensor Networks. ITG 2018, Oldenburg, Germany.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11839 |

Schmalenstroeer, J., & Haeb-Umbach, R. (2018). Insights into the Interplay of Sampling Rate Offsets and MVDR Beamforming. ITG 2018, Oldenburg, Germany.

LibreCat | Download (ext.)

Publications at Paderborn University
