LibreCat – Publication List Manager

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

304 Publications

2024 | Journal Article | LibreCat-ID: 52958 |

C. Boeddeker, A. S. Subramanian, G. Wichern, R. Haeb-Umbach, and J. Le Roux, “TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 1185–1197, 2024, doi: 10.1109/taslp.2024.3350887.

LibreCat | DOI | Download (ext.)

2024 | Conference Paper | LibreCat-ID: 53659

T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios,” presented at the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seoul, 2024, doi: 10.1109/icassp48485.2024.10445911.

LibreCat | DOI

2023 | Conference Paper | LibreCat-ID: 48269 |

T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “On the Integration of Sampling Rate Synchronization and Acoustic Beamforming,” presented at the European Signal Processing Conference (EUSIPCO), Helsinki, 2023.

LibreCat | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 47128 |

T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization,” presented at the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes, 2023, doi: 10.1109/icassp49357.2023.10095370.

LibreCat | Files available | DOI

2023 | Conference Paper | LibreCat-ID: 48270 |

J. Schmalenstroeer, T. Gburrek, and R. Haeb-Umbach, “LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices,” presented at the ITG Conference on Speech Communication, Aachen, 2023.

LibreCat | Files available

2023 | Conference Paper | LibreCat-ID: 47129 |

T. Cord-Landwehr, C. Boeddeker, C. Zorilă, R. Doddipatla, and R. Haeb-Umbach, “A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures,” 2023, doi: 10.21437/interspeech.2023-1379.

LibreCat | Files available | DOI

2023 | Conference Paper | LibreCat-ID: 48355 |

F. Rautenberg, M. Kuhlmann, J. Wiechmann, F. Seebauer, P. Wagner, and R. Haeb-Umbach, “On Feature Importance and Interpretability of Speaker Representations,” presented at the ITG Conference on Speech Communication, Aachen, 2023.

LibreCat | Files available | Download (ext.) | arXiv

2023 | Conference Paper | LibreCat-ID: 48410 |

J. Wiechmann, F. Rautenberg, P. Wagner, and R. Haeb-Umbach, “Explaining voice characteristics to novice voice practitioners-How successful is it?,” 2023.

LibreCat | Files available | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 48390

S. Berger, P. Vieting, C. Boeddeker, R. Schlüter, and R. Haeb-Umbach, “Mixture Encoder for Joint Speech Separation and Recognition,” 2023, doi: 10.21437/interspeech.2023-1815.

LibreCat | DOI

2023 | Conference Paper | LibreCat-ID: 46069

F. Seebauer, M. Kuhlmann, R. Haeb-Umbach, and P. Wagner, “Re-examining the quality dimensions of synthetic speech,” 2023.

LibreCat

2023 | Journal Article | LibreCat-ID: 35602 |

T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 576–589, 2023, doi: 10.1109/taslp.2022.3228629.

LibreCat | Files available | DOI

2023 | Conference Paper | LibreCat-ID: 48281 |

T. von Neumann, C. Boeddeker, K. Kinoshita, M. Delcroix, and R. Haeb-Umbach, “On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems,” 2023, doi: 10.1109/icassp49357.2023.10094784.

LibreCat | Files available | DOI | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 48275 |

T. von Neumann, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems,” presented at the CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin, 2023.

LibreCat | Files available | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 49109 |

T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks,” presented at the 57th Asilomar Conference on Signals, Systems, and Computers, 2023.

LibreCat | Files available

2023 | Conference Paper | LibreCat-ID: 44849 |

F. Rautenberg et al., “Speech Disentanglement for Analysis and Modification of Acoustic and Perceptual Speaker Characteristics,” in Fortschritte der Akustik - DAGA 2023, Hamburg, 2023, pp. 1409–1412.

LibreCat | Files available | Download (ext.)

2022 | Conference Paper | LibreCat-ID: 33954 |

C. Boeddeker, T. Cord-Landwehr, T. von Neumann, and R. Haeb-Umbach, “An Initialization Scheme for Meeting Separation with Spatial Mixture Models,” 2022, doi: 10.21437/interspeech.2022-10929.

LibreCat | DOI | Download (ext.)

2022 | Conference Paper | LibreCat-ID: 33471

J. Heitkämper, J. Schmalenstroeer, and R. Haeb-Umbach, “Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels,” presented at the 30th European Signal Processing Conference (EUSIPCO), Belgrad.

LibreCat | Files available

2022 | Conference Paper | LibreCat-ID: 33958

K. Kinoshita, T. von Neumann, M. Delcroix, C. Boeddeker, and R. Haeb-Umbach, “Utterance-by-utterance overlap-aware neural diarization with Graph-PIT,” in Proc. Interspeech 2022, 2022, pp. 1486–1490, doi: 10.21437/Interspeech.2022-11408.

LibreCat | DOI

2022 | Conference Paper | LibreCat-ID: 33819 |

T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “SA-SDR: A Novel Loss Function for Separation of Meeting Style Data,” 2022, doi: 10.1109/icassp43922.2022.9746757.

LibreCat | Files available | DOI

2022 | Conference Paper | LibreCat-ID: 33847 |

T. Cord-Landwehr, T. von Neumann, C. Boeddeker, and R. Haeb-Umbach, “MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator,” presented at the 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg, 2022.

LibreCat | Files available | arXiv

2022 | Conference Paper | LibreCat-ID: 33848 |

T. Cord-Landwehr, C. Boeddeker, T. von Neumann, C. Zorila, R. Doddipatla, and R. Haeb-Umbach, “Monaural source separation: From anechoic to reverberant environments,” presented at the 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.

LibreCat | Files available | arXiv

2022 | Conference Paper | LibreCat-ID: 33807 |

T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes,” 2022, doi: 10.1109/icassp43922.2022.9746284.

LibreCat | Files available | DOI

2022 | Journal Article | LibreCat-ID: 33451 |

C. Grimm, T. Fei, E. Warsitz, R. Farhoud, T. Breddermann, and R. Haeb-Umbach, “Warping of Radar Data Into Camera Image for Cross-Modal Supervision in Automotive Applications,” IEEE Transactions on Vehicular Technology, vol. 71, no. 9, pp. 9435–9449, 2022, doi: 10.1109/TVT.2022.3182411.

LibreCat | Files available | DOI

2022 | Conference Paper | LibreCat-ID: 33696 |

J. Wiechmann, T. Glarner, F. Rautenberg, P. Wagner, and R. Haeb-Umbach, “Technically enabled explaining of voice characteristics,” Bielefeld, 2022.

LibreCat | Files available

2022 | Conference Paper | LibreCat-ID: 33857 |

M. Kuhlmann, F. Seebauer, J. Ebbers, P. Wagner, and R. Haeb-Umbach, “Investigation into Target Speaking Rate Adaptation for Voice Conversion,” 2022, doi: 10.21437/interspeech.2022-10740.

LibreCat | Files available | DOI | Download (ext.)

2022 | Conference Paper | LibreCat-ID: 33808 |

T. Gburrek, J. Schmalenstroeer, J. Heitkaemper, and R. Haeb-Umbach, “Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription,” presented at the 17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022), Bamberg, Germany , 2022, doi: 10.1109/IWAENC53105.2022.9914772.

LibreCat | Files available | DOI

2022 | Misc | LibreCat-ID: 33816 |

T. Gburrek, C. Boeddeker, T. von Neumann, T. Cord-Landwehr, J. Schmalenstroeer, and R. Haeb-Umbach, A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv, 2022.

LibreCat | Files available | DOI

2022 | Conference Paper | LibreCat-ID: 34072 |

J. Ebbers, R. Haeb-Umbach, and R. Serizel, “Threshold Independent Evaluation of Sound Event Detection Scores,” 2022.

LibreCat | Files available

2021 | Journal Article | LibreCat-ID: 21065 |

R. Haeb-Umbach, J. Heymann, L. Drude, S. Watanabe, M. Delcroix, and T. Nakatani, “Far-Field Automatic Speech Recognition,” Proceedings of the IEEE, vol. 109, no. 2, pp. 124–148, 2021.

LibreCat | Files available | DOI

2021 | Conference Paper | LibreCat-ID: 28256

W. Zhang et al., “End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend,” 2021, doi: 10.1109/icassp39728.2021.9414464.

LibreCat | DOI

2021 | Conference Paper | LibreCat-ID: 24000

J. Heitkaemper, J. Schmalenstroeer, V. Ion, and R. Haeb-Umbach, “A Database for Research on Detection and Enhancement of Speech Transmitted over HF links,” in Speech Communication; 14th ITG-Symposium, 2021, pp. 1–5.

LibreCat

2021 | Conference Paper | LibreCat-ID: 44843 |

C. Boeddeker, F. Rautenberg, and R. Haeb-Umbach, “A Comparison and Combination of Unsupervised Blind Source Separation Techniques,” presented at the ITG Conference on Speech Communication, Kiel, 2021.

LibreCat | Files available | Download (ext.) | arXiv

2021 | Conference Paper | LibreCat-ID: 28259 |

C. Boeddeker et al., “Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation,” 2021, doi: 10.1109/icassp39728.2021.9414661.

LibreCat | Files available | DOI

2021 | Conference Paper | LibreCat-ID: 23998 |

J. Schmalenstroeer, J. Heitkaemper, J. Ullmann, and R. Haeb-Umbach, “Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech,” in 29th European Signal Processing Conference (EUSIPCO), 2021, pp. 1–5.

LibreCat | Download (ext.)

2021 | Journal Article | LibreCat-ID: 22528 |

T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “Geometry calibration in wireless acoustic sensor networks utilizing DoA and distance information,” EURASIP Journal on Audio, Speech, and Music Processing, 2021, doi: 10.1186/s13636-021-00210-x.

LibreCat | DOI | Download (ext.)

2021 | Conference Paper | LibreCat-ID: 23994 |

T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “Iterative Geometry Calibration from Distance Estimates for Wireless Acoustic Sensor Networks,” 2021, doi: 10.1109/icassp39728.2021.9413831.

LibreCat | Files available | DOI

2021 | Conference Paper | LibreCat-ID: 23999 |

T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “On Source-Microphone Distance Estimation Using Convolutional Recurrent Neural Networks,” in Speech Communication; 14th ITG-Symposium, 2021, pp. 1–5.

LibreCat | Files available

2021 | Conference Paper | LibreCat-ID: 29304 |

J. Ebbers, M. Kuhlmann, T. Cord-Landwehr, and R. Haeb-Umbach, “Contrastive Predictive Coding Supported Factorized Variational Autoencoder for Unsupervised Learning of Disentangled Speech Representations,” in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, pp. 3860–3864.

LibreCat | Files available

2021 | Conference Paper | LibreCat-ID: 26770 |

T. von Neumann, K. Kinoshita, C. Boeddeker, M. Delcroix, and R. Haeb-Umbach, “Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers,” presented at the Interspeech, 2021, doi: 10.21437/interspeech.2021-1177.

LibreCat | Files available | DOI

2021 | Conference Paper | LibreCat-ID: 29173 |

T. von Neumann, C. Boeddeker, K. Kinoshita, M. Delcroix, and R. Haeb-Umbach, “Speeding Up Permutation Invariant Training for Source Separation,” presented at the Speech Communication; 14th ITG Conference, Kiel, 2021.

LibreCat | Files available

2021 | Conference Paper | LibreCat-ID: 29308 |

J. Ebbers and R. Haeb-Umbach, “Self-Trained Audio Tagging and Sound Event Detection in Domestic Environments,” in Proceedings of the 6th Detection and Classification of Acoustic Scenes and Events 2021 Workshop (DCASE2021), 2021, pp. 226–230.

LibreCat | Files available

2021 | Conference Paper | LibreCat-ID: 29306 |

J. Ebbers, M. C. Keyser, and R. Haeb-Umbach, “Adapting Sound Recognition to A New Environment Via Self-Training,” in Proceedings of the 29th European Signal Processing Conference (EUSIPCO), 2021, pp. 1135–1139.

LibreCat | Files available

2021 | Journal Article | LibreCat-ID: 24456 |

K. J. Rohlfing et al., “Explanation as a Social Practice: Toward a Conceptual Framework for the Social Design of AI Systems,” IEEE Transactions on Cognitive and Developmental Systems, vol. 13, no. 3, pp. 717–728, 2021, doi: 10.1109/tcds.2020.3044366.

LibreCat | Files available | DOI

2020 | Conference Paper | LibreCat-ID: 17763 |

R. Haeb-Umbach, “Sprachtechnologien für Digitale Assistenten,” in Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2020, 2020, pp. 227–234.

LibreCat | Download (ext.)

2020 | Conference Paper | LibreCat-ID: 20700 |

C. Boeddeker et al., “Towards a speaker diarization system for the CHiME 2020 dinner party transcription,” in Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments, 2020.

LibreCat | Files available

2020 | Journal Article | LibreCat-ID: 17598 |

T. Nakatani, C. Boeddeker, K. Kinoshita, R. Ikeshita, M. Delcroix, and R. Haeb-Umbach, “Jointly optimal denoising, dereverberation, and source separation,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, pp. 1–1, 2020, doi: 10.1109/TASLP.2020.3013118.

LibreCat | DOI | Download (ext.)

2020 | Conference Paper | LibreCat-ID: 20504

J. Heitkaemper, D. Jakobeit, C. Boeddeker, L. Drude, and R. Haeb-Umbach, “Demystifying TasNet: A Dissecting Approach,” 2020.

LibreCat | Files available

2020 | Conference Paper | LibreCat-ID: 20505

J. Heitkaemper, J. Schmalenstroeer, and R. Haeb-Umbach, “Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments,” 2020.

LibreCat | Files available

2020 | Conference Paper | LibreCat-ID: 20762 |

T. von Neumann et al., “End-to-End Training of Time Domain Audio Separation and Recognition,” in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp. 7004–7008, doi: 10.1109/ICASSP40776.2020.9053461.

LibreCat | Files available | DOI

2020 | Conference Paper | LibreCat-ID: 20764 |

T. von Neumann et al., “Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR,” in Proc. Interspeech 2020, 2020, pp. 3097–3101, doi: 10.21437/Interspeech.2020-2519.

LibreCat | Files available | DOI

2020 | Conference Paper | LibreCat-ID: 18651 |

T. Gburrek, J. Schmalenstroeer, A. Brendel, W. Kellermann, and R. Haeb-Umbach, “Deep Neural Network based Distance Estimation for Geometry Calibration in Acoustic Sensor Network,” 2020.

LibreCat | Files available

2020 | Conference Paper | LibreCat-ID: 20766 |

K. Kinoshita, T. von Neumann, M. Delcroix, T. Nakatani, and R. Haeb-Umbach, “Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation,” in Proc. Interspeech 2020, 2020, pp. 2652–2656, doi: 10.21437/Interspeech.2020-2388.

LibreCat | Files available | DOI

2020 | Conference Paper | LibreCat-ID: 20753 |

J. Ebbers and R. Haeb-Umbach, “Forward-Backward Convolutional Recurrent Neural Networks and Tag-Conditioned Convolutional Neural Networks for Weakly Labeled Semi-Supervised Sound Event Detection,” 2020.

LibreCat | Files available

2019 | Journal Article | LibreCat-ID: 17762

R. Haeb-Umbach, “Lektionen für Alexa \& Co?!,” forschung, vol. 44, no. 1, pp. 12–15, 2019.

LibreCat | DOI

2019 | Journal Article | LibreCat-ID: 19446 |

L. Drude, J. Heitkaemper, C. Boeddeker, and R. Haeb-Umbach, “SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition,” ArXiv e-prints, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 11965 |

L. Drude, J. Heymann, and R. Haeb-Umbach, “Unsupervised training of neural mask-based beamforming,” in INTERSPEECH 2019, Graz, Austria, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 12874 |

L. Drude, D. Hasenklever, and R. Haeb-Umbach, “Unsupervised Training of a Deep Clustering Model for Multichannel Blind Source Separation,” in ICASSP 2019, Brighton, UK, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 12875 |

J. Heymann, L. Drude, R. Haeb-Umbach, K. Kinoshita, and T. Nakatani, “Joint Optimization of Neural Network-based WPE Dereverberation and Acoustic Model for Robust Online ASR,” in ICASSP 2019, Brighton, UK, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 12876 |

G. Kurz et al., “Directional Statistics and Filtering Using libDirectional,” in Journal of Statistical Software 89(4), 2019.

LibreCat | Files available

2019 | Journal Article | LibreCat-ID: 12890 |

L. Drude and R. Haeb-Umbach, “Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation,” IEEE Journal of Selected Topics in Signal Processing, 2019.

LibreCat | Files available | DOI

2019 | Conference Paper | LibreCat-ID: 15816 |

C. Zorila, C. Boeddeker, R. Doddipatla, and R. Haeb-Umbach, “An Investigation Into the Effectiveness of Enhancement in ASR Training and Test for Chime-5 Dinner Party Transcription,” in ASRU 2019, Sentosa, Singapore, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 14822 |

J. Heitkaemper, T. Feher, M. Freitag, and R. Haeb-Umbach, “A Study on Online Source Extraction in the Presence of Changing Speaker Positions,” in International Conference on Statistical Language and Speech Processing 2019, Ljubljana, Slovenia, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 14824 |

J. M. Martin-Donas, J. Heitkaemper, R. Haeb-Umbach, A. M. Gomez, and A. M. Peinado, “Multi-Channel Block-Online Source Extraction based on Utterance Adaptation,” in INTERSPEECH 2019, Graz, Austria, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 14826 |

N. Kanda, C. Boeddeker, J. Heitkaemper, Y. Fujita, S. Horiguchi, and R. Haeb-Umbach, “Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR,” in INTERSPEECH 2019, Graz, Austria, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 13271 |

T. von Neumann, K. Kinoshita, M. Delcroix, S. Araki, T. Nakatani, and R. Haeb-Umbach, “All-neural Online Source Separation, Counting, and Diarization for Meeting Analysis,” in ICASSP 2019, Brighton, UK, 2019.

LibreCat | Files available

2019 | Journal Article | LibreCat-ID: 15814 |

R. Haeb-Umbach et al., “Speech Processing for Digital Home Assistance: Combining Signal Processing With Deep-Learning Techniques,” IEEE Signal Processing Magazine, vol. 36, no. 6, pp. 111–124, 2019, doi: 10.1109/MSP.2019.2918706.

LibreCat | Files available | DOI

2019 | Journal Article | LibreCat-ID: 19450 |

R. Haeb-Umbach, “Lektionen für Alexa & Co?!,” DFG forschung 1/2019, pp. 12–15, 2019, doi: 10.1002/fors.201970104.

LibreCat | Files available | DOI

2019 | Conference Paper | LibreCat-ID: 15237 |

T. Gburrek, T. Glarner, J. Ebbers, R. Haeb-Umbach, and P. Wagner, “Unsupervised Learning of a Disentangled Speech Representation for Voice Conversion,” in Proc. 10th ISCA Speech Synthesis Workshop, Vienna, 2019, pp. 81–86, doi: 10.21437/SSW.2019-15.

LibreCat | Files available | DOI | Download (ext.)

2019 | Conference Paper | LibreCat-ID: 15794 |

J. Ebbers and R. Haeb-Umbach, “Convolutional Recurrent Neural Network and Data Augmentation for Audio Tagging with Noisy Labels and Minimal Supervision,” 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 15796 |

J. Ebbers, L. Drude, R. Haeb-Umbach, A. Brendel, and W. Kellermann, “Weakly Supervised Sound Activity Detection and Event Classification in Acoustic Sensor Networks,” 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 15792 |

A. Nelus, J. Ebbers, R. Haeb-Umbach, and R. Martin, “Privacy-preserving Variational Information Feature Extraction for Domestic Activity Monitoring Versus Speaker Identification,” 2019.

LibreCat | Files available

2018 | Conference Paper | LibreCat-ID: 11760 |

J. Ebbers, A. Nelus, R. Martin, and R. Haeb-Umbach, “Evaluation of Modulation-MFCC Features and DNN Classification for Acoustic Event Detection,” in DAGA 2018, München, 2018.

LibreCat | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11835 |

J. Heymann, L. Drude, R. Haeb-Umbach, K. Kinoshita, and T. Nakatani, “Frame-Online DNN-WPE Dereverberation,” in IWAENC 2018, Tokio, Japan, 2018.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11837 |

J. Heitkaemper, J. Heymann, and R. Haeb-Umbach, “Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming,” in ITG 2018, Oldenburg, Germany, 2018.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11872 |

L. Drude et al., “Integration neural network based beamforming and weighted prediction error dereverberation,” in INTERSPEECH 2018, Hyderabad, India, 2018.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11873 |

L. Drude, J. Heymann, C. Boeddeker, and R. Haeb-Umbach, “NARA-WPE: A Python package for weighted prediction error dereverberation in Numpy and Tensorflow for online and offline processing,” in ITG 2018, Oldenburg, Germany, 2018.

LibreCat | Files available | Download (ext.)

2018 | Journal Article | LibreCat-ID: 11916 |

V. Despotovic, O. Walter, and R. Haeb-Umbach, “Machine learning techniques for semantic analysis of dysarthric speech: An experimental study,” Speech Communication 99 (2018) 242-251 (Elsevier B.V.), 2018.

LibreCat | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 12898 |

L. Drude, T. von Neumann, and R. Haeb-Umbach, “Deep Attractor Networks for Speaker Re-Identifikation and Blind Source Separation,” in ICASSP 2018, Calgary, Canada, 2018.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 12900 |

L. Drude, Takuya Higuchi, K. Kinoshita, T. Nakatani, and R. Haeb-Umbach, “Dual Frequency- and Block-Permutation Alignment for Deep Learning Based Block-Online Blind Source Separation,” in ICASSP 2018, Calgary, Canada, 2018.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 12901 |

C. Boeddeker, H. Erdogan, T. Yoshioka, and R. Haeb-Umbach, “Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition,” in ICASSP 2018, Calgary, Canada, 2018.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 12899 |

C. Boeddeker, J. Heitkaemper, J. Schmalenstroeer, L. Drude, J. Heymann, and R. Haeb-Umbach, “Front-End Processing for the CHiME-5 Dinner Party Scenario,” 2018.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 6859

H. Afifi, J. Schmalenstroeer, J. Ullmann, R. Haeb-Umbach, and H. Karl, “MARVELO - A Framework for Signal Processing in Wireless Acoustic Sensor Networks,” in Speech Communication; 13th ITG-Symposium, 2018, pp. 1–5.

LibreCat

2018 | Conference Paper | LibreCat-ID: 11747 |

C. Grimm, T. Breddermann, R. Farhoud, T. Fei, E. Warsitz, and R. Haeb-Umbach, “Discrimination of Stationary from Moving Targets with Recurrent Neural Networks in Automotive Radar,” 2018.

LibreCat | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11907 |

T. Glarner, P. Hanebrink, J. Ebbers, and R. Haeb-Umbach, “Full Bayesian Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery,” 2018.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11838 |

J. Schmalenstroeer and R. Haeb-Umbach, “Efficient Sampling Rate Offset Compensation - An Overlap-Save Based Approach,” 2018.

LibreCat | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11876 |

M. Kitza et al., “The RWTH/UPB System Combination for the CHiME 2018 Workshop,” 2018.

LibreCat | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11836 |

J. Ebbers, J. Heitkaemper, J. Schmalenstroeer, and R. Haeb-Umbach, “Benchmarking Neural Network Architectures for Acoustic Sensor Networks,” 2018.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11839 |

J. Schmalenstroeer and R. Haeb-Umbach, “Insights into the Interplay of Sampling Rate Offsets and MVDR Beamforming,” 2018.

LibreCat | Download (ext.)

2017 | Conference Paper | LibreCat-ID: 11717 |

P. Arora and R. Haeb-Umbach, “A Study on Transfer Learning for Acoustic Event Detection in a Real Life Scenario,” in IEEE 19th International Workshop on Multimedia Signal Processing (MMSP), 2017.

LibreCat | Files available | Download (ext.)

2017 | Report | LibreCat-ID: 11735 |

C. Boeddeker, P. Hanebrink, L. Drude, J. Heymann, and R. Haeb-Umbach, On the Computation of Complex-valued Gradients with Application to Statistically Optimum Beamforming. 2017.

LibreCat | Download (ext.)

2017 | Conference Paper | LibreCat-ID: 11736 |

C. Boeddeker, P. Hanebrink, L. Drude, J. Heymann, and R. Haeb-Umbach, “Optimizing Neural-Network Supported Acoustic Beamforming by Algorithmic Differentiation,” in Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2017.

LibreCat | Download (ext.)

2017 | Conference Paper | LibreCat-ID: 11737 |

A. Chinaev and R. Haeb-Umbach, “A Generalized Log-Spectral Amplitude Estimator for Single-Channel Speech Enhancement,” in Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2017.

LibreCat | Files available | Download (ext.)

2017 | Conference Paper | LibreCat-ID: 11754 |

L. Drude and R. Haeb-Umbach, “Tight integration of spatial and spectral features for BSS with Deep Clustering embeddings,” in INTERSPEECH 2017, Stockholm, Schweden, 2017.

LibreCat | Files available | Download (ext.)

2017 | Conference Paper | LibreCat-ID: 11770 |

T. Glarner, B. Boenninghoff, O. Walter, and R. Haeb-Umbach, “Leveraging Text Data for Word Segmentation for Underresourced Languages,” in INTERSPEECH 2017, Stockholm, Schweden, 2017.

LibreCat | Files available | Download (ext.)

2017 | Conference Paper | LibreCat-ID: 11809 |

J. Heymann, L. Drude, C. Boeddeker, P. Hanebrink, and R. Haeb-Umbach, “BEAMNET: End-to-End Training of a Beamformer-Supported Multi-Channel ASR System,” in Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2017.

LibreCat | Files available | Download (ext.)

2017 | Journal Article | LibreCat-ID: 11811 |

J. Heymann, L. Drude, and R. Haeb-Umbach, “A Generic Neural Acoustic Beamforming Architecture for Robust Multi-Channel Speech Processing,” Computer Speech and Language, 2017.

LibreCat | Download (ext.)

2017 | Conference Paper | LibreCat-ID: 11763 |

T. Fei, C. Grimm, R. Farhoud, T. Breddermann, E. Warsitz, and R. Haeb-Umbach, “A Novel Target Separation Algorithm Applied to The Two-Dimensional Spectrum for FMCW Automotive Radar Systems,” 2017.

LibreCat | Download (ext.)

2017 | Conference Paper | LibreCat-ID: 11772 |

C. Grimm, T. Breddermann, R. Farhoud, T. Fei, E. Warsitz, and R. Haeb-Umbach, “Hypothesis Test for the Detection of Moving Targets in Automotive Radar,” 2017.

LibreCat | Download (ext.)

2017 | Conference Paper | LibreCat-ID: 11759 |

J. Ebbers, J. Heymann, L. Drude, T. Glarner, R. Haeb-Umbach, and B. Raj, “Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery,” 2017.

LibreCat | Files available | Download (ext.)

2017 | Conference Paper | LibreCat-ID: 11895 |

J. Schmalenstroeer, J. Heymann, L. Drude, C. Boeddeker, and R. Haeb-Umbach, “Multi-Stage Coherence Drift Based Sampling Rate Synchronization for Acoustic Beamforming,” 2017.

LibreCat | Files available | Download (ext.)

Publications at Paderborn University

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

304 Publications

Filters and Search Terms

Search

Filter Publications

Display / Sort

Export / Embed

Publications at Paderborn University

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

304 Publications

Filters and Search Terms

Search

Filter Publications

Display / Sort

Export / Embed

Export Options