LibreCat – Publication List Manager

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.

317 Publications

2024 | Journal Article | LibreCat-ID: 52958 |

Boeddeker, Christoph, Aswin Shanmugam Subramanian, Gordon Wichern, Reinhold Haeb-Umbach, and Jonathan Le Roux. “TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings.” IEEE/ACM Transactions on Audio, Speech, and Language Processing 32 (2024): 1185–97. https://doi.org/10.1109/taslp.2024.3350887.

LibreCat | DOI | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 48269 |

Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “On the Integration of Sampling Rate Synchronization and Acoustic Beamforming.” In European Signal Processing Conference (EUSIPCO), 2023.

LibreCat | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 47128 |

Cord-Landwehr, Tobias, Christoph Boeddeker, Cătălin Zorilă, Rama Doddipatla, and Reinhold Haeb-Umbach. “Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization.” In ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023. https://doi.org/10.1109/icassp49357.2023.10095370.

LibreCat | Files available | DOI

2023 | Conference Paper | LibreCat-ID: 48270 |

Schmalenstroeer, Joerg, Tobias Gburrek, and Reinhold Haeb-Umbach. “LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices.” In ITG Conference on Speech Communication, 2023.

LibreCat | Files available

2023 | Conference Paper | LibreCat-ID: 47129 |

Cord-Landwehr, Tobias, Christoph Boeddeker, Cătălin Zorilă, Rama Doddipatla, and Reinhold Haeb-Umbach. “A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures.” In INTERSPEECH 2023. ISCA, 2023. https://doi.org/10.21437/interspeech.2023-1379.

LibreCat | Files available | DOI

2023 | Conference Paper | LibreCat-ID: 48355 |

Rautenberg, Frederik, Michael Kuhlmann, Jana Wiechmann, Fritz Seebauer, Petra Wagner, and Reinhold Haeb-Umbach. “On Feature Importance and Interpretability of Speaker Representations.” In ITG Conference on Speech Communication, 2023.

LibreCat | Files available | Download (ext.) | arXiv

2023 | Conference Paper | LibreCat-ID: 48410 |

Wiechmann, Jana, Frederik Rautenberg, Petra Wagner, and Reinhold Haeb-Umbach. “Explaining Voice Characteristics to Novice Voice Practitioners-How Successful Is It?” In 20th International Congress of the Phonetic Sciences (ICPhS) , 2023.

LibreCat | Files available | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 48391

Aralikatti, Rohith, Christoph Boeddeker, Gordon Wichern, Aswin Subramanian, and Jonathan Le Roux. “Reverberation as Supervision For Speech Separation.” In ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023. https://doi.org/10.1109/icassp49357.2023.10095022.

LibreCat | DOI

2023 | Conference Paper | LibreCat-ID: 48390

Berger, Simon, Peter Vieting, Christoph Boeddeker, Ralf Schlüter, and Reinhold Haeb-Umbach. “Mixture Encoder for Joint Speech Separation and Recognition.” In INTERSPEECH 2023. ISCA, 2023. https://doi.org/10.21437/interspeech.2023-1815.

LibreCat | DOI

2023 | Conference Paper | LibreCat-ID: 46069

Seebauer, Fritz, Michael Kuhlmann, Reinhold Haeb-Umbach, and Petra Wagner. “Re-Examining the Quality Dimensions of Synthetic Speech.” In 12th Speech Synthesis Workshop (SSW) 2023, 2023.

LibreCat

2023 | Journal Article | LibreCat-ID: 35602 |

Neumann, Thilo von, Keisuke Kinoshita, Christoph Boeddeker, Marc Delcroix, and Reinhold Haeb-Umbach. “Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria.” IEEE/ACM Transactions on Audio, Speech, and Language Processing 31 (2023): 576–89. https://doi.org/10.1109/taslp.2022.3228629.

LibreCat | Files available | DOI

2023 | Conference Paper | LibreCat-ID: 48281 |

Neumann, Thilo von, Christoph Boeddeker, Keisuke Kinoshita, Marc Delcroix, and Reinhold Haeb-Umbach. “On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems.” In ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023. https://doi.org/10.1109/icassp49357.2023.10094784.

LibreCat | Files available | DOI | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 48275 |

Neumann, Thilo von, Christoph Boeddeker, Marc Delcroix, and Reinhold Haeb-Umbach. “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems.” In Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments, 2023.

LibreCat | Files available | Download (ext.)

2023 | Conference Paper | LibreCat-ID: 49109 |

Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks.” In Proc. Asilomar Conference on Signals, Systems, and Computers, 2023.

LibreCat | Files available

2023 | Conference Paper | LibreCat-ID: 49111

Ebbers, Janek, Reinhold Haeb-Umbach, and Romain Serizel. “Post-Processing Independent Evaluation of Sound Event Detection Systems.” In Proceedings of the 8th Detection and Classification of Acoustic Scenes and Events 2023 Workshop (DCASE2023), 36–40. Tampere, Finland, 2023.

LibreCat | Files available

2023 | Conference Paper | LibreCat-ID: 44849 |

Rautenberg, Frederik, Michael Kuhlmann, Janek Ebbers, Jana Wiechmann, Fritz Seebauer, Petra Wagner, and Reinhold Haeb-Umbach. “Speech Disentanglement for Analysis and Modification of Acoustic and Perceptual Speaker Characteristics.” In Fortschritte Der Akustik - DAGA 2023, 1409–12, 2023.

LibreCat | Files available | Download (ext.)

2022 | Journal Article | LibreCat-ID: 33669 |

Zhang, Wangyou, Xuankai Chang, Christoph Boeddeker, Tomohiro Nakatani, Shinji Watanabe, and Yanmin Qian. “End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022. https://doi.org/10.1109/TASLP.2022.3209942.

LibreCat | Files available | DOI

2022 | Conference Paper | LibreCat-ID: 33954 |

Boeddeker, Christoph, Tobias Cord-Landwehr, Thilo von Neumann, and Reinhold Haeb-Umbach. “An Initialization Scheme for Meeting Separation with Spatial Mixture Models.” In Interspeech 2022. ISCA, 2022. https://doi.org/10.21437/interspeech.2022-10929.

LibreCat | DOI | Download (ext.)

2022 | Conference Paper | LibreCat-ID: 33471

Heitkämper, Jens, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels.” In Proceedings of the 30th European Signal Processing Conference (EUSIPCO). Belgrad, n.d.

LibreCat | Files available

2022 | Conference Paper | LibreCat-ID: 33806

Afifi, Haitham, Holger Karl, Tobias Gburrek, and Joerg Schmalenstroeer. “Data-Driven Time Synchronization in Wireless Multimedia Networks.” In 2022 International Wireless Communications and Mobile Computing (IWCMC). IEEE, 2022. https://doi.org/10.1109/iwcmc55113.2022.9824980.

LibreCat | DOI

2022 | Conference Paper | LibreCat-ID: 33958

Kinoshita, Keisuke, Thilo von Neumann, Marc Delcroix, Christoph Boeddeker, and Reinhold Haeb-Umbach. “Utterance-by-Utterance Overlap-Aware Neural Diarization with Graph-PIT.” In Proc. Interspeech 2022, 1486–90. ISCA, 2022. https://doi.org/10.21437/Interspeech.2022-11408.

LibreCat | DOI

2022 | Conference Paper | LibreCat-ID: 33819 |

Neumann, Thilo von, Keisuke Kinoshita, Christoph Boeddeker, Marc Delcroix, and Reinhold Haeb-Umbach. “SA-SDR: A Novel Loss Function for Separation of Meeting Style Data.” In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022. https://doi.org/10.1109/icassp43922.2022.9746757.

LibreCat | Files available | DOI

2022 | Conference Paper | LibreCat-ID: 33847 |

Cord-Landwehr, Tobias, Thilo von Neumann, Christoph Boeddeker, and Reinhold Haeb-Umbach. “MMS-MSG: A Multi-Purpose Multi-Speaker Mixture Signal Generator.” In 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.

LibreCat | Files available | arXiv

2022 | Conference Paper | LibreCat-ID: 33848 |

Cord-Landwehr, Tobias, Christoph Boeddeker, Thilo von Neumann, Catalin Zorila, Rama Doddipatla, and Reinhold Haeb-Umbach. “Monaural Source Separation: From Anechoic to Reverberant Environments.” In 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). Bamberg: IEEE, 2022.

LibreCat | Files available | arXiv

2022 | Conference Paper | LibreCat-ID: 33807 |

Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes.” In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022. https://doi.org/10.1109/icassp43922.2022.9746284.

LibreCat | Files available | DOI

2022 | Journal Article | LibreCat-ID: 33451 |

Grimm, Christopher, Tai Fei, Ernst Warsitz, Ridha Farhoud, Tobias Breddermann, and Reinhold Haeb-Umbach. “Warping of Radar Data Into Camera Image for Cross-Modal Supervision in Automotive Applications.” IEEE Transactions on Vehicular Technology 71, no. 9 (2022): 9435–49. https://doi.org/10.1109/TVT.2022.3182411.

LibreCat | Files available | DOI

2022 | Report | LibreCat-ID: 49113

Ebbers, Janek, and Reinhold Haeb-Umbach. Pre-Training And Self-Training For Sound Event Detection In Domestic Environments, 2022.

LibreCat | Files available

2022 | Conference Paper | LibreCat-ID: 33696 |

Wiechmann, Jana, Thomas Glarner, Frederik Rautenberg, Petra Wagner, and Reinhold Haeb-Umbach. “Technically Enabled Explaining of Voice Characteristics.” In 18. Phonetik Und Phonologie Im Deutschsprachigen Raum (P&P), 2022.

LibreCat | Files available

2022 | Conference Paper | LibreCat-ID: 33857 |

Kuhlmann, Michael, Fritz Seebauer, Janek Ebbers, Petra Wagner, and Reinhold Haeb-Umbach. “Investigation into Target Speaking Rate Adaptation for Voice Conversion.” In Interspeech 2022. ISCA, 2022. https://doi.org/10.21437/interspeech.2022-10740.

LibreCat | Files available | DOI | Download (ext.)

2022 | Conference Paper | LibreCat-ID: 33808 |

Gburrek, Tobias, Joerg Schmalenstroeer, Jens Heitkaemper, and Reinhold Haeb-Umbach. “Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription.” In 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE, 2022. https://doi.org/10.1109/IWAENC53105.2022.9914772.

LibreCat | Files available | DOI

2022 | Misc | LibreCat-ID: 33816 |

Gburrek, Tobias, Christoph Boeddeker, Thilo von Neumann, Tobias Cord-Landwehr, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. arXiv, 2022. https://doi.org/10.48550/ARXIV.2205.00944.

LibreCat | Files available | DOI

2022 | Conference Paper | LibreCat-ID: 34072 |

Ebbers, Janek, Reinhold Haeb-Umbach, and Romain Serizel. “Threshold Independent Evaluation of Sound Event Detection Scores.” In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.

LibreCat | Files available

2021 | Journal Article | LibreCat-ID: 21065 |

Haeb-Umbach, Reinhold, Jahn Heymann, Lukas Drude, Shinji Watanabe, Marc Delcroix, and Tomohiro Nakatani. “Far-Field Automatic Speech Recognition.” Proceedings of the IEEE 109, no. 2 (2021): 124–48. https://doi.org/10.1109/JPROC.2020.3018668.

LibreCat | Files available | DOI

2021 | Conference Paper | LibreCat-ID: 28256

Zhang, Wangyou, Christoph Boeddeker, Shinji Watanabe, Tomohiro Nakatani, Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai, Naoyuki Kamo, Reinhold Haeb-Umbach, and Yanmin Qian. “End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend.” In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021. https://doi.org/10.1109/icassp39728.2021.9414464.

LibreCat | DOI

2021 | Conference Paper | LibreCat-ID: 28262

Li, Chenda, Jing Shi, Wangyou Zhang, Aswin Shanmugam Subramanian, Xuankai Chang, Naoyuki Kamo, Moto Hira, et al. “ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration.” In 2021 IEEE Spoken Language Technology Workshop (SLT), 2021. https://doi.org/10.1109/slt48900.2021.9383615.

LibreCat | DOI

2021 | Conference Paper | LibreCat-ID: 28261

Li, Chenda, Yi Luo, Cong Han, Jinyu Li, Takuya Yoshioka, Tianyan Zhou, Marc Delcroix, et al. “Dual-Path RNN for Long Recording Speech Separation.” In 2021 IEEE Spoken Language Technology Workshop (SLT), 2021. https://doi.org/10.1109/slt48900.2021.9383514.

LibreCat | DOI

2021 | Conference Paper | LibreCat-ID: 24000

Heitkaemper, Jens, Joerg Schmalenstroeer, Valentin Ion, and Reinhold Haeb-Umbach. “A Database for Research on Detection and Enhancement of Speech Transmitted over HF Links.” In Speech Communication; 14th ITG-Symposium, 1–5, 2021.

LibreCat

2021 | Conference Paper | LibreCat-ID: 44843 |

Boeddeker, Christoph, Frederik Rautenberg, and Reinhold Haeb-Umbach. “A Comparison and Combination of Unsupervised Blind Source Separation Techniques.” In ITG Conference on Speech Communication, 2021.

LibreCat | Files available | Download (ext.) | arXiv

2021 | Conference Paper | LibreCat-ID: 28259 |

Boeddeker, Christoph, Wangyou Zhang, Tomohiro Nakatani, Keisuke Kinoshita, Tsubasa Ochiai, Marc Delcroix, Naoyuki Kamo, Yanmin Qian, and Reinhold Haeb-Umbach. “Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation.” In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021. https://doi.org/10.1109/icassp39728.2021.9414661.

LibreCat | Files available | DOI

2021 | Conference Paper | LibreCat-ID: 23998 |

Schmalenstroeer, Joerg, Jens Heitkaemper, Joerg Ullmann, and Reinhold Haeb-Umbach. “Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech.” In 29th European Signal Processing Conference (EUSIPCO), 1–5, 2021.

LibreCat | Download (ext.)

2021 | Journal Article | LibreCat-ID: 22528 |

Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “Geometry Calibration in Wireless Acoustic Sensor Networks Utilizing DoA and Distance Information.” EURASIP Journal on Audio, Speech, and Music Processing, 2021. https://doi.org/10.1186/s13636-021-00210-x.

LibreCat | DOI | Download (ext.)

2021 | Conference Paper | LibreCat-ID: 23994 |

Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “Iterative Geometry Calibration from Distance Estimates for Wireless Acoustic Sensor Networks.” In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021. https://doi.org/10.1109/icassp39728.2021.9413831.

LibreCat | Files available | DOI

2021 | Conference Paper | LibreCat-ID: 23999 |

Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “On Source-Microphone Distance Estimation Using Convolutional Recurrent Neural Networks.” In Speech Communication; 14th ITG-Symposium, 1–5, 2021.

LibreCat | Files available

2021 | Conference Paper | LibreCat-ID: 23997 |

Chinaev, Aleksej, Gerald Enzner, Tobias Gburrek, and Joerg Schmalenstroeer. “Online Estimation of Sampling Rate Offsets in Wireless Acoustic Sensor Networks with Packet Loss.” In 29th European Signal Processing Conference (EUSIPCO), 1–5, 2021.

LibreCat | Download (ext.)

2021 | Conference Paper | LibreCat-ID: 29304 |

Ebbers, Janek, Michael Kuhlmann, Tobias Cord-Landwehr, and Reinhold Haeb-Umbach. “Contrastive Predictive Coding Supported Factorized Variational Autoencoder for Unsupervised Learning of Disentangled Speech Representations.” In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 3860–3864, 2021.

LibreCat | Files available

2021 | Conference Paper | LibreCat-ID: 26770 |

Neumann, Thilo von, Keisuke Kinoshita, Christoph Boeddeker, Marc Delcroix, and Reinhold Haeb-Umbach. “Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers.” In Interspeech 2021, 2021. https://doi.org/10.21437/interspeech.2021-1177.

LibreCat | Files available | DOI

2021 | Conference Paper | LibreCat-ID: 29173 |

Neumann, Thilo von, Christoph Boeddeker, Keisuke Kinoshita, Marc Delcroix, and Reinhold Haeb-Umbach. “Speeding Up Permutation Invariant Training for Source Separation.” In Speech Communication; 14th ITG Conference, 2021.

LibreCat | Files available

2021 | Conference Paper | LibreCat-ID: 29308 |

Ebbers, Janek, and Reinhold Haeb-Umbach. “Self-Trained Audio Tagging and Sound Event Detection in Domestic Environments.” In Proceedings of the 6th Detection and Classification of Acoustic Scenes and Events 2021 Workshop (DCASE2021), 226–230. Barcelona, Spain, 2021.

LibreCat | Files available

2021 | Conference Paper | LibreCat-ID: 29306 |

Ebbers, Janek, Moritz Curt Keyser, and Reinhold Haeb-Umbach. “Adapting Sound Recognition to A New Environment Via Self-Training.” In Proceedings of the 29th European Signal Processing Conference (EUSIPCO), 1135–1139, 2021.

LibreCat | Files available

2021 | Journal Article | LibreCat-ID: 24456 |

Rohlfing, Katharina J., Philipp Cimiano, Ingrid Scharlau, Tobias Matzner, Heike M. Buhl, Hendrik Buschmeier, Elena Esposito, et al. “Explanation as a Social Practice: Toward a Conceptual Framework for the Social Design of AI Systems.” IEEE Transactions on Cognitive and Developmental Systems 13, no. 3 (2021): 717–28. https://doi.org/10.1109/tcds.2020.3044366.

LibreCat | Files available | DOI

2020 | Conference Paper | LibreCat-ID: 17763 |

Haeb-Umbach, Reinhold. “Sprachtechnologien Für Digitale Assistenten.” In Studientexte Zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2020, edited by Ronald Böck, Ingo Siegert, and Andreas Wendemuth, 227–34. TUDpress, Dresden, 2020.

LibreCat | Download (ext.)

2020 | Conference Paper | LibreCat-ID: 20695 |

Boeddeker, Christoph, Tomohiro Nakatani, Keisuke Kinoshita, and Reinhold Haeb-Umbach. “Jointly Optimal Dereverberation and Beamforming.” In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020. https://doi.org/10.1109/icassp40776.2020.9054393.

LibreCat | Files available | DOI

2020 | Conference Paper | LibreCat-ID: 20700 |

Boeddeker, Christoph, Tobias Cord-Landwehr, Jens Heitkaemper, Catalin Zorila, Daichi Hayakawa, Mohan Li, Min Liu, Rama Doddipatla, and Reinhold Haeb-Umbach. “Towards a Speaker Diarization System for the CHiME 2020 Dinner Party Transcription.” In Proc. CHiME 2020 Workshop on Speech Processing in Everyday Environments, 2020.

LibreCat | Files available

2020 | Journal Article | LibreCat-ID: 17598 |

Nakatani, Tomohiro, Christoph Boeddeker, Keisuke Kinoshita, Rintaro Ikeshita, Marc Delcroix, and Reinhold Haeb-Umbach. “Jointly Optimal Denoising, Dereverberation, and Source Separation.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020, 1–1. https://doi.org/10.1109/TASLP.2020.3013118.

LibreCat | DOI | Download (ext.)

2020 | Conference Paper | LibreCat-ID: 20504

Heitkaemper, Jens, Darius Jakobeit, Christoph Boeddeker, Lukas Drude, and Reinhold Haeb-Umbach. “Demystifying TasNet: A Dissecting Approach.” In ICASSP 2020 Virtual Barcelona Spain, 2020.

LibreCat | Files available

2020 | Preprint | LibreCat-ID: 28263

Watanabe, Shinji, Michael Mandel, Jon Barker, Emmanuel Vincent, Ashish Arora, Xuankai Chang, Sanjeev Khudanpur, et al. “CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings.” ArXiv:2004.09249, 2020.

LibreCat

2020 | Conference Paper | LibreCat-ID: 20505

Heitkaemper, Jens, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments.” In INTERSPEECH 2020 Virtual Shanghai China, 2020.

LibreCat | Files available

2020 | Conference Paper | LibreCat-ID: 20762 |

Neumann, Thilo von, Keisuke Kinoshita, Lukas Drude, Christoph Boeddeker, Marc Delcroix, Tomohiro Nakatani, and Reinhold Haeb-Umbach. “End-to-End Training of Time Domain Audio Separation and Recognition.” In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 7004–8, 2020. https://doi.org/10.1109/ICASSP40776.2020.9053461.

LibreCat | Files available | DOI

2020 | Conference Paper | LibreCat-ID: 20764 |

Neumann, Thilo von, Christoph Boeddeker, Lukas Drude, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, and Reinhold Haeb-Umbach. “Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR.” In Proc. Interspeech 2020, 3097–3101, 2020. https://doi.org/10.21437/Interspeech.2020-2519.

LibreCat | Files available | DOI

2020 | Conference Paper | LibreCat-ID: 18651 |

Gburrek, Tobias, Joerg Schmalenstroeer, Andreas Brendel, Walter Kellermann, and Reinhold Haeb-Umbach. “Deep Neural Network Based Distance Estimation for Geometry Calibration in Acoustic Sensor Network.” In European Signal Processing Conference (EUSIPCO), 2020.

LibreCat | Files available

2020 | Conference Paper | LibreCat-ID: 20766 |

Kinoshita, Keisuke, Thilo von Neumann, Marc Delcroix, Tomohiro Nakatani, and Reinhold Haeb-Umbach. “Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and Its Application to Speaker Stream Separation.” In Proc. Interspeech 2020, 2652–56, 2020. https://doi.org/10.21437/Interspeech.2020-2388.

LibreCat | Files available | DOI

2020 | Conference Paper | LibreCat-ID: 20753 |

Ebbers, Janek, and Reinhold Haeb-Umbach. “Forward-Backward Convolutional Recurrent Neural Networks and Tag-Conditioned Convolutional Neural Networks for Weakly Labeled Semi-Supervised Sound Event Detection.” In Proceedings of the Detection and Classification of Acoustic Scenes and Events 2020 Workshop (DCASE2020), 2020.

LibreCat | Files available

2019 | Journal Article | LibreCat-ID: 17762

Haeb-Umbach, Reinhold. “Lektionen Für Alexa \& Co?!” Forschung 44, no. 1 (2019): 12–15. https://doi.org/10.1002/fors.201970104.

LibreCat | DOI

2019 | Journal Article | LibreCat-ID: 19446 |

Drude, Lukas, Jens Heitkaemper, Christoph Boeddeker, and Reinhold Haeb-Umbach. “SMS-WSJ: Database, Performance Measures, and Baseline Recipe for Multi-Channel Source Separation and Recognition.” ArXiv E-Prints, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 11965 |

Drude, Lukas, Jahn Heymann, and Reinhold Haeb-Umbach. “Unsupervised Training of Neural Mask-Based Beamforming.” In INTERSPEECH 2019, Graz, Austria, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 12874 |

Drude, Lukas, Daniel Hasenklever, and Reinhold Haeb-Umbach. “Unsupervised Training of a Deep Clustering Model for Multichannel Blind Source Separation.” In ICASSP 2019, Brighton, UK, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 12875 |

Heymann, Jahn, Lukas Drude, Reinhold Haeb-Umbach, Keisuke Kinoshita, and Tomohiro Nakatani. “Joint Optimization of Neural Network-Based WPE Dereverberation and Acoustic Model for Robust Online ASR.” In ICASSP 2019, Brighton, UK, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 12876 |

Kurz, Gerhard, Igor Gilitschenski, Florian Pfaff, Lukas Drude, Uwe D. Hanebeck, Reinhold Haeb-Umbach, and Roland Y. Siegwart. “Directional Statistics and Filtering Using LibDirectional.” In Journal of Statistical Software 89(4), 2019.

LibreCat | Files available

2019 | Journal Article | LibreCat-ID: 12890 |

Drude, Lukas, and Reinhold Haeb-Umbach. “Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation.” IEEE Journal of Selected Topics in Signal Processing, 2019. https://doi.org/10.1109/JSTSP.2019.2912565.

LibreCat | Files available | DOI

2019 | Conference Paper | LibreCat-ID: 15812 |

Heymann, Jahn, and Bo Li Khe Chai Sim. “Improving CTC Using Stimulated Learning for Sequence Modeling.” In ICASSP 2019, Brighton, UK, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 15816 |

Zorila, Catalin, Christoph Boeddeker, Rama Doddipatla, and Reinhold Haeb-Umbach. “An Investigation Into the Effectiveness of Enhancement in ASR Training and Test for Chime-5 Dinner Party Transcription.” In ASRU 2019, Sentosa, Singapore, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 14822 |

Heitkaemper, Jens, Thomas Feher, Michael Freitag, and Reinhold Haeb-Umbach. “A Study on Online Source Extraction in the Presence of Changing Speaker Positions.” In International Conference on Statistical Language and Speech Processing 2019, Ljubljana, Slovenia, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 14824 |

Martin-Donas, Juan M., Jens Heitkaemper, Reinhold Haeb-Umbach, Angel M. Gomez, and Antonio M. Peinado. “Multi-Channel Block-Online Source Extraction Based on Utterance Adaptation.” In INTERSPEECH 2019, Graz, Austria, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 14826 |

Kanda, Naoyuki, Christoph Boeddeker, Jens Heitkaemper, Yusuke Fujita, Shota Horiguchi, and Reinhold Haeb-Umbach. “Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR.” In INTERSPEECH 2019, Graz, Austria, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 13271 |

Neumann, Thilo von, Keisuke Kinoshita, Marc Delcroix, Shoko Araki, Tomohiro Nakatani, and Reinhold Haeb-Umbach. “All-Neural Online Source Separation, Counting, and Diarization for Meeting Analysis.” In ICASSP 2019, Brighton, UK, 2019.

LibreCat | Files available

2019 | Journal Article | LibreCat-ID: 15814 |

Haeb-Umbach, Reinhold, Shinji Watanabe, Tomohiro Nakatani, Michiel Bacchiani, Bjoern Hoffmeister, Michael L. Seltzer, Heiga Zen, and Mehrez Souden. “Speech Processing for Digital Home Assistance: Combining Signal Processing With Deep-Learning Techniques.” IEEE Signal Processing Magazine 36, no. 6 (2019): 111–24. https://doi.org/10.1109/MSP.2019.2918706.

LibreCat | Files available | DOI

2019 | Journal Article | LibreCat-ID: 19450 |

Haeb-Umbach, Reinhold. “Lektionen Für Alexa & Co?!” DFG Forschung 1/2019, 2019, 12–15. https://doi.org/10.1002/fors.201970104.

LibreCat | Files available | DOI

2019 | Conference Paper | LibreCat-ID: 15237 |

Gburrek, Tobias, Thomas Glarner, Janek Ebbers, Reinhold Haeb-Umbach, and Petra Wagner. “Unsupervised Learning of a Disentangled Speech Representation for Voice Conversion.” In Proc. 10th ISCA Speech Synthesis Workshop, 81–86, 2019. https://doi.org/10.21437/SSW.2019-15.

LibreCat | Files available | DOI | Download (ext.)

2019 | Conference Paper | LibreCat-ID: 15794 |

Ebbers, Janek, and Reinhold Haeb-Umbach. “Convolutional Recurrent Neural Network and Data Augmentation for Audio Tagging with Noisy Labels and Minimal Supervision.” In DCASE2019 Workshop, New York, USA, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 15796 |

Ebbers, Janek, Lukas Drude, Reinhold Haeb-Umbach, Andreas Brendel, and Walter Kellermann. “Weakly Supervised Sound Activity Detection and Event Classification in Acoustic Sensor Networks.” In CAMSAP 2019, Guadeloupe, West Indies, 2019.

LibreCat | Files available

2019 | Conference Paper | LibreCat-ID: 15792 |

Nelus, Alexandru, Janek Ebbers, Reinhold Haeb-Umbach, and Rainer Martin. “Privacy-Preserving Variational Information Feature Extraction for Domestic Activity Monitoring Versus Speaker Identification.” In INTERSPEECH 2019, Graz, Austria, 2019.

LibreCat | Files available

2018 | Conference Paper | LibreCat-ID: 18107

Heymann, Jahn, M. Bacchiani, and T. N. Sainath. “Performance of Mask Based Statistical Beamforming in a Smart Home Scenario.” In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 6722–26, 2018. https://doi.org/10.1109/ICASSP.2018.8462372.

LibreCat | DOI

2018 | Conference Paper | LibreCat-ID: 11760 |

Ebbers, Janek, Alexandru Nelus, Rainer Martin, and Reinhold Haeb-Umbach. “Evaluation of Modulation-MFCC Features and DNN Classification for Acoustic Event Detection.” In DAGA 2018, München, 2018.

LibreCat | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11835 |

Heymann, Jahn, Lukas Drude, Reinhold Haeb-Umbach, Keisuke Kinoshita, and Tomohiro Nakatani. “Frame-Online DNN-WPE Dereverberation.” In IWAENC 2018, Tokio, Japan, 2018.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11837 |

Heitkaemper, Jens, Jahn Heymann, and Reinhold Haeb-Umbach. “Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming.” In ITG 2018, Oldenburg, Germany, 2018.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11872 |

Drude, Lukas, Christoph Boeddeker, Jahn Heymann, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, and Reinhold Haeb-Umbach. “Integration Neural Network Based Beamforming and Weighted Prediction Error Dereverberation.” In INTERSPEECH 2018, Hyderabad, India, 2018.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11873 |

Drude, Lukas, Jahn Heymann, Christoph Boeddeker, and Reinhold Haeb-Umbach. “NARA-WPE: A Python Package for Weighted Prediction Error Dereverberation in Numpy and Tensorflow for Online and Offline Processing.” In ITG 2018, Oldenburg, Germany, 2018.

LibreCat | Files available | Download (ext.)

2018 | Journal Article | LibreCat-ID: 11916 |

Despotovic, Vladimir, Oliver Walter, and Reinhold Haeb-Umbach. “Machine Learning Techniques for Semantic Analysis of Dysarthric Speech: An Experimental Study.” Speech Communication 99 (2018) 242-251 (Elsevier B.V.), 2018.

LibreCat | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 12898 |

Drude, Lukas, Thilo von Neumann, and Reinhold Haeb-Umbach. “Deep Attractor Networks for Speaker Re-Identifikation and Blind Source Separation.” In ICASSP 2018, Calgary, Canada, 2018.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 12900 |

Drude, Lukas, Takuya Higuchi, Keisuke Kinoshita, Tomohiro Nakatani, and Reinhold Haeb-Umbach. “Dual Frequency- and Block-Permutation Alignment for Deep Learning Based Block-Online Blind Source Separation.” In ICASSP 2018, Calgary, Canada, 2018.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 12901 |

Boeddeker, Christoph, Hakan Erdogan, Takuya Yoshioka, and Reinhold Haeb-Umbach. “Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition.” In ICASSP 2018, Calgary, Canada, 2018.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 29923 |

Watanabe, Shinji, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, et al. “ESPnet: End-to-End Speech Processing Toolkit.” In INTERSPEECH 2018, Hyderabad, India, 2207–2211, 2018. https://doi.org/10.21437/Interspeech.2018-1456.

LibreCat | Files available | DOI

2018 | Conference Paper | LibreCat-ID: 12899 |

Boeddeker, Christoph, Jens Heitkaemper, Joerg Schmalenstroeer, Lukas Drude, Jahn Heymann, and Reinhold Haeb-Umbach. “Front-End Processing for the CHiME-5 Dinner Party Scenario.” In Proc. CHiME 2018 Workshop on Speech Processing in Everyday Environments, Hyderabad, India, 2018.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 6859

Afifi, Haitham, Joerg Schmalenstroeer, Joerg Ullmann, Reinhold Haeb-Umbach, and Holger Karl. “MARVELO - A Framework for Signal Processing in Wireless Acoustic Sensor Networks.” In Speech Communication; 13th ITG-Symposium, 1–5, 2018.

LibreCat

2018 | Conference Paper | LibreCat-ID: 11747 |

Grimm, Christopher, Tobias Breddermann, Ridha Farhoud, Tai Fei, Ernst Warsitz, and Reinhold Haeb-Umbach. “Discrimination of Stationary from Moving Targets with Recurrent Neural Networks in Automotive Radar.” In International Conference on Microwaves for Intelligent Mobility (ICMIM) 2018, 2018.

LibreCat | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11907 |

Glarner, Thomas, Patrick Hanebrink, Janek Ebbers, and Reinhold Haeb-Umbach. “Full Bayesian Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery.” In INTERSPEECH 2018, Hyderabad, India, 2018.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11838 |

Schmalenstroeer, Joerg, and Reinhold Haeb-Umbach. “Efficient Sampling Rate Offset Compensation - An Overlap-Save Based Approach.” In 26th European Signal Processing Conference (EUSIPCO 2018), 2018.

LibreCat | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11876 |

Kitza, Markus, Wilfried Michel, Christoph Boeddeker, Jens Heitkaemper, Tobias Menne, Ralf Schlüter, Hermann Ney, et al. “The RWTH/UPB System Combination for the CHiME 2018 Workshop.” In Proc. CHiME 2018 Workshop on Speech Processing in Everyday Environments, Hyderabad, India, 2018.

LibreCat | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11836 |

Ebbers, Janek, Jens Heitkaemper, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “Benchmarking Neural Network Architectures for Acoustic Sensor Networks.” In ITG 2018, Oldenburg, Germany, 2018.

LibreCat | Files available | Download (ext.)

2018 | Conference Paper | LibreCat-ID: 11839 |

Schmalenstroeer, Joerg, and Reinhold Haeb-Umbach. “Insights into the Interplay of Sampling Rate Offsets and MVDR Beamforming.” In ITG 2018, Oldenburg, Germany, 2018.

LibreCat | Download (ext.)

Publications at Paderborn University

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

317 Publications

Filters and Search Terms

Search

Filter Publications

Display / Sort

Export / Embed

Publications at Paderborn University

Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).

317 Publications

Filters and Search Terms

Search

Filter Publications

Display / Sort

Export / Embed

Export Options