LibreCat – Publication List Manager

318 Publications

2023 | Conference Paper | LibreCat-ID: 46069

Seebauer, F., Kuhlmann, M., Haeb-Umbach, R., & Wagner, P. (2023). Re-examining the quality dimensions of synthetic speech. 12th Speech Synthesis Workshop (SSW) 2023.

2023 | Journal Article | LibreCat-ID: 35602

von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31, 576–589. https://doi.org/10.1109/taslp.2022.3228629

2023 | Conference Paper | LibreCat-ID: 48281

von Neumann, T., Boeddeker, C., Kinoshita, K., Delcroix, M., & Haeb-Umbach, R. (2023). On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp49357.2023.10094784

2023 | Conference Paper | LibreCat-ID: 48275

von Neumann, T., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2023). MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments. CHiME 2023 Workshop on Speech Processing in Everyday Environments, Dublin.

2023 | Conference Paper | LibreCat-ID: 49109

Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2023). Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks. Proc. Asilomar Conference on Signals, Systems, and Computers. 57th Asilomar Conference on Signals, Systems, and Computers.

2023 | Conference Paper | LibreCat-ID: 49111

Ebbers, J., Haeb-Umbach, R., & Serizel, R. (2023). Post-Processing Independent Evaluation of Sound Event Detection Systems. Proceedings of the 8th Detection and Classification of Acoustic Scenes and Events 2023 Workshop (DCASE2023), 36–40.

2023 | Conference Paper | LibreCat-ID: 44849

Rautenberg, F., Kuhlmann, M., Ebbers, J., Wiechmann, J., Seebauer, F., Wagner, P., & Haeb-Umbach, R. (2023). Speech Disentanglement for Analysis and Modification of Acoustic and Perceptual Speaker Characteristics. Fortschritte Der Akustik - DAGA 2023, 1409–1412.

2022 | Journal Article | LibreCat-ID: 33669

Zhang, W., Chang, X., Boeddeker, C., Nakatani, T., Watanabe, S., & Qian, Y. (2022). End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party. IEEE/ACM Transactions on Audio, Speech, and Language Processing. https://doi.org/10.1109/TASLP.2022.3209942

2022 | Conference Paper | LibreCat-ID: 33954

Boeddeker, C., Cord-Landwehr, T., von Neumann, T., & Haeb-Umbach, R. (2022). An Initialization Scheme for Meeting Separation with Spatial Mixture Models. Interspeech 2022. https://doi.org/10.21437/interspeech.2022-10929

2022 | Conference Paper | LibreCat-ID: 33471

Heitkämper, J., Schmalenstroeer, J., & Haeb-Umbach, R. (2022). Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels. Proceedings of the 30th European Signal Processing Conference (EUSIPCO). 30th European Signal Processing Conference (EUSIPCO), Belgrade.

2022 | Conference Paper | LibreCat-ID: 33806

Afifi, H., Karl, H., Gburrek, T., & Schmalenstroeer, J. (2022). Data-driven Time Synchronization in Wireless Multimedia Networks. 2022 International Wireless Communications and Mobile Computing (IWCMC). https://doi.org/10.1109/iwcmc55113.2022.9824980

2022 | Conference Paper | LibreCat-ID: 33958

Kinoshita, K., von Neumann, T., Delcroix, M., Boeddeker, C., & Haeb-Umbach, R. (2022). Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. Proc. Interspeech 2022, 1486–1490. https://doi.org/10.21437/Interspeech.2022-11408

2022 | Conference Paper | LibreCat-ID: 33819

von Neumann, T., Kinoshita, K., Boeddeker, C., Delcroix, M., & Haeb-Umbach, R. (2022). SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp43922.2022.9746757

2022 | Conference Paper | LibreCat-ID: 33847

Cord-Landwehr, T., von Neumann, T., Boeddeker, C., & Haeb-Umbach, R. (2022). MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg.

2022 | Conference Paper | LibreCat-ID: 33848

Cord-Landwehr, T., Boeddeker, C., von Neumann, T., Zorila, C., Doddipatla, R., & Haeb-Umbach, R. (2022). Monaural source separation: From anechoic to reverberant environments. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 2022 International Workshop on Acoustic Signal Enhancement (IWAENC).

2022 | Conference Paper | LibreCat-ID: 33807

Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2022). On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp43922.2022.9746284

2022 | Journal Article | LibreCat-ID: 33451

Grimm, C., Fei, T., Warsitz, E., Farhoud, R., Breddermann, T., & Haeb-Umbach, R. (2022). Warping of Radar Data Into Camera Image for Cross-Modal Supervision in Automotive Applications. IEEE Transactions on Vehicular Technology, 71(9), 9435–9449. https://doi.org/10.1109/TVT.2022.3182411

2022 | Report | LibreCat-ID: 49113

Ebbers, J., & Haeb-Umbach, R. (2022). Pre-Training And Self-Training For Sound Event Detection In Domestic Environments.

2022 | Conference Paper | LibreCat-ID: 33696

Wiechmann, J., Glarner, T., Rautenberg, F., Wagner, P., & Haeb-Umbach, R. (2022). Technically enabled explaining of voice characteristics. 18. Phonetik Und Phonologie Im Deutschsprachigen Raum (P&P).

2022 | Conference Paper | LibreCat-ID: 33857

Kuhlmann, M., Seebauer, F., Ebbers, J., Wagner, P., & Haeb-Umbach, R. (2022). Investigation into Target Speaking Rate Adaptation for Voice Conversion. Interspeech 2022. https://doi.org/10.21437/interspeech.2022-10740

Publications at Paderborn University
