Please note that LibreCat no longer supports Internet Explorer versions 8 or 9 (or earlier).
We recommend upgrading to the latest Internet Explorer, Google Chrome, or Firefox.
331 Publications
2023 | Conference Paper | LibreCat-ID: 49111
Ebbers, Janek, et al. “Post-Processing Independent Evaluation of Sound Event Detection Systems.” Proceedings of the 8th Detection and Classification of Acoustic Scenes and Events 2023 Workshop (DCASE2023), 2023, pp. 36–40.
LibreCat
| Files available
2023 | Conference Paper | LibreCat-ID: 57098
Seebauer, Fritz, et al. “DISCERNING DIMENSIONS OF QUALITY FOR STATE OF THE ART SYNTHETIC SPEECH.” Proceedings of the 20th International Congress of Phonetic Sciences, 2023.
LibreCat
2023 | Conference Paper | LibreCat-ID: 57086
Kuhlmann, Michael, et al. “Investigating Speaker Embedding Disentanglement on Natural Read Speech.” Speech Communication; 15th ITG Conference, 2023, pp. 121–125.
LibreCat
2023 | Conference Paper | LibreCat-ID: 48281 |

von Neumann, Thilo, et al. “On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems.” ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023, doi:10.1109/icassp49357.2023.10094784.
LibreCat
| Files available
| DOI
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48275 |

von Neumann, Thilo, et al. “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems.” Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments, 2023.
LibreCat
| Files available
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 47128 |

Cord-Landwehr, Tobias, et al. “Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization.” ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023, doi:10.1109/icassp49357.2023.10095370.
LibreCat
| Files available
| DOI
2023 | Conference Paper | LibreCat-ID: 47129 |

Cord-Landwehr, Tobias, et al. “A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures.” INTERSPEECH 2023, ISCA, 2023, doi:10.21437/interspeech.2023-1379.
LibreCat
| Files available
| DOI
2023 | Conference Paper | LibreCat-ID: 54439 |

Boeddeker, Christoph, et al. “Multi-Stage Diarization Refinement for the CHiME-7 DASR Scenario.” 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023), ISCA, 2023, doi:10.21437/chime.2023-10.
LibreCat
| DOI
| Download (ext.)
2023 | Conference Paper | LibreCat-ID: 48390 |

Berger, Simon, et al. “Mixture Encoder for Joint Speech Separation and Recognition.” INTERSPEECH 2023, ISCA, 2023, doi:10.21437/interspeech.2023-1815.
LibreCat
| DOI
| Download (ext.)
2022 | Journal Article | LibreCat-ID: 33669 |

Zhang, Wangyou, et al. “End-to-End Dereverberation, Beamforming, and Speech Recognition in A Cocktail Party.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022, doi:10.1109/TASLP.2022.3209942.
LibreCat
| Files available
| DOI
2022 | Conference Paper | LibreCat-ID: 33471
Heitkämper, Jens, et al. “Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels.” Proceedings of the 30th European Signal Processing Conference (EUSIPCO).
LibreCat
| Files available
2022 | Conference Paper | LibreCat-ID: 33806
Afifi, Haitham, et al. “Data-Driven Time Synchronization in Wireless Multimedia Networks.” 2022 International Wireless Communications and Mobile Computing (IWCMC), IEEE, 2022, doi:10.1109/iwcmc55113.2022.9824980.
LibreCat
| DOI
2022 | Conference Paper | LibreCat-ID: 33847 |

Cord-Landwehr, Tobias, et al. “MMS-MSG: A Multi-Purpose Multi-Speaker Mixture Signal Generator.” 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.
LibreCat
| Files available
| arXiv
2022 | Conference Paper | LibreCat-ID: 33807 |

Gburrek, Tobias, et al. “On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes.” ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2022, doi:10.1109/icassp43922.2022.9746284.
LibreCat
| Files available
| DOI
2022 | Journal Article | LibreCat-ID: 33451 |

Grimm, Christopher, et al. “Warping of Radar Data Into Camera Image for Cross-Modal Supervision in Automotive Applications.” IEEE Transactions on Vehicular Technology, vol. 71, no. 9, 2022, pp. 9435–49, doi:10.1109/TVT.2022.3182411.
LibreCat
| Files available
| DOI
2022 | Conference Paper | LibreCat-ID: 33696 |

Wiechmann, Jana, et al. “Technically Enabled Explaining of Voice Characteristics.” 18. Phonetik Und Phonologie Im Deutschsprachigen Raum (P&P), 2022.
LibreCat
| Files available
2022 | Conference Paper | LibreCat-ID: 33857 |

Kuhlmann, Michael, et al. “Investigation into Target Speaking Rate Adaptation for Voice Conversion.” Interspeech 2022, ISCA, 2022, doi:10.21437/interspeech.2022-10740.
LibreCat
| Files available
| DOI
| Download (ext.)
2022 | Conference Paper | LibreCat-ID: 33808 |

Gburrek, Tobias, et al. “Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription.” 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), IEEE, 2022, doi:10.1109/IWAENC53105.2022.9914772.
LibreCat
| Files available
| DOI
2022 | Conference Paper | LibreCat-ID: 34072 |

Ebbers, Janek, et al. “Threshold Independent Evaluation of Sound Event Detection Scores.” Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.
LibreCat
| Files available
2022 | Report | LibreCat-ID: 49113
Ebbers, Janek, and Reinhold Haeb-Umbach. Pre-Training And Self-Training For Sound Event Detection In Domestic Environments. 2022.
LibreCat
| Files available