42 Publications
2024 | Journal Article | LibreCat-ID: 52958
Boeddeker, Christoph, et al. “TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, Institute of Electrical and Electronics Engineers (IEEE), 2024, pp. 1185–97, doi:10.1109/taslp.2024.3350887.
2024 | Conference Paper | LibreCat-ID: 53659
Cord-Landwehr, Tobias, et al. “Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios.” ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2024, doi:10.1109/icassp48485.2024.10445911.
2023 | Conference Paper | LibreCat-ID: 47128
Cord-Landwehr, Tobias, et al. “Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization.” ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023, doi:10.1109/icassp49357.2023.10095370.
2023 | Conference Paper | LibreCat-ID: 47129
Cord-Landwehr, Tobias, et al. “A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures.” INTERSPEECH 2023, ISCA, 2023, doi:10.21437/interspeech.2023-1379.
2023 | Conference Paper | LibreCat-ID: 48391
Aralikatti, Rohith, et al. “Reverberation as Supervision for Speech Separation.” ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023, doi:10.1109/icassp49357.2023.10095022.
2023 | Conference Paper | LibreCat-ID: 48390
Berger, Simon, et al. “Mixture Encoder for Joint Speech Separation and Recognition.” INTERSPEECH 2023, ISCA, 2023, doi:10.21437/interspeech.2023-1815.
2023 | Journal Article | LibreCat-ID: 35602
von Neumann, Thilo, et al. “Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, Institute of Electrical and Electronics Engineers (IEEE), 2023, pp. 576–89, doi:10.1109/taslp.2022.3228629.
2023 | Conference Paper | LibreCat-ID: 48281
von Neumann, Thilo, et al. “On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems.” ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023, doi:10.1109/icassp49357.2023.10094784.
2023 | Conference Paper | LibreCat-ID: 48275
von Neumann, Thilo, et al. “MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems.” Proc. CHiME 2023 Workshop on Speech Processing in Everyday Environments, 2023.
2022 | Journal Article | LibreCat-ID: 33669
Zhang, Wangyou, et al. “End-to-End Dereverberation, Beamforming, and Speech Recognition in a Cocktail Party.” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022, doi:10.1109/TASLP.2022.3209942.