{"language":[{"iso":"eng"}],"has_accepted_license":"1","type":"conference","department":[{"_id":"54"}],"_id":"61079","ddc":["000"],"abstract":[{"lang":"eng","text":"We propose a spatio-spectral, combined model-based and data-driven\r\ndiarization pipeline consisting of TDOA-based segmentation followed by\r\nembedding-based clustering. The proposed system requires neither access to\r\nmulti-channel training data nor prior knowledge about the number or placement\r\nof microphones. It works for both a compact microphone array and distributed\r\nmicrophones, with minor adjustments. Due to its superior handling of\r\noverlapping speech during segmentation, the proposed pipeline significantly\r\noutperforms the single-channel pyannote approach, both in a scenario with a\r\ncompact microphone array and in a setup with distributed microphones.\r\nAdditionally, we show that, unlike fully spatial diarization pipelines, the\r\nproposed system can correctly track speakers when they change positions."}],"year":"2025","title":"Spatio-spectral diarization of meetings by combining TDOA-based segmentation and speaker embedding-based clustering","citation":{"bibtex":"@inproceedings{Cord-Landwehr_Gburrek_Deegen_Haeb-Umbach_2025, title={Spatio-spectral diarization of meetings by combining TDOA-based segmentation and speaker embedding-based clustering}, DOI={10.21437/Interspeech.2025-1663}, booktitle={Proceedings of INTERSPEECH}, author={Cord-Landwehr, Tobias and Gburrek, Tobias and Deegen, Marc and Haeb-Umbach, Reinhold}, year={2025} }","mla":"Cord-Landwehr, Tobias, et al. “Spatio-Spectral Diarization of Meetings by Combining TDOA-Based Segmentation and Speaker Embedding-Based Clustering.” Proceedings of INTERSPEECH, 2025, doi:10.21437/Interspeech.2025-1663.","ama":"Cord-Landwehr T, Gburrek T, Deegen M, Haeb-Umbach R. Spatio-spectral diarization of meetings by combining TDOA-based segmentation and speaker embedding-based clustering. In: Proceedings of INTERSPEECH. ; 2025. doi:10.21437/Interspeech.2025-1663","chicago":"Cord-Landwehr, Tobias, Tobias Gburrek, Marc Deegen, and Reinhold Haeb-Umbach. “Spatio-Spectral Diarization of Meetings by Combining TDOA-Based Segmentation and Speaker Embedding-Based Clustering.” In Proceedings of INTERSPEECH, 2025. https://doi.org/10.21437/Interspeech.2025-1663.","apa":"Cord-Landwehr, T., Gburrek, T., Deegen, M., & Haeb-Umbach, R. (2025). Spatio-spectral diarization of meetings by combining TDOA-based segmentation and speaker embedding-based clustering. Proceedings of INTERSPEECH. Interspeech 2025, Rotterdam. https://doi.org/10.21437/Interspeech.2025-1663","short":"T. Cord-Landwehr, T. Gburrek, M. Deegen, R. Haeb-Umbach, in: Proceedings of INTERSPEECH, 2025.","ieee":"T. Cord-Landwehr, T. Gburrek, M. Deegen, and R. Haeb-Umbach, “Spatio-spectral diarization of meetings by combining TDOA-based segmentation and speaker embedding-based clustering,” presented at the Interspeech 2025, Rotterdam, 2025, doi: 10.21437/Interspeech.2025-1663."},"status":"public","date_created":"2025-08-29T09:39:01Z","file":[{"creator":"cord","access_level":"open_access","relation":"main_file","content_type":"application/pdf","date_created":"2025-08-29T09:43:32Z","file_name":"main.pdf","file_id":"61085","file_size":921918,"date_updated":"2025-08-29T09:43:32Z"}],"oa":"1","conference":{"location":"Rotterdam","name":"Interspeech 2025"},"file_date_updated":"2025-08-29T09:43:32Z","external_id":{"arxiv":["2506.16228"]},"doi":"10.21437/Interspeech.2025-1663","publication":"Proceedings of INTERSPEECH","author":[{"id":"44393","last_name":"Cord-Landwehr","full_name":"Cord-Landwehr, Tobias","first_name":"Tobias"},{"full_name":"Gburrek, Tobias","last_name":"Gburrek","id":"44006","first_name":"Tobias"},{"first_name":"Marc","last_name":"Deegen","full_name":"Deegen, Marc"},{"id":"242","last_name":"Haeb-Umbach","full_name":"Haeb-Umbach, Reinhold","first_name":"Reinhold"}],"user_id":"44393","project":[{"_id":"52","name":"Computing Resources Provided by the Paderborn Center for Parallel Computing"}],"date_updated":"2025-08-29T09:46:27Z"}