{"date_created":"2023-10-20T08:04:46Z","status":"public","file_date_updated":"2023-10-20T08:20:58Z","abstract":[{"text":"Unsupervised speech disentanglement aims at separating fast varying from\r\nslowly varying components of a speech signal. In this contribution, we take a\r\ncloser look at the embedding vector representing the slowly varying signal\r\ncomponents, commonly named the speaker embedding vector. We ask, which\r\nproperties of a speaker's voice are captured and investigate to which extent do\r\nindividual embedding vector components sign responsible for them, using the\r\nconcept of Shapley values. Our findings show that certain speaker-specific\r\nacoustic-phonetic properties can be fairly well predicted from the speaker\r\nembedding, while the investigated more abstract voice quality features cannot.","lang":"eng"}],"external_id":{"arxiv":["2310.12599"]},"has_accepted_license":"1","file":[{"file_name":"arxiv.pdf","date_updated":"2023-10-20T08:20:58Z","content_type":"application/pdf","date_created":"2023-10-20T08:20:58Z","creator":"frra","relation":"main_file","access_level":"closed","file_size":272390,"file_id":"48359","success":1}],"ddc":["000"],"type":"conference","date_updated":"2023-11-22T13:44:33Z","_id":"48355","language":[{"iso":"eng"}],"title":"On Feature Importance and Interpretability of Speaker Representations","conference":{"end_date":"2023-09-22","location":"Aachen","start_date":"2023-09-20","name":"ITG Conference on Speech Communication"},"author":[{"id":"72602","first_name":"Frederik","last_name":"Rautenberg","full_name":"Rautenberg, Frederik"},{"first_name":"Michael","id":"49871","full_name":"Kuhlmann, Michael","last_name":"Kuhlmann"},{"first_name":"Jana","full_name":"Wiechmann, Jana","last_name":"Wiechmann"},{"first_name":"Fritz","last_name":"Seebauer","full_name":"Seebauer, Fritz"},{"full_name":"Wagner, Petra","last_name":"Wagner","first_name":"Petra"},{"full_name":"Haeb-Umbach, Reinhold","last_name":"Haeb-Umbach","id":"242","first_name":"Reinhold"}],"year":"2023","main_file_link":[{"url":"https://arxiv.org/abs/2310.12599","open_access":"1"}],"user_id":"72602","oa":"1","citation":{"ama":"Rautenberg F, Kuhlmann M, Wiechmann J, Seebauer F, Wagner P, Haeb-Umbach R. On Feature Importance and Interpretability of Speaker Representations. In: ITG Conference on Speech Communication. ; 2023.","mla":"Rautenberg, Frederik, et al. “On Feature Importance and Interpretability of Speaker Representations.” ITG Conference on Speech Communication, 2023.","bibtex":"@inproceedings{Rautenberg_Kuhlmann_Wiechmann_Seebauer_Wagner_Haeb-Umbach_2023, title={On Feature Importance and Interpretability of Speaker Representations}, booktitle={ITG Conference on Speech Communication}, author={Rautenberg, Frederik and Kuhlmann, Michael and Wiechmann, Jana and Seebauer, Fritz and Wagner, Petra and Haeb-Umbach, Reinhold}, year={2023} }","chicago":"Rautenberg, Frederik, Michael Kuhlmann, Jana Wiechmann, Fritz Seebauer, Petra Wagner, and Reinhold Haeb-Umbach. “On Feature Importance and Interpretability of Speaker Representations.” In ITG Conference on Speech Communication, 2023.","apa":"Rautenberg, F., Kuhlmann, M., Wiechmann, J., Seebauer, F., Wagner, P., & Haeb-Umbach, R. (2023). On Feature Importance and Interpretability of Speaker Representations. ITG Conference on Speech Communication. ITG Conference on Speech Communication, Aachen.","short":"F. Rautenberg, M. Kuhlmann, J. Wiechmann, F. Seebauer, P. Wagner, R. Haeb-Umbach, in: ITG Conference on Speech Communication, 2023.","ieee":"F. Rautenberg, M. Kuhlmann, J. Wiechmann, F. Seebauer, P. Wagner, and R. Haeb-Umbach, “On Feature Importance and Interpretability of Speaker Representations,” presented at the ITG Conference on Speech Communication, Aachen, 2023."},"publication":"ITG Conference on Speech Communication","project":[{"grant_number":"438445824","name":"TRR 318 - C06: TRR 318 - Technisch unterstütztes Erklären von Stimmcharakteristika (Teilprojekt C06)","_id":"129"}],"department":[{"_id":"54"},{"_id":"660"}]}