{"publication":"ITG Conference on Speech Communication","year":"2023","file":[{"success":1,"relation":"main_file","date_created":"2023-10-20T08:20:58Z","access_level":"closed","file_name":"arxiv.pdf","content_type":"application/pdf","creator":"frra","file_size":272390,"file_id":"48359","date_updated":"2023-10-20T08:20:58Z"}],"project":[{"grant_number":"438445824","_id":"129","name":"TRR 318 - C06: TRR 318 - Technisch unterstütztes Erklären von Stimmcharakteristika (Teilprojekt C06)"}],"status":"public","date_created":"2023-10-20T08:04:46Z","abstract":[{"lang":"eng","text":"Unsupervised speech disentanglement aims at separating fast-varying from slowly varying components of a speech signal. In this contribution, we take a closer look at the embedding vector representing the slowly varying signal components, commonly named the speaker embedding vector. We ask which properties of a speaker's voice are captured and investigate to what extent individual embedding vector components are responsible for them, using the concept of Shapley values. 
Our findings show that certain speaker-specific acoustic-phonetic properties can be fairly well predicted from the speaker embedding, while the more abstract voice quality features investigated cannot."}],"main_file_link":[{"url":"https://arxiv.org/abs/2310.12599","open_access":"1"}],"has_accepted_license":"1","ddc":["000"],"language":[{"iso":"eng"}],"type":"conference","file_date_updated":"2023-10-20T08:20:58Z","user_id":"72602","department":[{"_id":"54"},{"_id":"660"}],"_id":"48355","external_id":{"arxiv":["2310.12599"]},"author":[{"full_name":"Rautenberg, Frederik","first_name":"Frederik","last_name":"Rautenberg","id":"72602"},{"full_name":"Kuhlmann, Michael","last_name":"Kuhlmann","first_name":"Michael","id":"49871"},{"full_name":"Wiechmann, Jana","first_name":"Jana","last_name":"Wiechmann"},{"full_name":"Seebauer, Fritz","first_name":"Fritz","last_name":"Seebauer"},{"last_name":"Wagner","first_name":"Petra","full_name":"Wagner, Petra"},{"id":"242","first_name":"Reinhold","last_name":"Haeb-Umbach","full_name":"Haeb-Umbach, Reinhold"}],"citation":{"mla":"Rautenberg, Frederik, et al. “On Feature Importance and Interpretability of Speaker Representations.” ITG Conference on Speech Communication, 2023.","ieee":"F. Rautenberg, M. Kuhlmann, J. Wiechmann, F. Seebauer, P. Wagner, and R. Haeb-Umbach, “On Feature Importance and Interpretability of Speaker Representations,” presented at the ITG Conference on Speech Communication, Aachen, 2023.","ama":"Rautenberg F, Kuhlmann M, Wiechmann J, Seebauer F, Wagner P, Haeb-Umbach R. On Feature Importance and Interpretability of Speaker Representations. In: ITG Conference on Speech Communication. ; 2023.","apa":"Rautenberg, F., Kuhlmann, M., Wiechmann, J., Seebauer, F., Wagner, P., & Haeb-Umbach, R. (2023). On Feature Importance and Interpretability of Speaker Representations. ITG Conference on Speech Communication. ITG Conference on Speech Communication, Aachen.","short":"F. Rautenberg, M. Kuhlmann, J. Wiechmann, F. 
Seebauer, P. Wagner, R. Haeb-Umbach, in: ITG Conference on Speech Communication, 2023.","chicago":"Rautenberg, Frederik, Michael Kuhlmann, Jana Wiechmann, Fritz Seebauer, Petra Wagner, and Reinhold Haeb-Umbach. “On Feature Importance and Interpretability of Speaker Representations.” In ITG Conference on Speech Communication, 2023.","bibtex":"@inproceedings{Rautenberg_Kuhlmann_Wiechmann_Seebauer_Wagner_Haeb-Umbach_2023, title={On Feature Importance and Interpretability of Speaker Representations}, booktitle={ITG Conference on Speech Communication}, author={Rautenberg, Frederik and Kuhlmann, Michael and Wiechmann, Jana and Seebauer, Fritz and Wagner, Petra and Haeb-Umbach, Reinhold}, year={2023} }"},"title":"On Feature Importance and Interpretability of Speaker Representations","conference":{"name":"ITG Conference on Speech Communication","location":"Aachen","end_date":"2023-09-22","start_date":"2023-09-20"},"oa":"1","date_updated":"2023-11-22T13:44:33Z"}