On Feature Importance and Interpretability of Speaker Representations
F. Rautenberg, M. Kuhlmann, J. Wiechmann, F. Seebauer, P. Wagner, R. Haeb-Umbach, in: ITG Conference on Speech Communication, 2023.
Download
arxiv.pdf
272.39 KB
Download (ext.)
Conference Paper
| English
Author
Rautenberg, FrederikLibreCat;
Kuhlmann, MichaelLibreCat;
Wiechmann, Jana;
Seebauer, Fritz;
Wagner, Petra;
Haeb-Umbach, ReinholdLibreCat
Project
Abstract
Unsupervised speech disentanglement aims at separating fast varying from
slowly varying components of a speech signal. In this contribution, we take a
closer look at the embedding vector representing the slowly varying signal
components, commonly named the speaker embedding vector. We ask, which
properties of a speaker's voice are captured and investigate to which extent do
individual embedding vector components sign responsible for them, using the
concept of Shapley values. Our findings show that certain speaker-specific
acoustic-phonetic properties can be fairly well predicted from the speaker
embedding, while the investigated more abstract voice quality features cannot.
Publishing Year
Proceedings Title
ITG Conference on Speech Communication
Conference
ITG Conference on Speech Communication
Conference Location
Aachen
Conference Date
2023-09-20 – 2023-09-22
LibreCat-ID
Cite this
Rautenberg F, Kuhlmann M, Wiechmann J, Seebauer F, Wagner P, Haeb-Umbach R. On Feature Importance and Interpretability of Speaker Representations. In: ITG Conference on Speech Communication. ; 2023.
Rautenberg, F., Kuhlmann, M., Wiechmann, J., Seebauer, F., Wagner, P., & Haeb-Umbach, R. (2023). On Feature Importance and Interpretability of Speaker Representations. ITG Conference on Speech Communication. ITG Conference on Speech Communication, Aachen.
@inproceedings{Rautenberg_Kuhlmann_Wiechmann_Seebauer_Wagner_Haeb-Umbach_2023, title={On Feature Importance and Interpretability of Speaker Representations}, booktitle={ITG Conference on Speech Communication}, author={Rautenberg, Frederik and Kuhlmann, Michael and Wiechmann, Jana and Seebauer, Fritz and Wagner, Petra and Haeb-Umbach, Reinhold}, year={2023} }
Rautenberg, Frederik, Michael Kuhlmann, Jana Wiechmann, Fritz Seebauer, Petra Wagner, and Reinhold Haeb-Umbach. “On Feature Importance and Interpretability of Speaker Representations.” In ITG Conference on Speech Communication, 2023.
F. Rautenberg, M. Kuhlmann, J. Wiechmann, F. Seebauer, P. Wagner, and R. Haeb-Umbach, “On Feature Importance and Interpretability of Speaker Representations,” presented at the ITG Conference on Speech Communication, Aachen, 2023.
Rautenberg, Frederik, et al. “On Feature Importance and Interpretability of Speaker Representations.” ITG Conference on Speech Communication, 2023.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Main File(s)
File Name
arxiv.pdf
272.39 KB
Access Level
Closed Access
Last Uploaded
2023-10-20T08:20:58Z
Link(s) to Main File(s)
Access Level
Closed Access