Coordinate Mapping Between an Acoustic and Visual Sensor Network in the Shape Domain for a Joint Self-Calibrating Speaker Tracking

F. Jacob, R. Haeb-Umbach, in: 11. ITG Fachtagung Sprachkommunikation (ITG 2014), 2014.

Conference Paper | English
Author
Florian Jacob, Reinhold Haeb-Umbach
Abstract
"Several self-localization algorithms have been proposed that determine the positions of either acoustic or visual sensors autonomously. Usually these positions are given in a modality-specific coordinate system, with an unknown rotation, translation and scale between the different systems. For joint audiovisual tracking, where the different modalities support each other, the two modalities need to be mapped into a common coordinate system. In this paper we propose to estimate this mapping based on audiovisual correlates, i.e., a speaker that can be localized separately by both a microphone network and a camera network. The voice is tracked by a microphone network, which must first be calibrated by a self-localization algorithm, and the head is tracked by a calibrated camera network. Unlike existing Singular Value Decomposition-based approaches to estimating the coordinate system mapping, we propose to perform the estimation in the shape domain, which turns out to be computationally more efficient. Simulations of the self-localization of an acoustic sensor network and a subsequent coordinate mapping for joint speaker localization showed a significant improvement in localization performance, since the modalities were able to support each other."
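The mapping described in the abstract (an unknown rotation, translation and scale between two coordinate systems, estimated from a speaker track seen by both networks) can be illustrated with a minimal sketch. It assumes the planar (2D) case, with positions written as complex numbers, and uses a generic closed-form least-squares similarity fit. It is not taken from the paper and should not be read as the authors' shape-domain estimator or as the SVD-based baseline; the function names and the example track are hypothetical.

```python
import cmath
import math


def fit_similarity_2d(src, dst):
    """Least-squares fit of dst ~ a * src + b for paired 2D points.

    Points are complex numbers (x + 1j * y). The complex factor `a`
    encodes scale (abs(a)) and rotation (phase of a); `b` is the
    translation. Reflections cannot be represented by this model.
    """
    if len(src) != len(dst) or len(src) < 2:
        raise ValueError("need at least two paired points")
    n = len(src)
    src_mean = sum(src) / n
    dst_mean = sum(dst) / n
    # Centering removes the translation from the problem.
    src_c = [z - src_mean for z in src]
    dst_c = [w - dst_mean for w in dst]
    denom = sum(abs(z) ** 2 for z in src_c)
    if denom == 0:
        raise ValueError("source points are all identical")
    # Minimizer of sum |w - a * z|^2 over the complex scalar a.
    a = sum(z.conjugate() * w for z, w in zip(src_c, dst_c)) / denom
    b = dst_mean - a * src_mean
    return a, b


def apply_similarity_2d(a, b, points):
    """Map points from the source into the destination coordinate system."""
    return [a * z + b for z in points]


# Hypothetical speaker track in the acoustic coordinate system.
acoustic_track = [0 + 0j, 1 + 0j, 1 + 2j, -1 + 1.5j, 0.5 - 1j]

# Assumed ground-truth mapping, used only to synthesize the visual track:
# scale 2, rotation 30 degrees, translation (1, -0.5).
true_a = cmath.rect(2.0, math.radians(30.0))
true_b = 1.0 - 0.5j
visual_track = [true_a * z + true_b for z in acoustic_track]

a, b = fit_similarity_2d(acoustic_track, visual_track)
scale = abs(a)
rotation_deg = math.degrees(cmath.phase(a))
mapped_track = apply_similarity_2d(a, b, acoustic_track)
```

With noise-free correspondences as in this toy track, the fit recovers the assumed scale, rotation and translation up to floating-point error. With real trajectories from the two networks the same formula gives the least-squares estimate under this 2D model.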
Publishing Year
2014
Proceedings Title
11. ITG Fachtagung Sprachkommunikation (ITG 2014)

Cite this

Jacob F, Haeb-Umbach R. Coordinate Mapping Between an Acoustic and Visual Sensor Network in the Shape Domain for a Joint Self-Calibrating Speaker Tracking. In: 11. ITG Fachtagung Sprachkommunikation (ITG 2014); 2014.
Jacob, F., & Haeb-Umbach, R. (2014). Coordinate Mapping Between an Acoustic and Visual Sensor Network in the Shape Domain for a Joint Self-Calibrating Speaker Tracking. In 11. ITG Fachtagung Sprachkommunikation (ITG 2014).
@inproceedings{Jacob_Haeb-Umbach_2014, title={Coordinate Mapping Between an Acoustic and Visual Sensor Network in the Shape Domain for a Joint Self-Calibrating Speaker Tracking}, booktitle={11. ITG Fachtagung Sprachkommunikation (ITG 2014)}, author={Jacob, Florian and Haeb-Umbach, Reinhold}, year={2014} }
Jacob, Florian, and Reinhold Haeb-Umbach. “Coordinate Mapping Between an Acoustic and Visual Sensor Network in the Shape Domain for a Joint Self-Calibrating Speaker Tracking.” In 11. ITG Fachtagung Sprachkommunikation (ITG 2014), 2014.
F. Jacob and R. Haeb-Umbach, “Coordinate Mapping Between an Acoustic and Visual Sensor Network in the Shape Domain for a Joint Self-Calibrating Speaker Tracking,” in 11. ITG Fachtagung Sprachkommunikation (ITG 2014), 2014.
Jacob, Florian, and Reinhold Haeb-Umbach. “Coordinate Mapping Between an Acoustic and Visual Sensor Network in the Shape Domain for a Joint Self-Calibrating Speaker Tracking.” 11. ITG Fachtagung Sprachkommunikation (ITG 2014), 2014.

Link(s) to Main File(s)
Access Level
Restricted Closed Access
External material:
Supplementary Material
Description
Presentation
