---
_id: '11831'
abstract:
- lang: eng
text: 'Several self-localization algorithms have been proposed that determine
the positions of either acoustic or visual sensors autonomously. Usually these
positions are given in a modality-specific coordinate system, with an unknown
rotation, translation and scale between the different systems. For joint audiovisual
tracking, where the different modalities support each other, the two modalities
need to be mapped into a common coordinate system. In this paper we propose to
estimate this mapping based on audiovisual correlates, i.e., a speaker that can
be localized separately by both a microphone network and a camera network. The voice
is tracked by a microphone network, which first has to be calibrated by a self-localization
algorithm, and the head is tracked by a calibrated camera network. Unlike
existing Singular Value Decomposition based approaches to estimating the coordinate
system mapping, we propose to perform the estimation in the shape domain, which
turns out to be computationally more efficient. Simulations of the self-localization
of an acoustic sensor network and a subsequent coordinate mapping for joint speaker
localization showed a significant improvement in localization performance,
since the modalities were able to support each other.'
author:
- first_name: Florian
full_name: Jacob, Florian
last_name: Jacob
- first_name: Reinhold
full_name: Haeb-Umbach, Reinhold
id: '242'
last_name: Haeb-Umbach
citation:
ama: 'Jacob F, Haeb-Umbach R. Coordinate Mapping Between an Acoustic and Visual
Sensor Network in the Shape Domain for a Joint Self-Calibrating Speaker Tracking.
In: 11. ITG Fachtagung Sprachkommunikation (ITG 2014). ; 2014.'
apa: Jacob, F., & Haeb-Umbach, R. (2014). Coordinate Mapping Between an Acoustic
and Visual Sensor Network in the Shape Domain for a Joint Self-Calibrating Speaker
Tracking. In 11. ITG Fachtagung Sprachkommunikation (ITG 2014).
bibtex: '@inproceedings{Jacob_Haeb-Umbach_2014, title={Coordinate Mapping Between
an Acoustic and Visual Sensor Network in the Shape Domain for a Joint Self-Calibrating
Speaker Tracking}, booktitle={11. ITG Fachtagung Sprachkommunikation (ITG 2014)},
author={Jacob, Florian and Haeb-Umbach, Reinhold}, year={2014} }'
chicago: Jacob, Florian, and Reinhold Haeb-Umbach. “Coordinate Mapping Between an
Acoustic and Visual Sensor Network in the Shape Domain for a Joint Self-Calibrating
Speaker Tracking.” In 11. ITG Fachtagung Sprachkommunikation (ITG 2014),
2014.
ieee: F. Jacob and R. Haeb-Umbach, “Coordinate Mapping Between an Acoustic and Visual
Sensor Network in the Shape Domain for a Joint Self-Calibrating Speaker Tracking,”
in 11. ITG Fachtagung Sprachkommunikation (ITG 2014), 2014.
mla: Jacob, Florian, and Reinhold Haeb-Umbach. “Coordinate Mapping Between an Acoustic
and Visual Sensor Network in the Shape Domain for a Joint Self-Calibrating Speaker
Tracking.” 11. ITG Fachtagung Sprachkommunikation (ITG 2014), 2014.
short: 'F. Jacob, R. Haeb-Umbach, in: 11. ITG Fachtagung Sprachkommunikation (ITG
2014), 2014.'
date_created: 2019-07-12T05:29:06Z
date_updated: 2022-01-06T06:51:11Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
url: https://groups.uni-paderborn.de/nt/pubs/2014/JaHa2014.pdf
oa: '1'
publication: 11. ITG Fachtagung Sprachkommunikation (ITG 2014)
related_material:
link:
- description: Presentation
relation: supplementary_material
url: https://groups.uni-paderborn.de/nt/pubs/2014/JaHa2014_Talk.pdf
status: public
title: Coordinate Mapping Between an Acoustic and Visual Sensor Network in the Shape
Domain for a Joint Self-Calibrating Speaker Tracking
type: conference
user_id: '44006'
year: '2014'
...