---
_id: '48269'
author:
- first_name: Tobias
full_name: Gburrek, Tobias
id: '44006'
last_name: Gburrek
- first_name: Joerg
full_name: Schmalenstroeer, Joerg
id: '460'
last_name: Schmalenstroeer
- first_name: Reinhold
full_name: Haeb-Umbach, Reinhold
id: '242'
last_name: Haeb-Umbach
citation:
ama: 'Gburrek T, Schmalenstroeer J, Haeb-Umbach R. On the Integration of Sampling
Rate Synchronization and Acoustic Beamforming. In: European Signal Processing
Conference (EUSIPCO). 2023.'
apa: Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2023). On the Integration
of Sampling Rate Synchronization and Acoustic Beamforming. European Signal
Processing Conference (EUSIPCO). European Signal Processing Conference (EUSIPCO),
Helsinki.
bibtex: '@inproceedings{Gburrek_Schmalenstroeer_Haeb-Umbach_2023, title={On the
Integration of Sampling Rate Synchronization and Acoustic Beamforming}, booktitle={European
Signal Processing Conference (EUSIPCO)}, author={Gburrek, Tobias and Schmalenstroeer,
Joerg and Haeb-Umbach, Reinhold}, year={2023} }'
chicago: Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “On the
Integration of Sampling Rate Synchronization and Acoustic Beamforming.” In European
Signal Processing Conference (EUSIPCO), 2023.
ieee: T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “On the Integration of
Sampling Rate Synchronization and Acoustic Beamforming,” presented at the European
Signal Processing Conference (EUSIPCO), Helsinki, 2023.
mla: Gburrek, Tobias, et al. “On the Integration of Sampling Rate Synchronization
and Acoustic Beamforming.” European Signal Processing Conference (EUSIPCO),
2023.
short: 'T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, in: European Signal Processing
Conference (EUSIPCO), 2023.'
conference:
location: Helsinki
name: European Signal Processing Conference (EUSIPCO)
date_created: 2023-10-18T12:54:08Z
date_updated: 2023-10-26T08:16:23Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
url: https://eurasip.org/Proceedings/Eusipco/Eusipco2023/pdfs/0000011.pdf
oa: '1'
publication: European Signal Processing Conference (EUSIPCO)
quality_controlled: '1'
status: public
title: On the Integration of Sampling Rate Synchronization and Acoustic Beamforming
type: conference
user_id: '460'
year: '2023'
...
---
_id: '48270'
author:
- first_name: Joerg
full_name: Schmalenstroeer, Joerg
id: '460'
last_name: Schmalenstroeer
- first_name: Tobias
full_name: Gburrek, Tobias
id: '44006'
last_name: Gburrek
- first_name: Reinhold
full_name: Haeb-Umbach, Reinhold
id: '242'
last_name: Haeb-Umbach
citation:
ama: 'Schmalenstroeer J, Gburrek T, Haeb-Umbach R. LibriWASN: A Data Set for Meeting
Separation, Diarization, and Recognition with Asynchronous Recording Devices.
In: ITG Conference on Speech Communication. 2023.'
apa: 'Schmalenstroeer, J., Gburrek, T., & Haeb-Umbach, R. (2023). LibriWASN:
A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous
Recording Devices. ITG Conference on Speech Communication. ITG Conference
on Speech Communication, Aachen.'
bibtex: '@inproceedings{Schmalenstroeer_Gburrek_Haeb-Umbach_2023, title={LibriWASN:
A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous
Recording Devices}, booktitle={ITG Conference on Speech Communication}, author={Schmalenstroeer,
Joerg and Gburrek, Tobias and Haeb-Umbach, Reinhold}, year={2023} }'
chicago: 'Schmalenstroeer, Joerg, Tobias Gburrek, and Reinhold Haeb-Umbach. “LibriWASN:
A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous
Recording Devices.” In ITG Conference on Speech Communication, 2023.'
ieee: 'J. Schmalenstroeer, T. Gburrek, and R. Haeb-Umbach, “LibriWASN: A Data Set
for Meeting Separation, Diarization, and Recognition with Asynchronous Recording
Devices,” presented at the ITG Conference on Speech Communication, Aachen, 2023.'
mla: 'Schmalenstroeer, Joerg, et al. “LibriWASN: A Data Set for Meeting Separation,
Diarization, and Recognition with Asynchronous Recording Devices.” ITG Conference
on Speech Communication, 2023.'
short: 'J. Schmalenstroeer, T. Gburrek, R. Haeb-Umbach, in: ITG Conference on Speech
Communication, 2023.'
conference:
location: Aachen
name: ITG Conference on Speech Communication
date_created: 2023-10-18T13:00:54Z
date_updated: 2023-11-15T14:48:45Z
ddc:
- '004'
department:
- _id: '54'
file:
- access_level: open_access
content_type: application/pdf
creator: schmalen
date_created: 2023-10-26T08:20:15Z
date_updated: 2023-11-15T14:48:44Z
file_id: '48483'
file_name: SchTgbHaeb2023Final.pdf
file_size: 2844502
relation: main_file
file_date_updated: 2023-11-15T14:48:44Z
has_accepted_license: '1'
language:
- iso: eng
oa: '1'
publication: ITG Conference on Speech Communication
quality_controlled: '1'
status: public
title: 'LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition
with Asynchronous Recording Devices'
type: conference
user_id: '460'
year: '2023'
...
---
_id: '49109'
abstract:
- lang: eng
text: "We propose a diarization system, that estimates “who spoke when” based on
spatial information, to be used as a front-end of a meeting transcription system
running on the signals gathered from an acoustic sensor network (ASN). Although
the spatial distribution of the microphones is advantageous, exploiting the
spatial diversity for diarization and signal enhancement is challenging, because
the microphones’ positions are typically unknown, and the recorded signals are
initially unsynchronized in general. Here, we approach these issues by first blindly
synchronizing the signals and then estimating time differences of arrival (TDOAs).
The TDOA information is exploited to estimate the speakers’ activity, even in
the presence of multiple speakers being simultaneously active. This speaker activity
information serves as a guide for a spatial mixture model, on which basis the
individual speaker’s signals are extracted via beamforming. Finally, the extracted
signals are forwarded to a speech recognizer. Additionally, a novel initialization
scheme for spatial mixture models based on the TDOA estimates is proposed. Experiments
conducted on real recordings from the LibriWASN data set have shown that our proposed
system is advantageous compared to a system using a spatial mixture model, which
does not make use of external diarization information."
author:
- first_name: Tobias
full_name: Gburrek, Tobias
id: '44006'
last_name: Gburrek
- first_name: Joerg
full_name: Schmalenstroeer, Joerg
id: '460'
last_name: Schmalenstroeer
- first_name: Reinhold
full_name: Haeb-Umbach, Reinhold
id: '242'
last_name: Haeb-Umbach
citation:
ama: 'Gburrek T, Schmalenstroeer J, Haeb-Umbach R. Spatial Diarization for Meeting
Transcription with Ad-Hoc Acoustic Sensor Networks. In: Proc. Asilomar Conference
on Signals, Systems, and Computers. 2023.'
apa: Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2023). Spatial Diarization
for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks. Proc. Asilomar
Conference on Signals, Systems, and Computers. 57th Asilomar Conference on
Signals, Systems, and Computers.
bibtex: '@inproceedings{Gburrek_Schmalenstroeer_Haeb-Umbach_2023, title={Spatial
Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks}, booktitle={Proc.
Asilomar Conference on Signals, Systems, and Computers}, author={Gburrek, Tobias
and Schmalenstroeer, Joerg and Haeb-Umbach, Reinhold}, year={2023} }'
chicago: Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “Spatial
Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks.” In
Proc. Asilomar Conference on Signals, Systems, and Computers, 2023.
ieee: T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “Spatial Diarization for
Meeting Transcription with Ad-Hoc Acoustic Sensor Networks,” presented at the
57th Asilomar Conference on Signals, Systems, and Computers, 2023.
mla: Gburrek, Tobias, et al. “Spatial Diarization for Meeting Transcription with
Ad-Hoc Acoustic Sensor Networks.” Proc. Asilomar Conference on Signals, Systems,
and Computers, 2023.
short: 'T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, in: Proc. Asilomar Conference
on Signals, Systems, and Computers, 2023.'
conference:
end_date: 2023-11-01
name: 57th Asilomar Conference on Signals, Systems, and Computers
start_date: 2023-10-31
date_created: 2023-11-22T07:52:29Z
date_updated: 2023-11-22T07:58:49Z
ddc:
- '004'
department:
- _id: '54'
file:
- access_level: open_access
content_type: application/pdf
creator: schmalen
date_created: 2023-11-22T07:51:18Z
date_updated: 2023-11-22T07:58:49Z
file_id: '49110'
file_name: asilomar.pdf
file_size: 212317
relation: main_file
file_date_updated: 2023-11-22T07:58:49Z
has_accepted_license: '1'
keyword:
- Diarization
- time difference of arrival
- ad-hoc acoustic sensor network
- meeting transcription
language:
- iso: eng
oa: '1'
publication: Proc. Asilomar Conference on Signals, Systems, and Computers
quality_controlled: '1'
status: public
title: Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks
type: conference
user_id: '460'
year: '2023'
...
---
_id: '33806'
author:
- first_name: Haitham
full_name: Afifi, Haitham
last_name: Afifi
- first_name: Holger
full_name: Karl, Holger
last_name: Karl
- first_name: Tobias
full_name: Gburrek, Tobias
id: '44006'
last_name: Gburrek
- first_name: Joerg
full_name: Schmalenstroeer, Joerg
id: '460'
last_name: Schmalenstroeer
citation:
ama: 'Afifi H, Karl H, Gburrek T, Schmalenstroeer J. Data-driven Time Synchronization
in Wireless Multimedia Networks. In: 2022 International Wireless Communications
and Mobile Computing (IWCMC). IEEE; 2022. doi:10.1109/iwcmc55113.2022.9824980'
apa: Afifi, H., Karl, H., Gburrek, T., & Schmalenstroeer, J. (2022). Data-driven
Time Synchronization in Wireless Multimedia Networks. 2022 International Wireless
Communications and Mobile Computing (IWCMC). https://doi.org/10.1109/iwcmc55113.2022.9824980
bibtex: '@inproceedings{Afifi_Karl_Gburrek_Schmalenstroeer_2022, title={Data-driven
Time Synchronization in Wireless Multimedia Networks}, DOI={10.1109/iwcmc55113.2022.9824980},
booktitle={2022 International Wireless Communications and Mobile Computing (IWCMC)},
publisher={IEEE}, author={Afifi, Haitham and Karl, Holger and Gburrek, Tobias
and Schmalenstroeer, Joerg}, year={2022} }'
chicago: Afifi, Haitham, Holger Karl, Tobias Gburrek, and Joerg Schmalenstroeer.
“Data-Driven Time Synchronization in Wireless Multimedia Networks.” In 2022
International Wireless Communications and Mobile Computing (IWCMC). IEEE,
2022. https://doi.org/10.1109/iwcmc55113.2022.9824980.
ieee: 'H. Afifi, H. Karl, T. Gburrek, and J. Schmalenstroeer, “Data-driven Time
Synchronization in Wireless Multimedia Networks,” 2022, doi: 10.1109/iwcmc55113.2022.9824980.'
mla: Afifi, Haitham, et al. “Data-Driven Time Synchronization in Wireless Multimedia
Networks.” 2022 International Wireless Communications and Mobile Computing
(IWCMC), IEEE, 2022, doi:10.1109/iwcmc55113.2022.9824980.
short: 'H. Afifi, H. Karl, T. Gburrek, J. Schmalenstroeer, in: 2022 International
Wireless Communications and Mobile Computing (IWCMC), IEEE, 2022.'
date_created: 2022-10-18T09:24:17Z
date_updated: 2023-10-26T08:16:07Z
department:
- _id: '54'
doi: 10.1109/iwcmc55113.2022.9824980
language:
- iso: eng
publication: 2022 International Wireless Communications and Mobile Computing (IWCMC)
publication_status: published
publisher: IEEE
quality_controlled: '1'
status: public
title: Data-driven Time Synchronization in Wireless Multimedia Networks
type: conference
user_id: '460'
year: '2022'
...
---
_id: '33807'
author:
- first_name: Tobias
full_name: Gburrek, Tobias
id: '44006'
last_name: Gburrek
- first_name: Joerg
full_name: Schmalenstroeer, Joerg
id: '460'
last_name: Schmalenstroeer
- first_name: Reinhold
full_name: Haeb-Umbach, Reinhold
id: '242'
last_name: Haeb-Umbach
citation:
ama: 'Gburrek T, Schmalenstroeer J, Haeb-Umbach R. On Synchronization of Wireless
Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets
and Speaker Changes. In: ICASSP 2022 - 2022 IEEE International Conference on
Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2022. doi:10.1109/icassp43922.2022.9746284'
apa: Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2022). On Synchronization
of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling
Rate Offsets and Speaker Changes. ICASSP 2022 - 2022 IEEE International Conference
on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp43922.2022.9746284
bibtex: '@inproceedings{Gburrek_Schmalenstroeer_Haeb-Umbach_2022, title={On Synchronization
of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling
Rate Offsets and Speaker Changes}, DOI={10.1109/icassp43922.2022.9746284},
booktitle={ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech
and Signal Processing (ICASSP)}, publisher={IEEE}, author={Gburrek, Tobias and
Schmalenstroeer, Joerg and Haeb-Umbach, Reinhold}, year={2022} }'
chicago: Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “On Synchronization
of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling
Rate Offsets and Speaker Changes.” In ICASSP 2022 - 2022 IEEE International
Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022.
https://doi.org/10.1109/icassp43922.2022.9746284.
ieee: 'T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “On Synchronization of
Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate
Offsets and Speaker Changes,” 2022, doi: 10.1109/icassp43922.2022.9746284.'
mla: Gburrek, Tobias, et al. “On Synchronization of Wireless Acoustic Sensor Networks
in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes.” ICASSP
2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing
(ICASSP), IEEE, 2022, doi:10.1109/icassp43922.2022.9746284.
short: 'T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, in: ICASSP 2022 - 2022 IEEE
International Conference on Acoustics, Speech and Signal Processing (ICASSP),
IEEE, 2022.'
date_created: 2022-10-18T09:25:51Z
date_updated: 2023-11-17T06:39:28Z
ddc:
- '004'
department:
- _id: '54'
doi: 10.1109/icassp43922.2022.9746284
file:
- access_level: open_access
content_type: application/pdf
creator: tgburrek
date_created: 2023-11-17T06:39:04Z
date_updated: 2023-11-17T06:39:04Z
file_id: '48990'
file_name: gburrek_icassp22.pdf
file_size: 358015
relation: main_file
file_date_updated: 2023-11-17T06:39:04Z
has_accepted_license: '1'
language:
- iso: eng
oa: '1'
publication: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech
and Signal Processing (ICASSP)
publication_status: published
publisher: IEEE
quality_controlled: '1'
status: public
title: On Synchronization of Wireless Acoustic Sensor Networks in the Presence of
Time-Varying Sampling Rate Offsets and Speaker Changes
type: conference
user_id: '44006'
year: '2022'
...
---
_id: '33808'
author:
- first_name: Tobias
full_name: Gburrek, Tobias
id: '44006'
last_name: Gburrek
- first_name: Joerg
full_name: Schmalenstroeer, Joerg
id: '460'
last_name: Schmalenstroeer
- first_name: Jens
full_name: Heitkaemper, Jens
id: '27643'
last_name: Heitkaemper
- first_name: Reinhold
full_name: Haeb-Umbach, Reinhold
id: '242'
last_name: Haeb-Umbach
citation:
ama: 'Gburrek T, Schmalenstroeer J, Heitkaemper J, Haeb-Umbach R. Informed vs. Blind
Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription. In:
2022 International Workshop on Acoustic Signal Enhancement (IWAENC). IEEE;
2022. doi:10.1109/IWAENC53105.2022.9914772'
apa: Gburrek, T., Schmalenstroeer, J., Heitkaemper, J., & Haeb-Umbach, R. (2022).
Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting
Transcription. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC).
17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022), Bamberg,
Germany. https://doi.org/10.1109/IWAENC53105.2022.9914772
bibtex: '@inproceedings{Gburrek_Schmalenstroeer_Heitkaemper_Haeb-Umbach_2022, title={Informed
vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription},
DOI={10.1109/IWAENC53105.2022.9914772},
booktitle={2022 International Workshop on Acoustic Signal Enhancement (IWAENC)},
publisher={IEEE}, author={Gburrek, Tobias and Schmalenstroeer, Joerg and Heitkaemper,
Jens and Haeb-Umbach, Reinhold}, year={2022} }'
chicago: Gburrek, Tobias, Joerg Schmalenstroeer, Jens Heitkaemper, and Reinhold
Haeb-Umbach. “Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks
for Meeting Transcription.” In 2022 International Workshop on Acoustic Signal
Enhancement (IWAENC). IEEE, 2022. https://doi.org/10.1109/IWAENC53105.2022.9914772.
ieee: 'T. Gburrek, J. Schmalenstroeer, J. Heitkaemper, and R. Haeb-Umbach, “Informed
vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription,”
presented at the 17th International Workshop on Acoustic Signal Enhancement (IWAENC
2022), Bamberg, Germany, 2022, doi: 10.1109/IWAENC53105.2022.9914772.'
mla: Gburrek, Tobias, et al. “Informed vs. Blind Beamforming in Ad-Hoc Acoustic
Sensor Networks for Meeting Transcription.” 2022 International Workshop on
Acoustic Signal Enhancement (IWAENC), IEEE, 2022, doi:10.1109/IWAENC53105.2022.9914772.
short: 'T. Gburrek, J. Schmalenstroeer, J. Heitkaemper, R. Haeb-Umbach, in: 2022
International Workshop on Acoustic Signal Enhancement (IWAENC), IEEE, 2022.'
conference:
end_date: 2022-09-08
location: 'Bamberg, Germany'
name: 17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022)
start_date: 2022-09-05
date_created: 2022-10-18T09:30:24Z
date_updated: 2023-11-17T06:40:58Z
ddc:
- '004'
department:
- _id: '54'
doi: 10.1109/IWAENC53105.2022.9914772
file:
- access_level: open_access
content_type: application/pdf
creator: tgburrek
date_created: 2023-11-17T06:40:40Z
date_updated: 2023-11-17T06:40:40Z
file_id: '48991'
file_name: iwaenc_22_camera_ready_ieee_check.pdf
file_size: 266475
relation: main_file
file_date_updated: 2023-11-17T06:40:40Z
has_accepted_license: '1'
language:
- iso: eng
oa: '1'
project:
- _id: '52'
name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing'
publication: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC)
publisher: IEEE
quality_controlled: '1'
status: public
title: Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting
Transcription
type: conference
user_id: '44006'
year: '2022'
...
---
_id: '33816'
author:
- first_name: Tobias
full_name: Gburrek, Tobias
id: '44006'
last_name: Gburrek
- first_name: Christoph
full_name: Boeddeker, Christoph
id: '40767'
last_name: Boeddeker
- first_name: Thilo
full_name: von Neumann, Thilo
id: '49870'
last_name: von Neumann
orcid: https://orcid.org/0000-0002-7717-8670
- first_name: Tobias
full_name: Cord-Landwehr, Tobias
id: '44393'
last_name: Cord-Landwehr
- first_name: Joerg
full_name: Schmalenstroeer, Joerg
id: '460'
last_name: Schmalenstroeer
- first_name: Reinhold
full_name: Haeb-Umbach, Reinhold
id: '242'
last_name: Haeb-Umbach
citation:
ama: Gburrek T, Boeddeker C, von Neumann T, Cord-Landwehr T, Schmalenstroeer J,
Haeb-Umbach R. A Meeting Transcription System for an Ad-Hoc Acoustic Sensor
Network. arXiv; 2022. doi:10.48550/ARXIV.2205.00944
apa: Gburrek, T., Boeddeker, C., von Neumann, T., Cord-Landwehr, T., Schmalenstroeer,
J., & Haeb-Umbach, R. (2022). A Meeting Transcription System for an Ad-Hoc
Acoustic Sensor Network. arXiv. https://doi.org/10.48550/ARXIV.2205.00944
bibtex: '@book{Gburrek_Boeddeker_von Neumann_Cord-Landwehr_Schmalenstroeer_Haeb-Umbach_2022,
title={A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network},
DOI={10.48550/ARXIV.2205.00944},
publisher={arXiv}, author={Gburrek, Tobias and Boeddeker, Christoph and von Neumann,
Thilo and Cord-Landwehr, Tobias and Schmalenstroeer, Joerg and Haeb-Umbach, Reinhold},
year={2022} }'
chicago: Gburrek, Tobias, Christoph Boeddeker, Thilo von Neumann, Tobias Cord-Landwehr,
Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. A Meeting Transcription System
for an Ad-Hoc Acoustic Sensor Network. arXiv, 2022. https://doi.org/10.48550/ARXIV.2205.00944.
ieee: T. Gburrek, C. Boeddeker, T. von Neumann, T. Cord-Landwehr, J. Schmalenstroeer,
and R. Haeb-Umbach, A Meeting Transcription System for an Ad-Hoc Acoustic Sensor
Network. arXiv, 2022.
mla: Gburrek, Tobias, et al. A Meeting Transcription System for an Ad-Hoc Acoustic
Sensor Network. arXiv, 2022, doi:10.48550/ARXIV.2205.00944.
short: T. Gburrek, C. Boeddeker, T. von Neumann, T. Cord-Landwehr, J. Schmalenstroeer,
R. Haeb-Umbach, A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network,
arXiv, 2022.
date_created: 2022-10-18T11:10:58Z
date_updated: 2023-11-17T06:42:16Z
ddc:
- '004'
department:
- _id: '54'
doi: 10.48550/ARXIV.2205.00944
file:
- access_level: open_access
content_type: application/pdf
creator: tgburrek
date_created: 2023-11-17T06:42:04Z
date_updated: 2023-11-17T06:42:04Z
file_id: '48992'
file_name: meeting_transcription_22.pdf
file_size: 199006
relation: main_file
file_date_updated: 2023-11-17T06:42:04Z
has_accepted_license: '1'
language:
- iso: eng
oa: '1'
project:
- _id: '52'
name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing'
publisher: arXiv
status: public
title: A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network
type: misc
user_id: '44006'
year: '2022'
...
---
_id: '22528'
abstract:
- lang: eng
text: Due to the ad hoc nature of wireless acoustic sensor networks, the position
of the sensor nodes is typically unknown. This contribution proposes a technique
to estimate the position and orientation of the sensor nodes from the recorded
speech signals. The method assumes that a node comprises a microphone array with
synchronously sampled microphones rather than a single microphone, but does not
require the sampling clocks of the nodes to be synchronized. From the observed
audio signals, the distances between the acoustic sources and arrays, as well
as the directions of arrival, are estimated. They serve as input to a non-linear
least squares problem, from which both the sensor nodes’ positions and orientations,
as well as the source positions, are alternatingly estimated in an iterative process.
Given one set of unknowns, i.e., either the source positions or the sensor nodes’
geometry, the other set of unknowns can be computed in closed-form. The proposed
approach is computationally efficient and the first one, which employs both distance
and directional information for geometry calibration in a common cost function.
Since both distance and direction of arrival measurements suffer from outliers,
e.g., caused by strong reflections of the sound waves on the surfaces of the room,
we introduce measures to deemphasize or remove unreliable measurements. Additionally,
we discuss modifications of our previously proposed deep neural network-based
acoustic distance estimator, to account not only for omnidirectional sources but
also for directional sources. Simulation results show good positioning accuracy
and compare very favorably with alternative approaches from the literature.
author:
- first_name: Tobias
full_name: Gburrek, Tobias
id: '44006'
last_name: Gburrek
- first_name: Joerg
full_name: Schmalenstroeer, Joerg
id: '460'
last_name: Schmalenstroeer
- first_name: Reinhold
full_name: Haeb-Umbach, Reinhold
id: '242'
last_name: Haeb-Umbach
citation:
ama: Gburrek T, Schmalenstroeer J, Haeb-Umbach R. Geometry calibration in wireless
acoustic sensor networks utilizing DoA and distance information. EURASIP Journal
on Audio, Speech, and Music Processing. Published online 2021. doi:10.1186/s13636-021-00210-x
apa: Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2021). Geometry calibration
in wireless acoustic sensor networks utilizing DoA and distance information. EURASIP
Journal on Audio, Speech, and Music Processing. https://doi.org/10.1186/s13636-021-00210-x
bibtex: '@article{Gburrek_Schmalenstroeer_Haeb-Umbach_2021, title={Geometry calibration
in wireless acoustic sensor networks utilizing DoA and distance information},
DOI={10.1186/s13636-021-00210-x},
journal={EURASIP Journal on Audio, Speech, and Music Processing}, author={Gburrek,
Tobias and Schmalenstroeer, Joerg and Haeb-Umbach, Reinhold}, year={2021} }'
chicago: Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “Geometry
Calibration in Wireless Acoustic Sensor Networks Utilizing DoA and Distance Information.”
EURASIP Journal on Audio, Speech, and Music Processing, 2021. https://doi.org/10.1186/s13636-021-00210-x.
ieee: 'T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “Geometry calibration
in wireless acoustic sensor networks utilizing DoA and distance information,”
EURASIP Journal on Audio, Speech, and Music Processing, 2021, doi: 10.1186/s13636-021-00210-x.'
mla: Gburrek, Tobias, et al. “Geometry Calibration in Wireless Acoustic Sensor Networks
Utilizing DoA and Distance Information.” EURASIP Journal on Audio, Speech,
and Music Processing, 2021, doi:10.1186/s13636-021-00210-x.
short: T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, EURASIP Journal on Audio,
Speech, and Music Processing (2021).
date_created: 2021-07-05T05:30:15Z
date_updated: 2023-11-17T06:36:17Z
department:
- _id: '54'
doi: 10.1186/s13636-021-00210-x
language:
- iso: eng
main_file_link:
- open_access: '1'
url: https://asmp-eurasipjournals.springeropen.com/articles/10.1186/s13636-021-00210-x
oa: '1'
publication: EURASIP Journal on Audio, Speech, and Music Processing
publication_identifier:
issn:
- 1687-4722
publication_status: published
quality_controlled: '1'
status: public
title: Geometry calibration in wireless acoustic sensor networks utilizing DoA and
distance information
type: journal_article
user_id: '44006'
year: '2021'
...
---
_id: '23994'
author:
- first_name: Tobias
full_name: Gburrek, Tobias
id: '44006'
last_name: Gburrek
- first_name: Joerg
full_name: Schmalenstroeer, Joerg
id: '460'
last_name: Schmalenstroeer
- first_name: Reinhold
full_name: Haeb-Umbach, Reinhold
id: '242'
last_name: Haeb-Umbach
citation:
ama: 'Gburrek T, Schmalenstroeer J, Haeb-Umbach R. Iterative Geometry Calibration
from Distance Estimates for Wireless Acoustic Sensor Networks. In: ICASSP 2021
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing
(ICASSP). 2021. doi:10.1109/icassp39728.2021.9413831'
apa: Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2021). Iterative Geometry
Calibration from Distance Estimates for Wireless Acoustic Sensor Networks. ICASSP
2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing
(ICASSP). https://doi.org/10.1109/icassp39728.2021.9413831
bibtex: '@inproceedings{Gburrek_Schmalenstroeer_Haeb-Umbach_2021, title={Iterative
Geometry Calibration from Distance Estimates for Wireless Acoustic Sensor Networks},
DOI={10.1109/icassp39728.2021.9413831},
booktitle={ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech
and Signal Processing (ICASSP)}, author={Gburrek, Tobias and Schmalenstroeer,
Joerg and Haeb-Umbach, Reinhold}, year={2021} }'
chicago: Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “Iterative
Geometry Calibration from Distance Estimates for Wireless Acoustic Sensor Networks.”
In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and
Signal Processing (ICASSP), 2021. https://doi.org/10.1109/icassp39728.2021.9413831.
ieee: 'T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “Iterative Geometry Calibration
from Distance Estimates for Wireless Acoustic Sensor Networks,” 2021, doi: 10.1109/icassp39728.2021.9413831.'
mla: Gburrek, Tobias, et al. “Iterative Geometry Calibration from Distance Estimates
for Wireless Acoustic Sensor Networks.” ICASSP 2021 - 2021 IEEE International
Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, doi:10.1109/icassp39728.2021.9413831.
short: 'T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, in: ICASSP 2021 - 2021 IEEE
International Conference on Acoustics, Speech and Signal Processing (ICASSP),
2021.'
date_created: 2021-09-09T08:30:16Z
date_updated: 2023-11-17T06:30:12Z
ddc:
- '004'
department:
- _id: '54'
doi: 10.1109/icassp39728.2021.9413831
file:
- access_level: open_access
content_type: application/pdf
creator: tgburrek
date_created: 2023-11-17T06:29:40Z
date_updated: 2023-11-17T06:30:11Z
file_id: '48988'
file_name: icassp21.pdf
file_size: 312400
relation: main_file
file_date_updated: 2023-11-17T06:30:11Z
has_accepted_license: '1'
language:
- iso: eng
oa: '1'
publication: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech
and Signal Processing (ICASSP)
publication_status: published
quality_controlled: '1'
status: public
title: Iterative Geometry Calibration from Distance Estimates for Wireless Acoustic
Sensor Networks
type: conference
user_id: '44006'
year: '2021'
...
---
_id: '23999'
author:
- first_name: Tobias
full_name: Gburrek, Tobias
id: '44006'
last_name: Gburrek
- first_name: Joerg
full_name: Schmalenstroeer, Joerg
id: '460'
last_name: Schmalenstroeer
- first_name: Reinhold
full_name: Haeb-Umbach, Reinhold
id: '242'
last_name: Haeb-Umbach
citation:
ama: 'Gburrek T, Schmalenstroeer J, Haeb-Umbach R. On Source-Microphone Distance
Estimation Using Convolutional Recurrent Neural Networks. In: Speech Communication;
14th ITG-Symposium. 2021:1-5.'
apa: Gburrek, T., Schmalenstroeer, J., & Haeb-Umbach, R. (2021). On Source-Microphone
Distance Estimation Using Convolutional Recurrent Neural Networks. Speech Communication;
14th ITG-Symposium, 1–5.
bibtex: '@inproceedings{Gburrek_Schmalenstroeer_Haeb-Umbach_2021, title={On Source-Microphone
Distance Estimation Using Convolutional Recurrent Neural Networks}, booktitle={Speech
Communication; 14th ITG-Symposium}, author={Gburrek, Tobias and Schmalenstroeer,
Joerg and Haeb-Umbach, Reinhold}, year={2021}, pages={1–5} }'
chicago: Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “On Source-Microphone
Distance Estimation Using Convolutional Recurrent Neural Networks.” In Speech
Communication; 14th ITG-Symposium, 1–5, 2021.
ieee: T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “On Source-Microphone
Distance Estimation Using Convolutional Recurrent Neural Networks,” in Speech
Communication; 14th ITG-Symposium, 2021, pp. 1–5.
mla: Gburrek, Tobias, et al. “On Source-Microphone Distance Estimation Using Convolutional
Recurrent Neural Networks.” Speech Communication; 14th ITG-Symposium, 2021,
pp. 1–5.
short: 'T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, in: Speech Communication;
14th ITG-Symposium, 2021, pp. 1–5.'
date_created: 2021-09-09T08:40:44Z
date_updated: 2023-11-17T06:32:20Z
ddc:
- '004'
department:
- _id: '54'
file:
- access_level: open_access
content_type: application/pdf
creator: tgburrek
date_created: 2023-11-17T06:31:37Z
date_updated: 2023-11-17T06:31:37Z
file_id: '48989'
file_name: dist_est.pdf
file_size: 449694
relation: main_file
file_date_updated: 2023-11-17T06:31:37Z
has_accepted_license: '1'
language:
- iso: eng
oa: '1'
page: 1-5
publication: Speech Communication; 14th ITG-Symposium
quality_controlled: '1'
status: public
title: On Source-Microphone Distance Estimation Using Convolutional Recurrent Neural
Networks
type: conference
user_id: '44006'
year: '2021'
...
---
_id: '23997'
author:
- first_name: Aleksej
full_name: Chinaev, Aleksej
last_name: Chinaev
- first_name: Gerald
full_name: Enzner, Gerald
last_name: Enzner
- first_name: Tobias
full_name: Gburrek, Tobias
id: '44006'
last_name: Gburrek
- first_name: Joerg
full_name: Schmalenstroeer, Joerg
id: '460'
last_name: Schmalenstroeer
citation:
ama: 'Chinaev A, Enzner G, Gburrek T, Schmalenstroeer J. Online Estimation of Sampling
Rate Offsets in Wireless Acoustic Sensor Networks with Packet Loss. In: 29th
European Signal Processing Conference (EUSIPCO). ; 2021:1-5.'
apa: Chinaev, A., Enzner, G., Gburrek, T., & Schmalenstroeer, J. (2021). Online
Estimation of Sampling Rate Offsets in Wireless Acoustic Sensor Networks with
Packet Loss. 29th European Signal Processing Conference (EUSIPCO), 1–5.
bibtex: '@inproceedings{Chinaev_Enzner_Gburrek_Schmalenstroeer_2021, title={Online
Estimation of Sampling Rate Offsets in Wireless Acoustic Sensor Networks with
Packet Loss}, booktitle={29th European Signal Processing Conference (EUSIPCO)},
author={Chinaev, Aleksej and Enzner, Gerald and Gburrek, Tobias and Schmalenstroeer,
Joerg}, year={2021}, pages={1–5} }'
chicago: Chinaev, Aleksej, Gerald Enzner, Tobias Gburrek, and Joerg Schmalenstroeer.
“Online Estimation of Sampling Rate Offsets in Wireless Acoustic Sensor Networks
with Packet Loss.” In 29th European Signal Processing Conference (EUSIPCO),
1–5, 2021.
ieee: A. Chinaev, G. Enzner, T. Gburrek, and J. Schmalenstroeer, “Online Estimation
of Sampling Rate Offsets in Wireless Acoustic Sensor Networks with Packet Loss,”
in 29th European Signal Processing Conference (EUSIPCO), 2021, pp. 1–5.
mla: Chinaev, Aleksej, et al. “Online Estimation of Sampling Rate Offsets in Wireless
Acoustic Sensor Networks with Packet Loss.” 29th European Signal Processing
Conference (EUSIPCO), 2021, pp. 1–5.
short: 'A. Chinaev, G. Enzner, T. Gburrek, J. Schmalenstroeer, in: 29th European
Signal Processing Conference (EUSIPCO), 2021, pp. 1–5.'
date_created: 2021-09-09T08:39:06Z
date_updated: 2023-11-17T06:37:10Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
url: https://eurasip.org/Proceedings/Eusipco/Eusipco2021/pdfs/0001110.pdf
oa: '1'
page: 1-5
publication: 29th European Signal Processing Conference (EUSIPCO)
quality_controlled: '1'
status: public
title: Online Estimation of Sampling Rate Offsets in Wireless Acoustic Sensor Networks
with Packet Loss
type: conference
user_id: '44006'
year: '2021'
...
---
_id: '18651'
abstract:
- lang: eng
text: 'We present an approach to deep neural network based (DNN-based) distance
estimation in reverberant rooms for supporting geometry calibration tasks in wireless
acoustic sensor networks. Signal diffuseness information from acoustic signals
is aggregated via the coherent-to-diffuse power ratio to obtain a distance-related
feature, which is mapped to a source-to-microphone distance estimate by means
of a DNN. This information is then combined with direction-of-arrival estimates
from compact microphone arrays to infer the geometry of the sensor network. Unlike
many other approaches to geometry calibration, the proposed scheme only requires
that the sampling clocks of the sensor nodes are roughly synchronized. In simulations
we show that the proposed DNN-based distance estimator generalizes to unseen acoustic
environments and that precise estimates of the sensor node positions are obtained. '
author:
- first_name: Tobias
full_name: Gburrek, Tobias
id: '44006'
last_name: Gburrek
- first_name: Joerg
full_name: Schmalenstroeer, Joerg
id: '460'
last_name: Schmalenstroeer
- first_name: Andreas
full_name: Brendel, Andreas
last_name: Brendel
- first_name: Walter
full_name: Kellermann, Walter
last_name: Kellermann
- first_name: Reinhold
full_name: Haeb-Umbach, Reinhold
id: '242'
last_name: Haeb-Umbach
citation:
ama: 'Gburrek T, Schmalenstroeer J, Brendel A, Kellermann W, Haeb-Umbach R. Deep
Neural Network based Distance Estimation for Geometry Calibration in Acoustic
Sensor Network. In: European Signal Processing Conference (EUSIPCO). ;
2020.'
apa: Gburrek, T., Schmalenstroeer, J., Brendel, A., Kellermann, W., & Haeb-Umbach,
R. (2020). Deep Neural Network based Distance Estimation for Geometry Calibration
in Acoustic Sensor Network. European Signal Processing Conference (EUSIPCO).
bibtex: '@inproceedings{Gburrek_Schmalenstroeer_Brendel_Kellermann_Haeb-Umbach_2020,
title={Deep Neural Network based Distance Estimation for Geometry Calibration
in Acoustic Sensor Network}, booktitle={European Signal Processing Conference
(EUSIPCO)}, author={Gburrek, Tobias and Schmalenstroeer, Joerg and Brendel, Andreas
and Kellermann, Walter and Haeb-Umbach, Reinhold}, year={2020} }'
chicago: Gburrek, Tobias, Joerg Schmalenstroeer, Andreas Brendel, Walter Kellermann,
and Reinhold Haeb-Umbach. “Deep Neural Network Based Distance Estimation for Geometry
Calibration in Acoustic Sensor Network.” In European Signal Processing Conference
(EUSIPCO), 2020.
ieee: T. Gburrek, J. Schmalenstroeer, A. Brendel, W. Kellermann, and R. Haeb-Umbach,
“Deep Neural Network based Distance Estimation for Geometry Calibration in Acoustic
Sensor Network,” 2020.
mla: Gburrek, Tobias, et al. “Deep Neural Network Based Distance Estimation for
Geometry Calibration in Acoustic Sensor Network.” European Signal Processing
Conference (EUSIPCO), 2020.
short: 'T. Gburrek, J. Schmalenstroeer, A. Brendel, W. Kellermann, R. Haeb-Umbach,
in: European Signal Processing Conference (EUSIPCO), 2020.'
date_created: 2020-08-31T07:20:57Z
date_updated: 2023-11-17T06:23:39Z
ddc:
- '004'
department:
- _id: '54'
file:
- access_level: open_access
content_type: application/pdf
creator: tgburrek
date_created: 2023-11-17T06:21:40Z
date_updated: 2023-11-17T06:21:40Z
file_id: '48987'
file_name: Gburrek2020.pdf
file_size: 292159
relation: main_file
file_date_updated: 2023-11-17T06:21:40Z
has_accepted_license: '1'
language:
- iso: eng
oa: '1'
publication: European Signal Processing Conference (EUSIPCO)
quality_controlled: '1'
status: public
title: Deep Neural Network based Distance Estimation for Geometry Calibration in Acoustic
Sensor Network
type: conference
user_id: '44006'
year: '2020'
...
---
_id: '15237'
abstract:
- lang: eng
text: This paper presents an approach to voice conversion, which does neither
require parallel data nor speaker or phone labels for training. It can convert
between speakers which are not in the training set by employing the previously
proposed concept of a factorized hierarchical variational autoencoder. Here, linguistic
and speaker induced variations are separated upon the notion that content induced variations
change at a much shorter timescale, i.e., at the segment level, than speaker induced
variations, which vary at the longer utterance level. In this contribution we propose
to employ convolutional instead of recurrent network layers in the encoder and decoder blocks, which is shown to achieve
better phone recognition accuracy on the latent segment variables at frame-level
due to their better temporal resolution. For voice conversion the mean of the utterance
variables is replaced with the respective estimated mean of the target speaker. The
resulting log-mel spectra of the decoder output are used as local conditions of
a WaveNet which is utilized for synthesis of the speech waveforms. Experiments
show both good disentanglement properties of the latent space variables, and
good voice conversion performance.
author:
- first_name: Tobias
full_name: Gburrek, Tobias
id: '44006'
last_name: Gburrek
- first_name: Thomas
full_name: Glarner, Thomas
id: '14169'
last_name: Glarner
- first_name: Janek
full_name: Ebbers, Janek
id: '34851'
last_name: Ebbers
- first_name: Reinhold
full_name: Haeb-Umbach, Reinhold
id: '242'
last_name: Haeb-Umbach
- first_name: Petra
full_name: Wagner, Petra
last_name: Wagner
citation:
ama: 'Gburrek T, Glarner T, Ebbers J, Haeb-Umbach R, Wagner P. Unsupervised Learning
of a Disentangled Speech Representation for Voice Conversion. In: Proc. 10th
ISCA Speech Synthesis Workshop. ; 2019:81-86. doi:10.21437/SSW.2019-15'
apa: Gburrek, T., Glarner, T., Ebbers, J., Haeb-Umbach, R., & Wagner, P. (2019).
Unsupervised Learning of a Disentangled Speech Representation for Voice Conversion.
Proc. 10th ISCA Speech Synthesis Workshop, 81–86. https://doi.org/10.21437/SSW.2019-15
bibtex: '@inproceedings{Gburrek_Glarner_Ebbers_Haeb-Umbach_Wagner_2019, title={Unsupervised
Learning of a Disentangled Speech Representation for Voice Conversion}, DOI={10.21437/SSW.2019-15}, booktitle={Proc.
10th ISCA Speech Synthesis Workshop}, author={Gburrek, Tobias and Glarner, Thomas
and Ebbers, Janek and Haeb-Umbach, Reinhold and Wagner, Petra}, year={2019}, pages={81–86}
}'
chicago: Gburrek, Tobias, Thomas Glarner, Janek Ebbers, Reinhold Haeb-Umbach, and
Petra Wagner. “Unsupervised Learning of a Disentangled Speech Representation for
Voice Conversion.” In Proc. 10th ISCA Speech Synthesis Workshop, 81–86,
2019. https://doi.org/10.21437/SSW.2019-15.
ieee: 'T. Gburrek, T. Glarner, J. Ebbers, R. Haeb-Umbach, and P. Wagner, “Unsupervised
Learning of a Disentangled Speech Representation for Voice Conversion,” in Proc.
10th ISCA Speech Synthesis Workshop, Vienna, 2019, pp. 81–86, doi: 10.21437/SSW.2019-15.'
mla: Gburrek, Tobias, et al. “Unsupervised Learning of a Disentangled Speech Representation
for Voice Conversion.” Proc. 10th ISCA Speech Synthesis Workshop, 2019,
pp. 81–86, doi:10.21437/SSW.2019-15.
short: 'T. Gburrek, T. Glarner, J. Ebbers, R. Haeb-Umbach, P. Wagner, in: Proc.
10th ISCA Speech Synthesis Workshop, 2019, pp. 81–86.'
conference:
location: Vienna
name: 10th ISCA Speech Synthesis Workshop
date_created: 2019-12-04T08:12:29Z
date_updated: 2023-11-17T06:20:39Z
department:
- _id: '54'
doi: 10.21437/SSW.2019-15
language:
- iso: eng
main_file_link:
- open_access: '1'
url: https://www.isca-speech.org/archive/pdfs/ssw_2019/gburrek19_ssw.pdf
oa: '1'
page: 81-86
publication: Proc. 10th ISCA Speech Synthesis Workshop
quality_controlled: '1'
related_material:
link:
- description: Listening examples
relation: supplementary_material
url: http://go.upb.de/vcex
status: public
title: Unsupervised Learning of a Disentangled Speech Representation for Voice Conversion
type: conference
user_id: '44006'
year: '2019'
...