---
_id: '61079'
abstract:
- lang: eng
  text: "We propose a spatio-spectral, combined model-based and data-driven\r\ndiarization
    pipeline consisting of TDOA-based segmentation followed by\r\nembedding-based
    clustering. The proposed system requires neither access to\r\nmulti-channel training
    data nor prior knowledge about the number or placement\r\nof microphones. It works
    for both a compact microphone array and distributed\r\nmicrophones, with minor
    adjustments. Due to its superior handling of\r\noverlapping speech during segmentation,
    the proposed pipeline significantly\r\noutperforms the single-channel pyannote
    approach, both in a scenario with a\r\ncompact microphone array and in a setup
    with distributed microphones.\r\nAdditionally, we show that, unlike fully spatial
    diarization pipelines, the\r\nproposed system can correctly track speakers when
    they change positions."
author:
- first_name: Tobias
  full_name: Cord-Landwehr, Tobias
  id: '44393'
  last_name: Cord-Landwehr
- first_name: Tobias
  full_name: Gburrek, Tobias
  id: '44006'
  last_name: Gburrek
- first_name: Marc
  full_name: Deegen, Marc
  id: '70272'
  last_name: Deegen
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Cord-Landwehr T, Gburrek T, Deegen M, Haeb-Umbach R. Spatio-spectral diarization
    of meetings by combining TDOA-based  segmentation and speaker embedding-based
    clustering. In: <i>Proceedings of INTERSPEECH</i>. ; 2025. doi:<a href="https://doi.org/10.21437/Interspeech.2025-1663">10.21437/Interspeech.2025-1663</a>'
  apa: Cord-Landwehr, T., Gburrek, T., Deegen, M., &#38; Haeb-Umbach, R. (2025). Spatio-spectral
    diarization of meetings by combining TDOA-based  segmentation and speaker embedding-based
    clustering. <i>Proceedings of INTERSPEECH</i>. Interspeech 2025, Rotterdam. <a
    href="https://doi.org/10.21437/Interspeech.2025-1663">https://doi.org/10.21437/Interspeech.2025-1663</a>
  bibtex: '@inproceedings{Cord-Landwehr_Gburrek_Deegen_Haeb-Umbach_2025, title={Spatio-spectral
    diarization of meetings by combining TDOA-based  segmentation and speaker embedding-based
    clustering}, DOI={<a href="https://doi.org/10.21437/Interspeech.2025-1663">10.21437/Interspeech.2025-1663</a>},
    booktitle={Proceedings of INTERSPEECH}, author={Cord-Landwehr, Tobias and Gburrek,
    Tobias and Deegen, Marc and Haeb-Umbach, Reinhold}, year={2025} }'
  chicago: Cord-Landwehr, Tobias, Tobias Gburrek, Marc Deegen, and Reinhold Haeb-Umbach.
    “Spatio-Spectral Diarization of Meetings by Combining TDOA-Based  Segmentation
    and Speaker Embedding-Based Clustering.” In <i>Proceedings of INTERSPEECH</i>,
    2025. <a href="https://doi.org/10.21437/Interspeech.2025-1663">https://doi.org/10.21437/Interspeech.2025-1663</a>.
  ieee: 'T. Cord-Landwehr, T. Gburrek, M. Deegen, and R. Haeb-Umbach, “Spatio-spectral
    diarization of meetings by combining TDOA-based  segmentation and speaker embedding-based
    clustering,” presented at the Interspeech 2025, Rotterdam, 2025, doi: <a href="https://doi.org/10.21437/Interspeech.2025-1663">10.21437/Interspeech.2025-1663</a>.'
  mla: Cord-Landwehr, Tobias, et al. “Spatio-Spectral Diarization of Meetings by Combining
    TDOA-Based  Segmentation and Speaker Embedding-Based Clustering.” <i>Proceedings
    of INTERSPEECH</i>, 2025, doi:<a href="https://doi.org/10.21437/Interspeech.2025-1663">10.21437/Interspeech.2025-1663</a>.
  short: 'T. Cord-Landwehr, T. Gburrek, M. Deegen, R. Haeb-Umbach, in: Proceedings
    of INTERSPEECH, 2025.'
conference:
  location: Rotterdam
  name: Interspeech 2025
date_created: 2025-08-29T09:39:01Z
date_updated: 2025-11-10T09:06:47Z
ddc:
- '000'
department:
- _id: '54'
doi: 10.21437/Interspeech.2025-1663
external_id:
  arxiv:
  - '2506.16228'
file:
- access_level: open_access
  content_type: application/pdf
  creator: cord
  date_created: 2025-08-29T09:43:32Z
  date_updated: 2025-08-29T09:43:32Z
  file_id: '61085'
  file_name: main.pdf
  file_size: 921918
  relation: main_file
file_date_updated: 2025-08-29T09:43:32Z
has_accepted_license: '1'
language:
- iso: eng
oa: '1'
project:
- _id: '52'
  name: Computing Resources Provided by the Paderborn Center for Parallel Computing
publication: Proceedings of INTERSPEECH
status: public
title: Spatio-spectral diarization of meetings by combining TDOA-based  segmentation
  and speaker embedding-based clustering
type: conference
user_id: '44393'
year: '2025'
...
---
_id: '57031'
author:
- first_name: Tobias
  full_name: Gburrek, Tobias
  id: '44006'
  last_name: Gburrek
- first_name: Adrian Tobias
  full_name: Meise, Adrian Tobias
  id: '79268'
  last_name: Meise
- first_name: Joerg
  full_name: Schmalenstroeer, Joerg
  id: '460'
  last_name: Schmalenstroeer
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Gburrek T, Meise AT, Schmalenstroeer J, Haeb-Umbach R. Diminishing Domain
    Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation
    Models. In: <i>2024 18th International Workshop on Acoustic Signal Enhancement
    (IWAENC)</i>. IEEE; 2024. doi:<a href="https://doi.org/10.1109/iwaenc61483.2024.10694103">10.1109/iwaenc61483.2024.10694103</a>'
  apa: Gburrek, T., Meise, A. T., Schmalenstroeer, J., &#38; Haeb-Umbach, R. (2024).
    Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic
    Room Reverberation Models. <i>2024 18th International Workshop on Acoustic Signal
    Enhancement (IWAENC)</i>. <a href="https://doi.org/10.1109/iwaenc61483.2024.10694103">https://doi.org/10.1109/iwaenc61483.2024.10694103</a>
  bibtex: '@inproceedings{Gburrek_Meise_Schmalenstroeer_Haeb-Umbach_2024, title={Diminishing
    Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room
    Reverberation Models}, DOI={<a href="https://doi.org/10.1109/iwaenc61483.2024.10694103">10.1109/iwaenc61483.2024.10694103</a>},
    booktitle={2024 18th International Workshop on Acoustic Signal Enhancement (IWAENC)},
    publisher={IEEE}, author={Gburrek, Tobias and Meise, Adrian Tobias and Schmalenstroeer,
    Joerg and Haeb-Umbach, Reinhold}, year={2024} }'
  chicago: Gburrek, Tobias, Adrian Tobias Meise, Joerg Schmalenstroeer, and Reinhold
    Haeb-Umbach. “Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation
    via Stochastic Room Reverberation Models.” In <i>2024 18th International Workshop
    on Acoustic Signal Enhancement (IWAENC)</i>. IEEE, 2024. <a href="https://doi.org/10.1109/iwaenc61483.2024.10694103">https://doi.org/10.1109/iwaenc61483.2024.10694103</a>.
  ieee: 'T. Gburrek, A. T. Meise, J. Schmalenstroeer, and R. Haeb-Umbach, “Diminishing
    Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room
    Reverberation Models,” 2024, doi: <a href="https://doi.org/10.1109/iwaenc61483.2024.10694103">10.1109/iwaenc61483.2024.10694103</a>.'
  mla: Gburrek, Tobias, et al. “Diminishing Domain Mismatch for DNN-Based Acoustic
    Distance Estimation via Stochastic Room Reverberation Models.” <i>2024 18th International
    Workshop on Acoustic Signal Enhancement (IWAENC)</i>, IEEE, 2024, doi:<a href="https://doi.org/10.1109/iwaenc61483.2024.10694103">10.1109/iwaenc61483.2024.10694103</a>.
  short: 'T. Gburrek, A.T. Meise, J. Schmalenstroeer, R. Haeb-Umbach, in: 2024 18th
    International Workshop on Acoustic Signal Enhancement (IWAENC), IEEE, 2024.'
date_created: 2024-11-13T08:39:57Z
date_updated: 2026-01-05T16:28:23Z
ddc:
- '006'
department:
- _id: '54'
doi: 10.1109/iwaenc61483.2024.10694103
file:
- access_level: open_access
  content_type: application/pdf
  creator: tgburrek
  date_created: 2024-11-13T08:42:13Z
  date_updated: 2024-11-13T08:42:13Z
  file_id: '57033'
  file_name: dist_est_iwaenc24_gburrek.pdf
  file_size: 300589
  relation: main_file
file_date_updated: 2024-11-13T08:42:13Z
has_accepted_license: '1'
language:
- iso: eng
oa: '1'
publication: 2024 18th International Workshop on Acoustic Signal Enhancement (IWAENC)
publication_status: published
publisher: IEEE
quality_controlled: '1'
status: public
title: Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via
  Stochastic Room Reverberation Models
type: conference
user_id: '44006'
year: '2024'
...
---
_id: '48269'
author:
- first_name: Tobias
  full_name: Gburrek, Tobias
  id: '44006'
  last_name: Gburrek
- first_name: Joerg
  full_name: Schmalenstroeer, Joerg
  id: '460'
  last_name: Schmalenstroeer
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Gburrek T, Schmalenstroeer J, Haeb-Umbach R. On the Integration of Sampling
    Rate Synchronization and Acoustic Beamforming. In: <i>European Signal Processing
    Conference (EUSIPCO)</i>. ; 2023.'
  apa: Gburrek, T., Schmalenstroeer, J., &#38; Haeb-Umbach, R. (2023). On the Integration
    of Sampling Rate Synchronization and Acoustic Beamforming. <i>European Signal
    Processing Conference (EUSIPCO)</i>. European Signal Processing Conference (EUSIPCO),
    Helsinki.
  bibtex: '@inproceedings{Gburrek_Schmalenstroeer_Haeb-Umbach_2023, title={On the
    Integration of Sampling Rate Synchronization and Acoustic Beamforming}, booktitle={European
    Signal Processing Conference (EUSIPCO)}, author={Gburrek, Tobias and Schmalenstroeer,
    Joerg and Haeb-Umbach, Reinhold}, year={2023} }'
  chicago: Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “On the
    Integration of Sampling Rate Synchronization and Acoustic Beamforming.” In <i>European
    Signal Processing Conference (EUSIPCO)</i>, 2023.
  ieee: T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “On the Integration of
    Sampling Rate Synchronization and Acoustic Beamforming,” presented at the European
    Signal Processing Conference (EUSIPCO), Helsinki, 2023.
  mla: Gburrek, Tobias, et al. “On the Integration of Sampling Rate Synchronization
    and Acoustic Beamforming.” <i>European Signal Processing Conference (EUSIPCO)</i>,
    2023.
  short: 'T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, in: European Signal Processing
    Conference (EUSIPCO), 2023.'
conference:
  location: Helsinki
  name: European Signal Processing Conference (EUSIPCO)
date_created: 2023-10-18T12:54:08Z
date_updated: 2023-10-26T08:16:23Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://eurasip.org/Proceedings/Eusipco/Eusipco2023/pdfs/0000011.pdf
oa: '1'
publication: European Signal Processing Conference (EUSIPCO)
quality_controlled: '1'
status: public
title: On the Integration of Sampling Rate Synchronization and Acoustic Beamforming
type: conference
user_id: '460'
year: '2023'
...
---
_id: '48270'
author:
- first_name: Joerg
  full_name: Schmalenstroeer, Joerg
  id: '460'
  last_name: Schmalenstroeer
- first_name: Tobias
  full_name: Gburrek, Tobias
  id: '44006'
  last_name: Gburrek
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Schmalenstroeer J, Gburrek T, Haeb-Umbach R. LibriWASN: A Data Set for Meeting
    Separation, Diarization, and Recognition with Asynchronous Recording Devices.
    In: <i>ITG Conference on Speech Communication</i>. ; 2023.'
  apa: 'Schmalenstroeer, J., Gburrek, T., &#38; Haeb-Umbach, R. (2023). LibriWASN:
    A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous
    Recording Devices. <i>ITG Conference on Speech Communication</i>. ITG Conference
    on Speech Communication, Aachen.'
  bibtex: '@inproceedings{Schmalenstroeer_Gburrek_Haeb-Umbach_2023, title={LibriWASN:
    A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous
    Recording Devices}, booktitle={ITG Conference on Speech Communication}, author={Schmalenstroeer,
    Joerg and Gburrek, Tobias and Haeb-Umbach, Reinhold}, year={2023} }'
  chicago: 'Schmalenstroeer, Joerg, Tobias Gburrek, and Reinhold Haeb-Umbach. “LibriWASN:
    A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous
    Recording Devices.” In <i>ITG Conference on Speech Communication</i>, 2023.'
  ieee: 'J. Schmalenstroeer, T. Gburrek, and R. Haeb-Umbach, “LibriWASN: A Data Set
    for Meeting Separation, Diarization, and Recognition with Asynchronous Recording
    Devices,” presented at the ITG Conference on Speech Communication, Aachen, 2023.'
  mla: 'Schmalenstroeer, Joerg, et al. “LibriWASN: A Data Set for Meeting Separation,
    Diarization, and Recognition with Asynchronous Recording Devices.” <i>ITG Conference
    on Speech Communication</i>, 2023.'
  short: 'J. Schmalenstroeer, T. Gburrek, R. Haeb-Umbach, in: ITG Conference on Speech
    Communication, 2023.'
conference:
  location: Aachen
  name: ITG Conference on Speech Communication
date_created: 2023-10-18T13:00:54Z
date_updated: 2023-11-15T14:48:45Z
ddc:
- '004'
department:
- _id: '54'
file:
- access_level: open_access
  content_type: application/pdf
  creator: schmalen
  date_created: 2023-10-26T08:20:15Z
  date_updated: 2023-11-15T14:48:44Z
  file_id: '48483'
  file_name: SchTgbHaeb2023Final.pdf
  file_size: 2844502
  relation: main_file
file_date_updated: 2023-11-15T14:48:44Z
has_accepted_license: '1'
language:
- iso: eng
oa: '1'
publication: ITG Conference on Speech Communication
quality_controlled: '1'
status: public
title: 'LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition
  with Asynchronous Recording Devices'
type: conference
user_id: '460'
year: '2023'
...
---
_id: '49109'
abstract:
- lang: eng
  text: "We propose a diarization system, that estimates “who spoke when” based on
    spatial information, to be used as a front-end of a meeting transcription system
    running on the signals gathered from an acoustic sensor network (ASN). Although
    the\r\nspatial distribution of the microphones is advantageous, exploiting the
    spatial diversity for diarization and signal enhancement is challenging, because
    the microphones’ positions are typically unknown, and the recorded signals are
    initially unsynchronized in general. Here, we approach these issues by first blindly
    synchronizing the signals and then estimating time differences of arrival (TDOAs).
    The TDOA information is exploited to estimate the speakers’ activity, even in
    the presence of multiple speakers being simultaneously active. This speaker activity
    information serves as a guide for a spatial mixture model, on which basis the
    individual speaker’s signals are extracted via beamforming. Finally, the extracted
    signals are forwarded to a speech recognizer. Additionally, a novel initialization
    scheme for spatial mixture models based on the TDOA estimates is proposed. Experiments
    conducted on real recordings from the LibriWASN data set have shown that our proposed
    system is advantageous compared to a system using a spatial mixture model, which
    does not make use\r\nof external diarization information."
author:
- first_name: Tobias
  full_name: Gburrek, Tobias
  id: '44006'
  last_name: Gburrek
- first_name: Joerg
  full_name: Schmalenstroeer, Joerg
  id: '460'
  last_name: Schmalenstroeer
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Gburrek T, Schmalenstroeer J, Haeb-Umbach R. Spatial Diarization for Meeting
    Transcription with Ad-Hoc Acoustic Sensor Networks. In: <i>Proc. Asilomar Conference
    on Signals, Systems, and Computers</i>. ; 2023.'
  apa: Gburrek, T., Schmalenstroeer, J., &#38; Haeb-Umbach, R. (2023). Spatial Diarization
    for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks. <i>Proc. Asilomar
    Conference on Signals, Systems, and Computers</i>. 57th Asilomar Conference on
    Signals, Systems, and Computers.
  bibtex: '@inproceedings{Gburrek_Schmalenstroeer_Haeb-Umbach_2023, title={Spatial
    Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks}, booktitle={Proc.
    Asilomar Conference on Signals, Systems, and Computers}, author={Gburrek, Tobias
    and Schmalenstroeer, Joerg and Haeb-Umbach, Reinhold}, year={2023} }'
  chicago: Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “Spatial
    Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks.” In
    <i>Proc. Asilomar Conference on Signals, Systems, and Computers</i>, 2023.
  ieee: T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “Spatial Diarization for
    Meeting Transcription with Ad-Hoc Acoustic Sensor Networks,” presented at the
    57th Asilomar Conference on Signals, Systems, and Computers, 2023.
  mla: Gburrek, Tobias, et al. “Spatial Diarization for Meeting Transcription with
    Ad-Hoc Acoustic Sensor Networks.” <i>Proc. Asilomar Conference on Signals, Systems,
    and Computers</i>, 2023.
  short: 'T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, in: Proc. Asilomar Conference
    on Signals, Systems, and Computers, 2023.'
conference:
  end_date: 2023-11-01
  name: 57th Asilomar Conference on Signals, Systems, and Computers
  start_date: 2023-10-31
date_created: 2023-11-22T07:52:29Z
date_updated: 2023-11-22T07:58:49Z
ddc:
- '004'
department:
- _id: '54'
file:
- access_level: open_access
  content_type: application/pdf
  creator: schmalen
  date_created: 2023-11-22T07:51:18Z
  date_updated: 2023-11-22T07:58:49Z
  file_id: '49110'
  file_name: asilomar.pdf
  file_size: 212317
  relation: main_file
file_date_updated: 2023-11-22T07:58:49Z
has_accepted_license: '1'
keyword:
- Diarization
- time difference of arrival
- ad-hoc acoustic sensor network
- meeting transcription
language:
- iso: eng
oa: '1'
publication: Proc. Asilomar Conference on Signals, Systems, and Computers
quality_controlled: '1'
status: public
title: Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks
type: conference
user_id: '460'
year: '2023'
...
---
_id: '33806'
author:
- first_name: Haitham
  full_name: Afifi, Haitham
  last_name: Afifi
- first_name: Holger
  full_name: Karl, Holger
  last_name: Karl
- first_name: Tobias
  full_name: Gburrek, Tobias
  id: '44006'
  last_name: Gburrek
- first_name: Joerg
  full_name: Schmalenstroeer, Joerg
  id: '460'
  last_name: Schmalenstroeer
citation:
  ama: 'Afifi H, Karl H, Gburrek T, Schmalenstroeer J. Data-driven Time Synchronization
    in Wireless Multimedia Networks. In: <i>2022 International Wireless Communications
    and Mobile Computing (IWCMC)</i>. IEEE; 2022. doi:<a href="https://doi.org/10.1109/iwcmc55113.2022.9824980">10.1109/iwcmc55113.2022.9824980</a>'
  apa: Afifi, H., Karl, H., Gburrek, T., &#38; Schmalenstroeer, J. (2022). Data-driven
    Time Synchronization in Wireless Multimedia Networks. <i>2022 International Wireless
    Communications and Mobile Computing (IWCMC)</i>. <a href="https://doi.org/10.1109/iwcmc55113.2022.9824980">https://doi.org/10.1109/iwcmc55113.2022.9824980</a>
  bibtex: '@inproceedings{Afifi_Karl_Gburrek_Schmalenstroeer_2022, title={Data-driven
    Time Synchronization in Wireless Multimedia Networks}, DOI={<a href="https://doi.org/10.1109/iwcmc55113.2022.9824980">10.1109/iwcmc55113.2022.9824980</a>},
    booktitle={2022 International Wireless Communications and Mobile Computing (IWCMC)},
    publisher={IEEE}, author={Afifi, Haitham and Karl, Holger and Gburrek, Tobias
    and Schmalenstroeer, Joerg}, year={2022} }'
  chicago: Afifi, Haitham, Holger Karl, Tobias Gburrek, and Joerg Schmalenstroeer.
    “Data-Driven Time Synchronization in Wireless Multimedia Networks.” In <i>2022
    International Wireless Communications and Mobile Computing (IWCMC)</i>. IEEE,
    2022. <a href="https://doi.org/10.1109/iwcmc55113.2022.9824980">https://doi.org/10.1109/iwcmc55113.2022.9824980</a>.
  ieee: 'H. Afifi, H. Karl, T. Gburrek, and J. Schmalenstroeer, “Data-driven Time
    Synchronization in Wireless Multimedia Networks,” 2022, doi: <a href="https://doi.org/10.1109/iwcmc55113.2022.9824980">10.1109/iwcmc55113.2022.9824980</a>.'
  mla: Afifi, Haitham, et al. “Data-Driven Time Synchronization in Wireless Multimedia
    Networks.” <i>2022 International Wireless Communications and Mobile Computing
    (IWCMC)</i>, IEEE, 2022, doi:<a href="https://doi.org/10.1109/iwcmc55113.2022.9824980">10.1109/iwcmc55113.2022.9824980</a>.
  short: 'H. Afifi, H. Karl, T. Gburrek, J. Schmalenstroeer, in: 2022 International
    Wireless Communications and Mobile Computing (IWCMC), IEEE, 2022.'
date_created: 2022-10-18T09:24:17Z
date_updated: 2023-10-26T08:16:07Z
department:
- _id: '54'
doi: 10.1109/iwcmc55113.2022.9824980
language:
- iso: eng
publication: 2022 International Wireless Communications and Mobile Computing (IWCMC)
publication_status: published
publisher: IEEE
quality_controlled: '1'
status: public
title: Data-driven Time Synchronization in Wireless Multimedia Networks
type: conference
user_id: '460'
year: '2022'
...
---
_id: '33807'
author:
- first_name: Tobias
  full_name: Gburrek, Tobias
  id: '44006'
  last_name: Gburrek
- first_name: Joerg
  full_name: Schmalenstroeer, Joerg
  id: '460'
  last_name: Schmalenstroeer
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Gburrek T, Schmalenstroeer J, Haeb-Umbach R. On Synchronization of Wireless
    Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets
    and Speaker Changes. In: <i>ICASSP 2022 - 2022 IEEE International Conference on
    Acoustics, Speech and Signal Processing (ICASSP)</i>. IEEE; 2022. doi:<a href="https://doi.org/10.1109/icassp43922.2022.9746284">10.1109/icassp43922.2022.9746284</a>'
  apa: Gburrek, T., Schmalenstroeer, J., &#38; Haeb-Umbach, R. (2022). On Synchronization
    of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling
    Rate Offsets and Speaker Changes. <i>ICASSP 2022 - 2022 IEEE International Conference
    on Acoustics, Speech and Signal Processing (ICASSP)</i>. <a href="https://doi.org/10.1109/icassp43922.2022.9746284">https://doi.org/10.1109/icassp43922.2022.9746284</a>
  bibtex: '@inproceedings{Gburrek_Schmalenstroeer_Haeb-Umbach_2022, title={On Synchronization
    of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling
    Rate Offsets and Speaker Changes}, DOI={<a href="https://doi.org/10.1109/icassp43922.2022.9746284">10.1109/icassp43922.2022.9746284</a>},
    booktitle={ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech
    and Signal Processing (ICASSP)}, publisher={IEEE}, author={Gburrek, Tobias and
    Schmalenstroeer, Joerg and Haeb-Umbach, Reinhold}, year={2022} }'
  chicago: Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “On Synchronization
    of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling
    Rate Offsets and Speaker Changes.” In <i>ICASSP 2022 - 2022 IEEE International
    Conference on Acoustics, Speech and Signal Processing (ICASSP)</i>. IEEE, 2022.
    <a href="https://doi.org/10.1109/icassp43922.2022.9746284">https://doi.org/10.1109/icassp43922.2022.9746284</a>.
  ieee: 'T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “On Synchronization of
    Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate
    Offsets and Speaker Changes,” 2022, doi: <a href="https://doi.org/10.1109/icassp43922.2022.9746284">10.1109/icassp43922.2022.9746284</a>.'
  mla: Gburrek, Tobias, et al. “On Synchronization of Wireless Acoustic Sensor Networks
    in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes.” <i>ICASSP
    2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing
    (ICASSP)</i>, IEEE, 2022, doi:<a href="https://doi.org/10.1109/icassp43922.2022.9746284">10.1109/icassp43922.2022.9746284</a>.
  short: 'T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, in: ICASSP 2022 - 2022 IEEE
    International Conference on Acoustics, Speech and Signal Processing (ICASSP),
    IEEE, 2022.'
date_created: 2022-10-18T09:25:51Z
date_updated: 2023-11-17T06:39:28Z
ddc:
- '004'
department:
- _id: '54'
doi: 10.1109/icassp43922.2022.9746284
file:
- access_level: open_access
  content_type: application/pdf
  creator: tgburrek
  date_created: 2023-11-17T06:39:04Z
  date_updated: 2023-11-17T06:39:04Z
  file_id: '48990'
  file_name: gburrek_icassp22.pdf
  file_size: 358015
  relation: main_file
file_date_updated: 2023-11-17T06:39:04Z
has_accepted_license: '1'
language:
- iso: eng
oa: '1'
publication: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech
  and Signal Processing (ICASSP)
publication_status: published
publisher: IEEE
quality_controlled: '1'
status: public
title: On Synchronization of Wireless Acoustic Sensor Networks in the Presence of
  Time-Varying Sampling Rate Offsets and Speaker Changes
type: conference
user_id: '44006'
year: '2022'
...
---
_id: '33808'
author:
- first_name: Tobias
  full_name: Gburrek, Tobias
  id: '44006'
  last_name: Gburrek
- first_name: Joerg
  full_name: Schmalenstroeer, Joerg
  id: '460'
  last_name: Schmalenstroeer
- first_name: Jens
  full_name: Heitkaemper, Jens
  id: '27643'
  last_name: Heitkaemper
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Gburrek T, Schmalenstroeer J, Heitkaemper J, Haeb-Umbach R. Informed vs. Blind
    Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription. In:
    <i>2022 International Workshop on Acoustic Signal Enhancement (IWAENC)</i>. IEEE;
    2022. doi:<a href="https://doi.org/10.1109/IWAENC53105.2022.9914772">10.1109/IWAENC53105.2022.9914772</a>'
  apa: Gburrek, T., Schmalenstroeer, J., Heitkaemper, J., &#38; Haeb-Umbach, R. (2022).
    Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting
    Transcription. <i>2022 International Workshop on Acoustic Signal Enhancement (IWAENC)</i>.
    17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022),  Bamberg,
    Germany . <a href="https://doi.org/10.1109/IWAENC53105.2022.9914772">https://doi.org/10.1109/IWAENC53105.2022.9914772</a>
  bibtex: '@inproceedings{Gburrek_Schmalenstroeer_Heitkaemper_Haeb-Umbach_2022, title={Informed
    vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription},
    DOI={<a href="https://doi.org/10.1109/IWAENC53105.2022.9914772">10.1109/IWAENC53105.2022.9914772</a>},
    booktitle={2022 International Workshop on Acoustic Signal Enhancement (IWAENC)},
    publisher={IEEE}, author={Gburrek, Tobias and Schmalenstroeer, Joerg and Heitkaemper,
    Jens and Haeb-Umbach, Reinhold}, year={2022} }'
  chicago: Gburrek, Tobias, Joerg Schmalenstroeer, Jens Heitkaemper, and Reinhold
    Haeb-Umbach. “Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks
    for Meeting Transcription.” In <i>2022 International Workshop on Acoustic Signal
    Enhancement (IWAENC)</i>. IEEE, 2022. <a href="https://doi.org/10.1109/IWAENC53105.2022.9914772">https://doi.org/10.1109/IWAENC53105.2022.9914772</a>.
  ieee: 'T. Gburrek, J. Schmalenstroeer, J. Heitkaemper, and R. Haeb-Umbach, “Informed
    vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting Transcription,”
    presented at the 17th International Workshop on Acoustic Signal Enhancement (IWAENC
    2022),  Bamberg, Germany , 2022, doi: <a href="https://doi.org/10.1109/IWAENC53105.2022.9914772">10.1109/IWAENC53105.2022.9914772</a>.'
  mla: Gburrek, Tobias, et al. “Informed vs. Blind Beamforming in Ad-Hoc Acoustic
    Sensor Networks for Meeting Transcription.” <i>2022 International Workshop on
    Acoustic Signal Enhancement (IWAENC)</i>, IEEE, 2022, doi:<a href="https://doi.org/10.1109/IWAENC53105.2022.9914772">10.1109/IWAENC53105.2022.9914772</a>.
  short: 'T. Gburrek, J. Schmalenstroeer, J. Heitkaemper, R. Haeb-Umbach, in: 2022
    International Workshop on Acoustic Signal Enhancement (IWAENC), IEEE, 2022.'
conference:
  end_date: 2022-09-08
  location: ' Bamberg, Germany '
  name: 17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022)
  start_date: 2022-09-05
date_created: 2022-10-18T09:30:24Z
date_updated: 2023-11-17T06:40:58Z
ddc:
- '004'
department:
- _id: '54'
doi: 10.1109/IWAENC53105.2022.9914772
file:
- access_level: open_access
  content_type: application/pdf
  creator: tgburrek
  date_created: 2023-11-17T06:40:40Z
  date_updated: 2023-11-17T06:40:40Z
  file_id: '48991'
  file_name: iwaenc_22_camera_ready_ieee_check.pdf
  file_size: 266475
  relation: main_file
file_date_updated: 2023-11-17T06:40:40Z
has_accepted_license: '1'
language:
- iso: eng
oa: '1'
project:
- _id: '52'
  name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing'
publication: 2022 International Workshop on Acoustic Signal Enhancement (IWAENC)
publisher: IEEE
quality_controlled: '1'
status: public
title: Informed vs. Blind Beamforming in Ad-Hoc Acoustic Sensor Networks for Meeting
  Transcription
type: conference
user_id: '44006'
year: '2022'
...
---
_id: '33816'
author:
- first_name: Tobias
  full_name: Gburrek, Tobias
  id: '44006'
  last_name: Gburrek
- first_name: Christoph
  full_name: Boeddeker, Christoph
  id: '40767'
  last_name: Boeddeker
- first_name: Thilo
  full_name: von Neumann, Thilo
  id: '49870'
  last_name: von Neumann
  orcid: https://orcid.org/0000-0002-7717-8670
- first_name: Tobias
  full_name: Cord-Landwehr, Tobias
  id: '44393'
  last_name: Cord-Landwehr
- first_name: Joerg
  full_name: Schmalenstroeer, Joerg
  id: '460'
  last_name: Schmalenstroeer
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: Gburrek T, Boeddeker C, von Neumann T, Cord-Landwehr T, Schmalenstroeer J,
    Haeb-Umbach R. <i>A Meeting Transcription System for an Ad-Hoc Acoustic Sensor
    Network</i>. arXiv; 2022. doi:<a href="https://doi.org/10.48550/ARXIV.2205.00944">10.48550/ARXIV.2205.00944</a>
  apa: Gburrek, T., Boeddeker, C., von Neumann, T., Cord-Landwehr, T., Schmalenstroeer,
    J., &#38; Haeb-Umbach, R. (2022). <i>A Meeting Transcription System for an Ad-Hoc
    Acoustic Sensor Network</i>. arXiv. <a href="https://doi.org/10.48550/ARXIV.2205.00944">https://doi.org/10.48550/ARXIV.2205.00944</a>
  bibtex: '@book{Gburrek_Boeddeker_von Neumann_Cord-Landwehr_Schmalenstroeer_Haeb-Umbach_2022,
    title={A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network},
    DOI={<a href="https://doi.org/10.48550/ARXIV.2205.00944">10.48550/ARXIV.2205.00944</a>},
    publisher={arXiv}, author={Gburrek, Tobias and Boeddeker, Christoph and von Neumann,
    Thilo and Cord-Landwehr, Tobias and Schmalenstroeer, Joerg and Haeb-Umbach, Reinhold},
    year={2022} }'
  chicago: Gburrek, Tobias, Christoph Boeddeker, Thilo von Neumann, Tobias Cord-Landwehr,
    Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. <i>A Meeting Transcription System
    for an Ad-Hoc Acoustic Sensor Network</i>. arXiv, 2022. <a href="https://doi.org/10.48550/ARXIV.2205.00944">https://doi.org/10.48550/ARXIV.2205.00944</a>.
  ieee: T. Gburrek, C. Boeddeker, T. von Neumann, T. Cord-Landwehr, J. Schmalenstroeer,
    and R. Haeb-Umbach, <i>A Meeting Transcription System for an Ad-Hoc Acoustic Sensor
    Network</i>. arXiv, 2022.
  mla: Gburrek, Tobias, et al. <i>A Meeting Transcription System for an Ad-Hoc Acoustic
    Sensor Network</i>. arXiv, 2022, doi:<a href="https://doi.org/10.48550/ARXIV.2205.00944">10.48550/ARXIV.2205.00944</a>.
  short: T. Gburrek, C. Boeddeker, T. von Neumann, T. Cord-Landwehr, J. Schmalenstroeer,
    R. Haeb-Umbach, A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network,
    arXiv, 2022.
date_created: 2022-10-18T11:10:58Z
date_updated: 2025-02-12T09:03:42Z
ddc:
- '004'
department:
- _id: '54'
doi: 10.48550/ARXIV.2205.00944
file:
- access_level: open_access
  content_type: application/pdf
  creator: tgburrek
  date_created: 2023-11-17T06:42:04Z
  date_updated: 2023-11-17T06:42:04Z
  file_id: '48992'
  file_name: meeting_transcription_22.pdf
  file_size: 199006
  relation: main_file
file_date_updated: 2023-11-17T06:42:04Z
has_accepted_license: '1'
language:
- iso: eng
oa: '1'
project:
- _id: '52'
  name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing'
- _id: '508'
  grant_number: '448568305'
  name: Automatische Transkription von Gesprächssituationen
publisher: arXiv
status: public
title: A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network
type: misc
user_id: '40767'
year: '2022'
...
---
_id: '22528'
abstract:
- lang: eng
  text: Due to the ad hoc nature of wireless acoustic sensor networks, the position
    of the sensor nodes is typically unknown. This contribution proposes a technique
    to estimate the position and orientation of the sensor nodes from the recorded
    speech signals. The method assumes that a node comprises a microphone array with
    synchronously sampled microphones rather than a single microphone, but does not
    require the sampling clocks of the nodes to be synchronized. From the observed
    audio signals, the distances between the acoustic sources and arrays, as well
    as the directions of arrival, are estimated. They serve as input to a non-linear
    least squares problem, from which both the sensor nodes’ positions and orientations,
    as well as the source positions, are alternatingly estimated in an iterative process.
    Given one set of unknowns, i.e., either the source positions or the sensor nodes’
    geometry, the other set of unknowns can be computed in closed-form. The proposed
    approach is computationally efficient and the first one, which employs both distance
    and directional information for geometry calibration in a common cost function.
    Since both distance and direction of arrival measurements suffer from outliers,
    e.g., caused by strong reflections of the sound waves on the surfaces of the room,
    we introduce measures to deemphasize or remove unreliable measurements. Additionally,
    we discuss modifications of our previously proposed deep neural network-based
    acoustic distance estimator, to account not only for omnidirectional sources but
    also for directional sources. Simulation results show good positioning accuracy
    and compare very favorably with alternative approaches from the literature.
author:
- first_name: Tobias
  full_name: Gburrek, Tobias
  id: '44006'
  last_name: Gburrek
- first_name: Joerg
  full_name: Schmalenstroeer, Joerg
  id: '460'
  last_name: Schmalenstroeer
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: Gburrek T, Schmalenstroeer J, Haeb-Umbach R. Geometry calibration in wireless
    acoustic sensor networks utilizing DoA and distance information. <i>EURASIP Journal
    on Audio, Speech, and Music Processing</i>. Published online 2021. doi:<a href="https://doi.org/10.1186/s13636-021-00210-x">10.1186/s13636-021-00210-x</a>
  apa: Gburrek, T., Schmalenstroeer, J., &#38; Haeb-Umbach, R. (2021). Geometry calibration
    in wireless acoustic sensor networks utilizing DoA and distance information. <i>EURASIP
    Journal on Audio, Speech, and Music Processing</i>. <a href="https://doi.org/10.1186/s13636-021-00210-x">https://doi.org/10.1186/s13636-021-00210-x</a>
  bibtex: '@article{Gburrek_Schmalenstroeer_Haeb-Umbach_2021, title={Geometry calibration
    in wireless acoustic sensor networks utilizing DoA and distance information},
    DOI={<a href="https://doi.org/10.1186/s13636-021-00210-x">10.1186/s13636-021-00210-x</a>},
    journal={EURASIP Journal on Audio, Speech, and Music Processing}, author={Gburrek,
    Tobias and Schmalenstroeer, Joerg and Haeb-Umbach, Reinhold}, year={2021} }'
  chicago: Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “Geometry
    Calibration in Wireless Acoustic Sensor Networks Utilizing DoA and Distance Information.”
    <i>EURASIP Journal on Audio, Speech, and Music Processing</i>, 2021. <a href="https://doi.org/10.1186/s13636-021-00210-x">https://doi.org/10.1186/s13636-021-00210-x</a>.
  ieee: 'T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “Geometry calibration
    in wireless acoustic sensor networks utilizing DoA and distance information,”
    <i>EURASIP Journal on Audio, Speech, and Music Processing</i>, 2021, doi: <a href="https://doi.org/10.1186/s13636-021-00210-x">10.1186/s13636-021-00210-x</a>.'
  mla: Gburrek, Tobias, et al. “Geometry Calibration in Wireless Acoustic Sensor Networks
    Utilizing DoA and Distance Information.” <i>EURASIP Journal on Audio, Speech,
    and Music Processing</i>, 2021, doi:<a href="https://doi.org/10.1186/s13636-021-00210-x">10.1186/s13636-021-00210-x</a>.
  short: T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, EURASIP Journal on Audio,
    Speech, and Music Processing (2021).
date_created: 2021-07-05T05:30:15Z
date_updated: 2023-11-17T06:36:17Z
department:
- _id: '54'
doi: 10.1186/s13636-021-00210-x
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://asmp-eurasipjournals.springeropen.com/articles/10.1186/s13636-021-00210-x
oa: '1'
publication: EURASIP Journal on Audio, Speech, and Music Processing
publication_identifier:
  issn:
  - 1687-4722
publication_status: published
quality_controlled: '1'
status: public
title: Geometry calibration in wireless acoustic sensor networks utilizing DoA and
  distance information
type: journal_article
user_id: '44006'
year: '2021'
...
---
_id: '23994'
author:
- first_name: Tobias
  full_name: Gburrek, Tobias
  id: '44006'
  last_name: Gburrek
- first_name: Joerg
  full_name: Schmalenstroeer, Joerg
  id: '460'
  last_name: Schmalenstroeer
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Gburrek T, Schmalenstroeer J, Haeb-Umbach R. Iterative Geometry Calibration
    from Distance Estimates for Wireless Acoustic Sensor Networks. In: <i>ICASSP 2021
    - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing
    (ICASSP)</i>. ; 2021. doi:<a href="https://doi.org/10.1109/icassp39728.2021.9413831">10.1109/icassp39728.2021.9413831</a>'
  apa: Gburrek, T., Schmalenstroeer, J., &#38; Haeb-Umbach, R. (2021). Iterative Geometry
    Calibration from Distance Estimates for Wireless Acoustic Sensor Networks. <i>ICASSP
    2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing
    (ICASSP)</i>. <a href="https://doi.org/10.1109/icassp39728.2021.9413831">https://doi.org/10.1109/icassp39728.2021.9413831</a>
  bibtex: '@inproceedings{Gburrek_Schmalenstroeer_Haeb-Umbach_2021, title={Iterative
    Geometry Calibration from Distance Estimates for Wireless Acoustic Sensor Networks},
    DOI={<a href="https://doi.org/10.1109/icassp39728.2021.9413831">10.1109/icassp39728.2021.9413831</a>},
    booktitle={ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech
    and Signal Processing (ICASSP)}, author={Gburrek, Tobias and Schmalenstroeer,
    Joerg and Haeb-Umbach, Reinhold}, year={2021} }'
  chicago: Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “Iterative
    Geometry Calibration from Distance Estimates for Wireless Acoustic Sensor Networks.”
    In <i>ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and
    Signal Processing (ICASSP)</i>, 2021. <a href="https://doi.org/10.1109/icassp39728.2021.9413831">https://doi.org/10.1109/icassp39728.2021.9413831</a>.
  ieee: 'T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “Iterative Geometry Calibration
    from Distance Estimates for Wireless Acoustic Sensor Networks,” 2021, doi: <a
    href="https://doi.org/10.1109/icassp39728.2021.9413831">10.1109/icassp39728.2021.9413831</a>.'
  mla: Gburrek, Tobias, et al. “Iterative Geometry Calibration from Distance Estimates
    for Wireless Acoustic Sensor Networks.” <i>ICASSP 2021 - 2021 IEEE International
    Conference on Acoustics, Speech and Signal Processing (ICASSP)</i>, 2021, doi:<a
    href="https://doi.org/10.1109/icassp39728.2021.9413831">10.1109/icassp39728.2021.9413831</a>.
  short: 'T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, in: ICASSP 2021 - 2021 IEEE
    International Conference on Acoustics, Speech and Signal Processing (ICASSP),
    2021.'
date_created: 2021-09-09T08:30:16Z
date_updated: 2023-11-17T06:30:12Z
ddc:
- '004'
department:
- _id: '54'
doi: 10.1109/icassp39728.2021.9413831
file:
- access_level: open_access
  content_type: application/pdf
  creator: tgburrek
  date_created: 2023-11-17T06:29:40Z
  date_updated: 2023-11-17T06:30:11Z
  file_id: '48988'
  file_name: icassp21.pdf
  file_size: 312400
  relation: main_file
file_date_updated: 2023-11-17T06:30:11Z
has_accepted_license: '1'
language:
- iso: eng
oa: '1'
publication: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech
  and Signal Processing (ICASSP)
publication_status: published
quality_controlled: '1'
status: public
title: Iterative Geometry Calibration from Distance Estimates for Wireless Acoustic
  Sensor Networks
type: conference
user_id: '44006'
year: '2021'
...
---
_id: '23999'
author:
- first_name: Tobias
  full_name: Gburrek, Tobias
  id: '44006'
  last_name: Gburrek
- first_name: Joerg
  full_name: Schmalenstroeer, Joerg
  id: '460'
  last_name: Schmalenstroeer
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Gburrek T, Schmalenstroeer J, Haeb-Umbach R. On Source-Microphone Distance
    Estimation Using Convolutional Recurrent Neural Networks. In: <i>Speech Communication;
    14th ITG-Symposium</i>. ; 2021:1-5.'
  apa: Gburrek, T., Schmalenstroeer, J., &#38; Haeb-Umbach, R. (2021). On Source-Microphone
    Distance Estimation Using Convolutional Recurrent Neural Networks. <i>Speech Communication;
    14th ITG-Symposium</i>, 1–5.
  bibtex: '@inproceedings{Gburrek_Schmalenstroeer_Haeb-Umbach_2021, title={On Source-Microphone
    Distance Estimation Using Convolutional Recurrent Neural Networks}, booktitle={Speech
    Communication; 14th ITG-Symposium}, author={Gburrek, Tobias and Schmalenstroeer,
    Joerg and Haeb-Umbach, Reinhold}, year={2021}, pages={1–5} }'
  chicago: Gburrek, Tobias, Joerg Schmalenstroeer, and Reinhold Haeb-Umbach. “On Source-Microphone
    Distance Estimation Using Convolutional Recurrent Neural Networks.” In <i>Speech
    Communication; 14th ITG-Symposium</i>, 1–5, 2021.
  ieee: T. Gburrek, J. Schmalenstroeer, and R. Haeb-Umbach, “On Source-Microphone
    Distance Estimation Using Convolutional Recurrent Neural Networks,” in <i>Speech
    Communication; 14th ITG-Symposium</i>, 2021, pp. 1–5.
  mla: Gburrek, Tobias, et al. “On Source-Microphone Distance Estimation Using Convolutional
    Recurrent Neural Networks.” <i>Speech Communication; 14th ITG-Symposium</i>, 2021,
    pp. 1–5.
  short: 'T. Gburrek, J. Schmalenstroeer, R. Haeb-Umbach, in: Speech Communication;
    14th ITG-Symposium, 2021, pp. 1–5.'
date_created: 2021-09-09T08:40:44Z
date_updated: 2023-11-17T06:32:20Z
ddc:
- '004'
department:
- _id: '54'
file:
- access_level: open_access
  content_type: application/pdf
  creator: tgburrek
  date_created: 2023-11-17T06:31:37Z
  date_updated: 2023-11-17T06:31:37Z
  file_id: '48989'
  file_name: dist_est.pdf
  file_size: 449694
  relation: main_file
file_date_updated: 2023-11-17T06:31:37Z
has_accepted_license: '1'
language:
- iso: eng
oa: '1'
page: 1-5
publication: Speech Communication; 14th ITG-Symposium
quality_controlled: '1'
status: public
title: On Source-Microphone Distance Estimation Using Convolutional Recurrent Neural
  Networks
type: conference
user_id: '44006'
year: '2021'
...
---
_id: '23997'
author:
- first_name: Aleksej
  full_name: Chinaev, Aleksej
  last_name: Chinaev
- first_name: Gerald
  full_name: Enzner, Gerald
  last_name: Enzner
- first_name: Tobias
  full_name: Gburrek, Tobias
  id: '44006'
  last_name: Gburrek
- first_name: Joerg
  full_name: Schmalenstroeer, Joerg
  id: '460'
  last_name: Schmalenstroeer
citation:
  ama: 'Chinaev A, Enzner G, Gburrek T, Schmalenstroeer J. Online Estimation of Sampling
    Rate Offsets in Wireless Acoustic Sensor Networks with Packet Loss. In: <i>29th
    European Signal Processing Conference (EUSIPCO)</i>. ; 2021:1-5.'
  apa: Chinaev, A., Enzner, G., Gburrek, T., &#38; Schmalenstroeer, J. (2021). Online
    Estimation of Sampling Rate Offsets in Wireless Acoustic Sensor Networks with
    Packet Loss. <i>29th European Signal Processing Conference (EUSIPCO)</i>, 1–5.
  bibtex: '@inproceedings{Chinaev_Enzner_Gburrek_Schmalenstroeer_2021, title={Online
    Estimation of Sampling Rate Offsets in Wireless Acoustic Sensor Networks with
    Packet Loss}, booktitle={29th European Signal Processing Conference (EUSIPCO)},
    author={Chinaev, Aleksej and Enzner, Gerald and Gburrek, Tobias and Schmalenstroeer,
    Joerg}, year={2021}, pages={1–5} }'
  chicago: Chinaev, Aleksej, Gerald Enzner, Tobias Gburrek, and Joerg Schmalenstroeer.
    “Online Estimation of Sampling Rate Offsets in Wireless Acoustic Sensor Networks
    with Packet Loss.” In <i>29th European Signal Processing Conference (EUSIPCO)</i>,
    1–5, 2021.
  ieee: A. Chinaev, G. Enzner, T. Gburrek, and J. Schmalenstroeer, “Online Estimation
    of Sampling Rate Offsets in Wireless Acoustic Sensor Networks with Packet Loss,”
    in <i>29th European Signal Processing Conference (EUSIPCO)</i>, 2021, pp. 1–5.
  mla: Chinaev, Aleksej, et al. “Online Estimation of Sampling Rate Offsets in Wireless
    Acoustic Sensor Networks with Packet Loss.” <i>29th European Signal Processing
    Conference (EUSIPCO)</i>, 2021, pp. 1–5.
  short: 'A. Chinaev, G. Enzner, T. Gburrek, J. Schmalenstroeer, in: 29th European
    Signal Processing Conference (EUSIPCO), 2021, pp. 1–5.'
date_created: 2021-09-09T08:39:06Z
date_updated: 2023-11-17T06:37:10Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://eurasip.org/Proceedings/Eusipco/Eusipco2021/pdfs/0001110.pdf
oa: '1'
page: 1-5
publication: 29th European Signal Processing Conference (EUSIPCO)
quality_controlled: '1'
status: public
title: Online Estimation of Sampling Rate Offsets in Wireless Acoustic Sensor Networks
  with Packet Loss
type: conference
user_id: '44006'
year: '2021'
...
---
_id: '18651'
abstract:
- lang: eng
  text: 'We present an approach to deep neural network based (DNN-based) distance
    estimation in reverberant rooms for supporting geometry calibration tasks in wireless
    acoustic sensor networks. Signal diffuseness information from acoustic signals
    is aggregated via the coherent-to-diffuse power ratio to obtain a distance-related
    feature, which is mapped to a source-to-microphone distance estimate by means
    of a DNN. This information is then combined with direction-of-arrival estimates
    from compact microphone arrays to infer the geometry of the sensor network. Unlike
    many other approaches to geometry calibration, the proposed scheme does only require
    that the sampling clocks of the sensor nodes are roughly synchronized. In simulations
    we show that the proposed DNN-based distance estimator generalizes to unseen acoustic
    environments and that precise estimates of the sensor node positions are obtained. '
author:
- first_name: Tobias
  full_name: Gburrek, Tobias
  id: '44006'
  last_name: Gburrek
- first_name: Joerg
  full_name: Schmalenstroeer, Joerg
  id: '460'
  last_name: Schmalenstroeer
- first_name: Andreas
  full_name: Brendel, Andreas
  last_name: Brendel
- first_name: Walter
  full_name: Kellermann, Walter
  last_name: Kellermann
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Gburrek T, Schmalenstroeer J, Brendel A, Kellermann W, Haeb-Umbach R. Deep
    Neural Network based Distance Estimation for Geometry Calibration in Acoustic
    Sensor Network. In: <i>European Signal Processing Conference (EUSIPCO)</i>. ;
    2020.'
  apa: Gburrek, T., Schmalenstroeer, J., Brendel, A., Kellermann, W., &#38; Haeb-Umbach,
    R. (2020). Deep Neural Network based Distance Estimation for Geometry Calibration
    in Acoustic Sensor Network. <i>European Signal Processing Conference (EUSIPCO)</i>.
  bibtex: '@inproceedings{Gburrek_Schmalenstroeer_Brendel_Kellermann_Haeb-Umbach_2020,
    title={Deep Neural Network based Distance Estimation for Geometry Calibration
    in Acoustic Sensor Network}, booktitle={European Signal Processing Conference
    (EUSIPCO)}, author={Gburrek, Tobias and Schmalenstroeer, Joerg and Brendel, Andreas
    and Kellermann, Walter and Haeb-Umbach, Reinhold}, year={2020} }'
  chicago: Gburrek, Tobias, Joerg Schmalenstroeer, Andreas Brendel, Walter Kellermann,
    and Reinhold Haeb-Umbach. “Deep Neural Network Based Distance Estimation for Geometry
    Calibration in Acoustic Sensor Network.” In <i>European Signal Processing Conference
    (EUSIPCO)</i>, 2020.
  ieee: T. Gburrek, J. Schmalenstroeer, A. Brendel, W. Kellermann, and R. Haeb-Umbach,
    “Deep Neural Network based Distance Estimation for Geometry Calibration in Acoustic
    Sensor Network,” 2020.
  mla: Gburrek, Tobias, et al. “Deep Neural Network Based Distance Estimation for
    Geometry Calibration in Acoustic Sensor Network.” <i>European Signal Processing
    Conference (EUSIPCO)</i>, 2020.
  short: 'T. Gburrek, J. Schmalenstroeer, A. Brendel, W. Kellermann, R. Haeb-Umbach,
    in: European Signal Processing Conference (EUSIPCO), 2020.'
date_created: 2020-08-31T07:20:57Z
date_updated: 2023-11-17T06:23:39Z
ddc:
- '004'
department:
- _id: '54'
file:
- access_level: open_access
  content_type: application/pdf
  creator: tgburrek
  date_created: 2023-11-17T06:21:40Z
  date_updated: 2023-11-17T06:21:40Z
  file_id: '48987'
  file_name: Gburrek2020.pdf
  file_size: 292159
  relation: main_file
file_date_updated: 2023-11-17T06:21:40Z
has_accepted_license: '1'
language:
- iso: eng
oa: '1'
publication: European Signal Processing Conference (EUSIPCO)
quality_controlled: '1'
status: public
title: Deep Neural Network based Distance Estimation for Geometry Calibration in Acoustic
  Sensor Network
type: conference
user_id: '44006'
year: '2020'
...
---
_id: '15237'
abstract:
- lang: eng
  text: This  paper  presents  an  approach  to  voice  conversion,  whichdoes neither
    require parallel data nor speaker or phone labels fortraining.  It can convert
    between speakers which are not in thetraining set by employing the previously
    proposed concept of afactorized hierarchical variational autoencoder. Here, linguisticand
    speaker induced variations are separated upon the notionthat content induced variations
    change at a much shorter timescale, i.e., at the segment level, than speaker induced
    variations,which vary at the longer utterance level. In this contribution wepropose
    to employ convolutional instead of recurrent networklayers  in  the  encoder  and  decoder  blocks,  which  is  shown  toachieve
    better phone recognition accuracy on the latent segmentvariables at frame-level
    due to their better temporal resolution.For voice conversion the mean of the utterance
    variables is re-placed with the respective estimated mean of the target speaker.The
    resulting log-mel spectra of the decoder output are used aslocal conditions of
    a WaveNet which is utilized for synthesis ofthe speech waveforms.  Experiments
    show both good disentan-glement properties of the latent space variables, and
    good voiceconversion performance.
author:
- first_name: Tobias
  full_name: Gburrek, Tobias
  id: '44006'
  last_name: Gburrek
- first_name: Thomas
  full_name: Glarner, Thomas
  id: '14169'
  last_name: Glarner
- first_name: Janek
  full_name: Ebbers, Janek
  id: '34851'
  last_name: Ebbers
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
- first_name: Petra
  full_name: Wagner, Petra
  last_name: Wagner
citation:
  ama: 'Gburrek T, Glarner T, Ebbers J, Haeb-Umbach R, Wagner P. Unsupervised Learning
    of a Disentangled Speech Representation for Voice Conversion. In: <i>Proc. 10th
    ISCA Speech Synthesis Workshop</i>. ; 2019:81-86. doi:<a href="https://doi.org/10.21437/SSW.2019-15">10.21437/SSW.2019-15</a>'
  apa: Gburrek, T., Glarner, T., Ebbers, J., Haeb-Umbach, R., &#38; Wagner, P. (2019).
    Unsupervised Learning of a Disentangled Speech Representation for Voice Conversion.
    <i>Proc. 10th ISCA Speech Synthesis Workshop</i>, 81–86. <a href="https://doi.org/10.21437/SSW.2019-15">https://doi.org/10.21437/SSW.2019-15</a>
  bibtex: '@inproceedings{Gburrek_Glarner_Ebbers_Haeb-Umbach_Wagner_2019, title={Unsupervised
    Learning of a Disentangled Speech Representation for Voice Conversion}, DOI={<a
    href="https://doi.org/10.21437/SSW.2019-15">10.21437/SSW.2019-15</a>}, booktitle={Proc.
    10th ISCA Speech Synthesis Workshop}, author={Gburrek, Tobias and Glarner, Thomas
    and Ebbers, Janek and Haeb-Umbach, Reinhold and Wagner, Petra}, year={2019}, pages={81–86}
    }'
  chicago: Gburrek, Tobias, Thomas Glarner, Janek Ebbers, Reinhold Haeb-Umbach, and
    Petra Wagner. “Unsupervised Learning of a Disentangled Speech Representation for
    Voice Conversion.” In <i>Proc. 10th ISCA Speech Synthesis Workshop</i>, 81–86,
    2019. <a href="https://doi.org/10.21437/SSW.2019-15">https://doi.org/10.21437/SSW.2019-15</a>.
  ieee: 'T. Gburrek, T. Glarner, J. Ebbers, R. Haeb-Umbach, and P. Wagner, “Unsupervised
    Learning of a Disentangled Speech Representation for Voice Conversion,” in <i>Proc.
    10th ISCA Speech Synthesis Workshop</i>, Vienna, 2019, pp. 81–86, doi: <a href="https://doi.org/10.21437/SSW.2019-15">10.21437/SSW.2019-15</a>.'
  mla: Gburrek, Tobias, et al. “Unsupervised Learning of a Disentangled Speech Representation
    for Voice Conversion.” <i>Proc. 10th ISCA Speech Synthesis Workshop</i>, 2019,
    pp. 81–86, doi:<a href="https://doi.org/10.21437/SSW.2019-15">10.21437/SSW.2019-15</a>.
  short: 'T. Gburrek, T. Glarner, J. Ebbers, R. Haeb-Umbach, P. Wagner, in: Proc.
    10th ISCA Speech Synthesis Workshop, 2019, pp. 81–86.'
conference:
  location: Vienna
  name: 10th ISCA Speech Synthesis Workshop
date_created: 2019-12-04T08:12:29Z
date_updated: 2023-11-17T06:20:39Z
department:
- _id: '54'
doi: 10.21437/SSW.2019-15
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://www.isca-speech.org/archive/pdfs/ssw_2019/gburrek19_ssw.pdf
oa: '1'
page: 81-86
publication: Proc. 10th ISCA Speech Synthesis Workshop
quality_controlled: '1'
related_material:
  link:
  - description: Listening examples
    relation: supplementary_material
    url: http://go.upb.de/vcex
status: public
title: Unsupervised Learning of a Disentangled Speech Representation for Voice Conversion
type: conference
user_id: '44006'
year: '2019'
...
