---
_id: '11912'
abstract:
- lang: eng
  text: In this contribution we provide a unified treatment of blind source separation
    (BSS) and noise suppression, two tasks which have traditionally been considered
    different and for which quite different techniques have been developed. Exploiting
    the sparseness of the sources in the short time frequency domain and using a probabilistic
    model which accounts for the presence of additive noise and which captures the
    spatial information of the multi-channel recording, a speech enhancement system
    is developed which suppresses noise and simultaneously separates speakers in case
    multiple speakers are active. Source activity estimation and model parameter estimation
    form the E-step and the M-step of the Expectation Maximization algorithm, respectively.
    Experimental results obtained on the dataset of the Signal Separation Evaluation
    Campaign 2010 demonstrate the effectiveness of the proposed system.
author:
- first_name: Dang Hai
  full_name: Tran Vu, Dang Hai
  last_name: Tran Vu
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Tran Vu DH, Haeb-Umbach R. An EM Approach to Integrated Multichannel Speech
    Separation and Noise Suppression. In: <i>International Workshop on Acoustic Echo
    and Noise Control (IWAENC 2010)</i>. ; 2010.'
  apa: Tran Vu, D. H., &#38; Haeb-Umbach, R. (2010). An EM Approach to Integrated
    Multichannel Speech Separation and Noise Suppression. In <i>International Workshop
    on Acoustic Echo and Noise Control (IWAENC 2010)</i>.
  bibtex: '@inproceedings{Tran Vu_Haeb-Umbach_2010, title={An EM Approach to Integrated
    Multichannel Speech Separation and Noise Suppression}, booktitle={International
    Workshop on Acoustic Echo and Noise Control (IWAENC 2010)}, author={Tran Vu, Dang
    Hai and Haeb-Umbach, Reinhold}, year={2010} }'
  chicago: Tran Vu, Dang Hai, and Reinhold Haeb-Umbach. “An EM Approach to Integrated
    Multichannel Speech Separation and Noise Suppression.” In <i>International Workshop
    on Acoustic Echo and Noise Control (IWAENC 2010)</i>, 2010.
  ieee: D. H. Tran Vu and R. Haeb-Umbach, “An EM Approach to Integrated Multichannel
    Speech Separation and Noise Suppression,” in <i>International Workshop on Acoustic
    Echo and Noise Control (IWAENC 2010)</i>, 2010.
  mla: Tran Vu, Dang Hai, and Reinhold Haeb-Umbach. “An EM Approach to Integrated
    Multichannel Speech Separation and Noise Suppression.” <i>International Workshop
    on Acoustic Echo and Noise Control (IWAENC 2010)</i>, 2010.
  short: 'D.H. Tran Vu, R. Haeb-Umbach, in: International Workshop on Acoustic Echo
    and Noise Control (IWAENC 2010), 2010.'
date_created: 2019-07-12T05:30:39Z
date_updated: 2022-01-06T06:51:12Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2010/DaHa10-1.pdf
oa: '1'
publication: International Workshop on Acoustic Echo and Noise Control (IWAENC 2010)
status: public
title: An EM Approach to Integrated Multichannel Speech Separation and Noise Suppression
type: conference
user_id: '44006'
year: '2010'
...
---
_id: '11913'
abstract:
- lang: eng
  text: In this paper we propose to employ directional statistics in a complex vector
    space to approach the problem of blind speech separation in the presence of spatially
    correlated noise. We interpret the values of the short time Fourier transform
    of the microphone signals to be draws from a mixture of complex Watson distributions,
    a probabilistic model which naturally accounts for spatial aliasing. The parameters
    of the density are related to the a priori source probabilities, the power of
    the sources and the transfer function ratios from sources to sensors. Estimation
    formulas are derived for these parameters by employing the Expectation Maximization
    (EM) algorithm. The E-step corresponds to the estimation of the source presence
    probabilities for each time-frequency bin, while the M-step leads to a maximum
    signal-to-noise ratio (MaxSNR) beamformer in the presence of uncertainty about
    the source activity. Experimental results are reported for an implementation in
    a generalized sidelobe canceller (GSC) like spatial beamforming configuration
    for 3 speech sources with significant coherent noise in reverberant environments,
    demonstrating the usefulness of the novel modeling framework.
author:
- first_name: Dang Hai
  full_name: Tran Vu, Dang Hai
  last_name: Tran Vu
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Tran Vu DH, Haeb-Umbach R. Blind speech separation employing directional statistics
    in an Expectation Maximization framework. In: <i>IEEE International Conference
    on Acoustics, Speech and Signal Processing (ICASSP 2010)</i>. ; 2010:241-244.
    doi:<a href="https://doi.org/10.1109/ICASSP.2010.5495994">10.1109/ICASSP.2010.5495994</a>'
  apa: Tran Vu, D. H., &#38; Haeb-Umbach, R. (2010). Blind speech separation employing
    directional statistics in an Expectation Maximization framework. In <i>IEEE International
    Conference on Acoustics, Speech and Signal Processing (ICASSP 2010)</i> (pp. 241–244).
    <a href="https://doi.org/10.1109/ICASSP.2010.5495994">https://doi.org/10.1109/ICASSP.2010.5495994</a>
  bibtex: '@inproceedings{Tran Vu_Haeb-Umbach_2010, title={Blind speech separation
    employing directional statistics in an Expectation Maximization framework}, DOI={<a
    href="https://doi.org/10.1109/ICASSP.2010.5495994">10.1109/ICASSP.2010.5495994</a>},
    booktitle={IEEE International Conference on Acoustics, Speech and Signal Processing
    (ICASSP 2010)}, author={Tran Vu, Dang Hai and Haeb-Umbach, Reinhold}, year={2010},
    pages={241–244} }'
  chicago: Tran Vu, Dang Hai, and Reinhold Haeb-Umbach. “Blind Speech Separation Employing
    Directional Statistics in an Expectation Maximization Framework.” In <i>IEEE International
    Conference on Acoustics, Speech and Signal Processing (ICASSP 2010)</i>, 241–44,
    2010. <a href="https://doi.org/10.1109/ICASSP.2010.5495994">https://doi.org/10.1109/ICASSP.2010.5495994</a>.
  ieee: D. H. Tran Vu and R. Haeb-Umbach, “Blind speech separation employing directional
    statistics in an Expectation Maximization framework,” in <i>IEEE International
    Conference on Acoustics, Speech and Signal Processing (ICASSP 2010)</i>, 2010,
    pp. 241–244.
  mla: Tran Vu, Dang Hai, and Reinhold Haeb-Umbach. “Blind Speech Separation Employing
    Directional Statistics in an Expectation Maximization Framework.” <i>IEEE International
    Conference on Acoustics, Speech and Signal Processing (ICASSP 2010)</i>, 2010,
    pp. 241–44, doi:<a href="https://doi.org/10.1109/ICASSP.2010.5495994">10.1109/ICASSP.2010.5495994</a>.
  short: 'D.H. Tran Vu, R. Haeb-Umbach, in: IEEE International Conference on Acoustics,
    Speech and Signal Processing (ICASSP 2010), 2010, pp. 241–244.'
date_created: 2019-07-12T05:30:40Z
date_updated: 2022-01-06T06:51:12Z
department:
- _id: '54'
doi: 10.1109/ICASSP.2010.5495994
keyword:
- array signal processing
- blind source separation
- blind speech separation
- complex vector space
- complex Watson distribution
- directional statistics
- expectation-maximisation algorithm
- expectation maximization algorithm
- Fourier transform
- Fourier transforms
- generalized sidelobe canceller
- interference suppression
- maximum signal-to-noise ratio beamformer
- microphone signal
- probabilistic model
- spatial aliasing
- spatial beamforming configuration
- speech enhancement
- statistical distributions
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2010/DaHa10-2.pdf
oa: '1'
page: 241-244
publication: IEEE International Conference on Acoustics, Speech and Signal Processing
  (ICASSP 2010)
status: public
title: Blind speech separation employing directional statistics in an Expectation
  Maximization framework
type: conference
user_id: '44006'
year: '2010'
...
---
_id: '11892'
abstract:
- lang: eng
  text: For an environment to be perceived as being smart, contextual information
    has to be gathered to adapt the system's behavior and its interface towards the
    user. Being a rich source of context information speech can be acquired unobtrusively
    by microphone arrays and then processed to extract information about the user
    and his environment. In this paper, a system for joint temporal segmentation,
    speaker localization, and identification is presented, which is supported by face
    identification from video data obtained from a steerable camera. Special attention
    is paid to latency aspects and online processing capabilities, as they are important
    for the application under investigation, namely ambient communication. It describes
    the vision of terminal-less, session-less and multi-modal telecommunication with
    remote partners, where the user can move freely within his home while the communication
    follows him. The speaker diarization serves as a context source, which has been
    integrated in a service-oriented middleware architecture and provided to the application
    to select the most appropriate I/O device and to steer the camera towards the
    speaker during ambient communication.
author:
- first_name: Joerg
  full_name: Schmalenstroeer, Joerg
  id: '460'
  last_name: Schmalenstroeer
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: Schmalenstroeer J, Haeb-Umbach R. Online Diarization of Streaming Audio-Visual
    Data for Smart Environments. <i>IEEE Journal of Selected Topics in Signal Processing</i>.
    2010;4(5):845-856. doi:<a href="https://doi.org/10.1109/JSTSP.2010.2050519">10.1109/JSTSP.2010.2050519</a>
  apa: Schmalenstroeer, J., &#38; Haeb-Umbach, R. (2010). Online Diarization of Streaming
    Audio-Visual Data for Smart Environments. <i>IEEE Journal of Selected Topics in
    Signal Processing</i>, <i>4</i>(5), 845–856. <a href="https://doi.org/10.1109/JSTSP.2010.2050519">https://doi.org/10.1109/JSTSP.2010.2050519</a>
  bibtex: '@article{Schmalenstroeer_Haeb-Umbach_2010, title={Online Diarization of
    Streaming Audio-Visual Data for Smart Environments}, volume={4}, DOI={<a href="https://doi.org/10.1109/JSTSP.2010.2050519">10.1109/JSTSP.2010.2050519</a>},
    number={5}, journal={IEEE Journal of Selected Topics in Signal Processing}, author={Schmalenstroeer,
    Joerg and Haeb-Umbach, Reinhold}, year={2010}, pages={845–856} }'
  chicago: 'Schmalenstroeer, Joerg, and Reinhold Haeb-Umbach. “Online Diarization
    of Streaming Audio-Visual Data for Smart Environments.” <i>IEEE Journal of Selected
    Topics in Signal Processing</i> 4, no. 5 (2010): 845–56. <a href="https://doi.org/10.1109/JSTSP.2010.2050519">https://doi.org/10.1109/JSTSP.2010.2050519</a>.'
  ieee: 'J. Schmalenstroeer and R. Haeb-Umbach, “Online Diarization of Streaming Audio-Visual
    Data for Smart Environments,” <i>IEEE Journal of Selected Topics in Signal Processing</i>,
    vol. 4, no. 5, pp. 845–856, 2010, doi: <a href="https://doi.org/10.1109/JSTSP.2010.2050519">10.1109/JSTSP.2010.2050519</a>.'
  mla: Schmalenstroeer, Joerg, and Reinhold Haeb-Umbach. “Online Diarization of Streaming
    Audio-Visual Data for Smart Environments.” <i>IEEE Journal of Selected Topics
    in Signal Processing</i>, vol. 4, no. 5, 2010, pp. 845–56, doi:<a href="https://doi.org/10.1109/JSTSP.2010.2050519">10.1109/JSTSP.2010.2050519</a>.
  short: J. Schmalenstroeer, R. Haeb-Umbach, IEEE Journal of Selected Topics in Signal
    Processing 4 (2010) 845–856.
date_created: 2019-07-12T05:30:16Z
date_updated: 2023-10-26T08:10:18Z
department:
- _id: '54'
doi: 10.1109/JSTSP.2010.2050519
intvolume: '         4'
issue: '5'
keyword:
- audio streaming
- audio visual data streaming
- context information speech
- face identification
- face recognition
- image segmentation
- middleware
- multimodal telecommunication
- online diarization
- service oriented middleware architecture
- sessionless telecommunication
- software architecture
- speaker identification
- speaker localization
- speaker recognition
- steerable camera
- telecommunication computing
- temporal segmentation
- terminal-less telecommunication
- video streaming
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2010/ScHa10.pdf
oa: '1'
page: 845-856
publication: IEEE Journal of Selected Topics in Signal Processing
quality_controlled: '1'
status: public
title: Online Diarization of Streaming Audio-Visual Data for Smart Environments
type: journal_article
user_id: '460'
volume: 4
year: '2010'
...
---
_id: '11723'
abstract:
- lang: eng
  text: In this paper we present a novel vehicle tracking algorithm, which is based
    on multi-level sensor fusion of GPS (global positioning system) with Inertial
    Measurement Unit sensor data. It is shown that the robustness of the system to
    temporary dropouts of the GPS signal, which may occur due to limited visibility
    of satellites in narrow street canyons or tunnels, is greatly improved by sensor
    fusion. We further demonstrate how the observation and state noise covariances
    of the employed Kalman filters can be estimated alongside the filtering by an
    application of the Expectation-Maximization algorithm. The proposed time-variant
    multi-level Kalman filter is shown to outperform an Interacting Multiple Model
    approach while at the same time being computationally less demanding.
author:
- first_name: Maik
  full_name: Bevermeier, Maik
  last_name: Bevermeier
- first_name: Sven
  full_name: Peschke, Sven
  last_name: Peschke
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Bevermeier M, Peschke S, Haeb-Umbach R. Robust vehicle localization based
    on multi-level sensor fusion and online parameter estimation. In: <i>6th Workshop
    on Positioning Navigation and Communication (WPNC 2009)</i>. ; 2009:235-242. doi:<a
    href="https://doi.org/10.1109/WPNC.2009.4907833">10.1109/WPNC.2009.4907833</a>'
  apa: Bevermeier, M., Peschke, S., &#38; Haeb-Umbach, R. (2009). Robust vehicle localization
    based on multi-level sensor fusion and online parameter estimation. In <i>6th
    Workshop on Positioning Navigation and Communication (WPNC 2009)</i> (pp. 235–242).
    <a href="https://doi.org/10.1109/WPNC.2009.4907833">https://doi.org/10.1109/WPNC.2009.4907833</a>
  bibtex: '@inproceedings{Bevermeier_Peschke_Haeb-Umbach_2009, title={Robust vehicle
    localization based on multi-level sensor fusion and online parameter estimation},
    DOI={<a href="https://doi.org/10.1109/WPNC.2009.4907833">10.1109/WPNC.2009.4907833</a>},
    booktitle={6th Workshop on Positioning Navigation and Communication (WPNC 2009)},
    author={Bevermeier, Maik and Peschke, Sven and Haeb-Umbach, Reinhold}, year={2009},
    pages={235–242} }'
  chicago: Bevermeier, Maik, Sven Peschke, and Reinhold Haeb-Umbach. “Robust Vehicle
    Localization Based on Multi-Level Sensor Fusion and Online Parameter Estimation.”
    In <i>6th Workshop on Positioning Navigation and Communication (WPNC 2009)</i>,
    235–42, 2009. <a href="https://doi.org/10.1109/WPNC.2009.4907833">https://doi.org/10.1109/WPNC.2009.4907833</a>.
  ieee: M. Bevermeier, S. Peschke, and R. Haeb-Umbach, “Robust vehicle localization
    based on multi-level sensor fusion and online parameter estimation,” in <i>6th
    Workshop on Positioning Navigation and Communication (WPNC 2009)</i>, 2009, pp.
    235–242.
  mla: Bevermeier, Maik, et al. “Robust Vehicle Localization Based on Multi-Level
    Sensor Fusion and Online Parameter Estimation.” <i>6th Workshop on Positioning
    Navigation and Communication (WPNC 2009)</i>, 2009, pp. 235–42, doi:<a href="https://doi.org/10.1109/WPNC.2009.4907833">10.1109/WPNC.2009.4907833</a>.
  short: 'M. Bevermeier, S. Peschke, R. Haeb-Umbach, in: 6th Workshop on Positioning
    Navigation and Communication (WPNC 2009), 2009, pp. 235–242.'
date_created: 2019-07-12T05:27:01Z
date_updated: 2022-01-06T06:51:07Z
department:
- _id: '54'
doi: 10.1109/WPNC.2009.4907833
keyword:
- covariance matrices
- expectation-maximisation algorithm
- expectation-maximization algorithm
- global positioning system
- Global Positioning System
- GPS
- inertial measurement unit
- interacting multiple model approach
- Kalman filters
- multilevel sensor fusion
- narrow street canyons
- narrow tunnels
- online parameter estimation
- parameter estimation
- road vehicles
- robust vehicle localization
- sensor fusion
- state noise covariances
- time-variant multilevel Kalman filter
- vehicle tracking algorithm
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2009/BePeHa09.pdf
oa: '1'
page: 235-242
publication: 6th Workshop on Positioning Navigation and Communication (WPNC 2009)
status: public
title: Robust vehicle localization based on multi-level sensor fusion and online parameter
  estimation
type: conference
user_id: '44006'
year: '2009'
...
---
_id: '11724'
abstract:
- lang: eng
  text: In this paper we present a novel vehicle tracking method which is based on
    multi-stage Kalman filtering of GPS and IMU sensor data. After individual Kalman
    filtering of GPS and IMU measurements the estimates of the orientation of the
    vehicle are combined in an optimal manner to improve the robustness towards drift
    errors. The tracking algorithm incorporates the estimation of time-variant covariance
    parameters by using an iterative block Expectation-Maximization algorithm to account
    for time-variant driving conditions and measurement quality. The proposed system
    is compared to an interacting multiple model approach (IMM) and achieves improved
    localization accuracy at lower computational complexity. Furthermore we show how
    the joint parameter estimation and localizaiton can be conducted with streaming
    input data to be able to track vehicles in a real driving environment.
author:
- first_name: Maik
  full_name: Bevermeier, Maik
  last_name: Bevermeier
- first_name: Sven
  full_name: Peschke, Sven
  last_name: Peschke
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Bevermeier M, Peschke S, Haeb-Umbach R. Joint Parameter Estimation and Tracking
    in a Multi-Stage Kalman Filter for Vehicle Positioning. In: <i>IEEE 69th Vehicular
    Technology Conference (VTC 2009 Spring)</i>. ; 2009:1-5. doi:<a href="https://doi.org/10.1109/VETECS.2009.5073634">10.1109/VETECS.2009.5073634</a>'
  apa: Bevermeier, M., Peschke, S., &#38; Haeb-Umbach, R. (2009). Joint Parameter
    Estimation and Tracking in a Multi-Stage Kalman Filter for Vehicle Positioning.
    In <i>IEEE 69th Vehicular Technology Conference (VTC 2009 Spring)</i> (pp. 1–5).
    <a href="https://doi.org/10.1109/VETECS.2009.5073634">https://doi.org/10.1109/VETECS.2009.5073634</a>
  bibtex: '@inproceedings{Bevermeier_Peschke_Haeb-Umbach_2009, title={Joint Parameter
    Estimation and Tracking in a Multi-Stage Kalman Filter for Vehicle Positioning},
    DOI={<a href="https://doi.org/10.1109/VETECS.2009.5073634">10.1109/VETECS.2009.5073634</a>},
    booktitle={IEEE 69th Vehicular Technology Conference (VTC 2009 Spring)}, author={Bevermeier,
    Maik and Peschke, Sven and Haeb-Umbach, Reinhold}, year={2009}, pages={1–5} }'
  chicago: Bevermeier, Maik, Sven Peschke, and Reinhold Haeb-Umbach. “Joint Parameter
    Estimation and Tracking in a Multi-Stage Kalman Filter for Vehicle Positioning.”
    In <i>IEEE 69th Vehicular Technology Conference (VTC 2009 Spring)</i>, 1–5, 2009.
    <a href="https://doi.org/10.1109/VETECS.2009.5073634">https://doi.org/10.1109/VETECS.2009.5073634</a>.
  ieee: M. Bevermeier, S. Peschke, and R. Haeb-Umbach, “Joint Parameter Estimation
    and Tracking in a Multi-Stage Kalman Filter for Vehicle Positioning,” in <i>IEEE
    69th Vehicular Technology Conference (VTC 2009 Spring)</i>, 2009, pp. 1–5.
  mla: Bevermeier, Maik, et al. “Joint Parameter Estimation and Tracking in a Multi-Stage
    Kalman Filter for Vehicle Positioning.” <i>IEEE 69th Vehicular Technology Conference
    (VTC 2009 Spring)</i>, 2009, pp. 1–5, doi:<a href="https://doi.org/10.1109/VETECS.2009.5073634">10.1109/VETECS.2009.5073634</a>.
  short: 'M. Bevermeier, S. Peschke, R. Haeb-Umbach, in: IEEE 69th Vehicular Technology
    Conference (VTC 2009 Spring), 2009, pp. 1–5.'
date_created: 2019-07-12T05:27:02Z
date_updated: 2022-01-06T06:51:07Z
department:
- _id: '54'
doi: 10.1109/VETECS.2009.5073634
keyword:
- computational complexity
- expectation-maximisation algorithm
- Global Positioning System
- inertial measurement unit
- inertial navigation
- interacting multiple model
- iterative block expectation-maximization algorithm
- Kalman filters
- multi-stage Kalman filter
- parameter estimation
- road vehicles
- vehicle positioning
- vehicle tracking
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2009/BePeHa09-1.pdf
oa: '1'
page: 1-5
publication: IEEE 69th Vehicular Technology Conference (VTC 2009 Spring)
status: public
title: Joint Parameter Estimation and Tracking in a Multi-Stage Kalman Filter for
  Vehicle Positioning
type: conference
user_id: '44006'
year: '2009'
...
---
_id: '11725'
author:
- first_name: Maik
  full_name: Bevermeier, Maik
  last_name: Bevermeier
- first_name: Sven
  full_name: Peschke, Sven
  last_name: Peschke
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Bevermeier M, Peschke S, Haeb-Umbach R. Eine Plattform fuer Mehrwertdienste
    im Bereich Logistik - Drahtlose Fahrzeug- und Laderaumueberwachung fuer LKW mit
    Hilfe einer Maut-On-Board Unit. In: <i>DGON Navigationskonvent 2009</i>. ; 2009.'
  apa: Bevermeier, M., Peschke, S., &#38; Haeb-Umbach, R. (2009). Eine Plattform fuer
    Mehrwertdienste im Bereich Logistik - Drahtlose Fahrzeug- und Laderaumueberwachung
    fuer LKW mit Hilfe einer Maut-On-Board Unit. In <i>DGON Navigationskonvent 2009</i>.
  bibtex: '@inproceedings{Bevermeier_Peschke_Haeb-Umbach_2009, title={Eine Plattform
    fuer Mehrwertdienste im Bereich Logistik - Drahtlose Fahrzeug- und Laderaumueberwachung
    fuer LKW mit Hilfe einer Maut-On-Board Unit}, booktitle={DGON Navigationskonvent
    2009}, author={Bevermeier, Maik and Peschke, Sven and Haeb-Umbach, Reinhold},
    year={2009} }'
  chicago: Bevermeier, Maik, Sven Peschke, and Reinhold Haeb-Umbach. “Eine Plattform
    Fuer Mehrwertdienste Im Bereich Logistik - Drahtlose Fahrzeug- Und Laderaumueberwachung
    Fuer LKW Mit Hilfe Einer Maut-On-Board Unit.” In <i>DGON Navigationskonvent 2009</i>,
    2009.
  ieee: M. Bevermeier, S. Peschke, and R. Haeb-Umbach, “Eine Plattform fuer Mehrwertdienste
    im Bereich Logistik - Drahtlose Fahrzeug- und Laderaumueberwachung fuer LKW mit
    Hilfe einer Maut-On-Board Unit,” in <i>DGON Navigationskonvent 2009</i>, 2009.
  mla: Bevermeier, Maik, et al. “Eine Plattform Fuer Mehrwertdienste Im Bereich Logistik
    - Drahtlose Fahrzeug- Und Laderaumueberwachung Fuer LKW Mit Hilfe Einer Maut-On-Board
    Unit.” <i>DGON Navigationskonvent 2009</i>, 2009.
  short: 'M. Bevermeier, S. Peschke, R. Haeb-Umbach, in: DGON Navigationskonvent 2009,
    2009.'
date_created: 2019-07-12T05:27:03Z
date_updated: 2022-01-06T06:51:07Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2009/BePeHa09-2.pdf
oa: '1'
publication: DGON Navigationskonvent 2009
status: public
title: Eine Plattform fuer Mehrwertdienste im Bereich Logistik - Drahtlose Fahrzeug-
  und Laderaumueberwachung fuer LKW mit Hilfe einer Maut-On-Board Unit
type: conference
user_id: '44006'
year: '2009'
...
---
_id: '11847'
abstract:
- lang: eng
  text: In this paper we present a new feature space dereverberation technique for
    automatic speech recognition. We derive an expression for the dependence of the
    reverberant speech features in the log-mel spectral domain on the non-reverberant
    speech features and the room impulse response. The obtained observation model
    is used for a model based speech enhancement based on Kalman filtering. The performance
    of the proposed enhancement technique is studied on the AURORA5 database. In our
    currently best configuration, which includes uncertainty decoding, the number
    of recognition errors is approximately halved compared to the recognition of unprocessed
    speech.
author:
- first_name: Alexander
  full_name: Krueger, Alexander
  last_name: Krueger
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Krueger A, Haeb-Umbach R. Model based feature enhancement for automatic speech
    recognition in reverberant environments. In: <i>Interspeech 2009</i>. ; 2009.'
  apa: Krueger, A., &#38; Haeb-Umbach, R. (2009). Model based feature enhancement
    for automatic speech recognition in reverberant environments. In <i>Interspeech
    2009</i>.
  bibtex: '@inproceedings{Krueger_Haeb-Umbach_2009, title={Model based feature enhancement
    for automatic speech recognition in reverberant environments}, booktitle={Interspeech
    2009}, author={Krueger, Alexander and Haeb-Umbach, Reinhold}, year={2009} }'
  chicago: Krueger, Alexander, and Reinhold Haeb-Umbach. “Model Based Feature Enhancement
    for Automatic Speech Recognition in Reverberant Environments.” In <i>Interspeech
    2009</i>, 2009.
  ieee: A. Krueger and R. Haeb-Umbach, “Model based feature enhancement for automatic
    speech recognition in reverberant environments,” in <i>Interspeech 2009</i>, 2009.
  mla: Krueger, Alexander, and Reinhold Haeb-Umbach. “Model Based Feature Enhancement
    for Automatic Speech Recognition in Reverberant Environments.” <i>Interspeech
    2009</i>, 2009.
  short: 'A. Krueger, R. Haeb-Umbach, in: Interspeech 2009, 2009.'
date_created: 2019-07-12T05:29:24Z
date_updated: 2022-01-06T06:51:11Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2009/KrHa09.pdf
oa: '1'
publication: Interspeech 2009
status: public
title: Model based feature enhancement for automatic speech recognition in reverberant
  environments
type: conference
user_id: '44006'
year: '2009'
...
---
_id: '11859'
abstract:
- lang: eng
  text: In this paper we present an Uncertainty Decoding rule which exploits feature
    reliability information and interframe correlation for noise robust speech recognition.
    The reliability information can be obtained either from conditional Bayesian estimation,
    where speech and noise feature vectors are tracked jointly, or by augmenting conventional
    point estimation methods with heuristics about the estimator's reliability. Experimental
    results on the AURORA2 database demonstrate on the one hand that Uncertainty Decoding
    improves recognition performance, while on the other hand it is seen that the
    severe approximations needed to arrive at computationally tractable solutions
    have their noticable impact on recognition performance.
author:
- first_name: Volker
  full_name: Leutnant, Volker
  last_name: Leutnant
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Leutnant V, Haeb-Umbach R. On the Estimation and Use of Feature Reliability
    Information for Noise Robust Speech Recognition. In: <i>International Conference
    on Acoustics (NAG/DAGA 2009)</i>. ; 2009.'
  apa: Leutnant, V., &#38; Haeb-Umbach, R. (2009). On the Estimation and Use of Feature
    Reliability Information for Noise Robust Speech Recognition. In <i>International
    Conference on Acoustics (NAG/DAGA 2009)</i>.
  bibtex: '@inproceedings{Leutnant_Haeb-Umbach_2009, title={On the Estimation and
    Use of Feature Reliability Information for Noise Robust Speech Recognition}, booktitle={International
    Conference on Acoustics (NAG/DAGA 2009)}, author={Leutnant, Volker and Haeb-Umbach,
    Reinhold}, year={2009} }'
  chicago: Leutnant, Volker, and Reinhold Haeb-Umbach. “On the Estimation and Use
    of Feature Reliability Information for Noise Robust Speech Recognition.” In <i>International
    Conference on Acoustics (NAG/DAGA 2009)</i>, 2009.
  ieee: V. Leutnant and R. Haeb-Umbach, “On the Estimation and Use of Feature Reliability
    Information for Noise Robust Speech Recognition,” in <i>International Conference
    on Acoustics (NAG/DAGA 2009)</i>, 2009.
  mla: Leutnant, Volker, and Reinhold Haeb-Umbach. “On the Estimation and Use of Feature
    Reliability Information for Noise Robust Speech Recognition.” <i>International
    Conference on Acoustics (NAG/DAGA 2009)</i>, 2009.
  short: 'V. Leutnant, R. Haeb-Umbach, in: International Conference on Acoustics (NAG/DAGA
    2009), 2009.'
date_created: 2019-07-12T05:29:38Z
date_updated: 2022-01-06T06:51:11Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2009/LeHa09-1.pdf
oa: '1'
publication: International Conference on Acoustics (NAG/DAGA 2009)
status: public
title: On the Estimation and Use of Feature Reliability Information for Noise Robust
  Speech Recognition
type: conference
user_id: '44006'
year: '2009'
...
---
_id: '11860'
abstract:
- lang: eng
  text: In this paper we present an analytic derivation of the moments of the phase
    factor between clean speech and noise cepstral or log-mel-spectral feature vectors.
    The development shows, among others, that the probability density of the phase
    factor is of sub-Gaussian nature and that it is independent of the noise type
    and the signal-to-noise ratio, however dependent on the mel filter bank index.
    Further we show how to compute the contribution of the phase factor to both the
    mean and the vari- ance of the noisy speech observation likelihood, which relates
    the speech and noise feature vectors to those of noisy speech. The resulting phase-sensitive
    observation model is then used in model-based speech feature enhancement, leading
    to significant improvements in word accuracy on the AURORA2 database.
author:
- first_name: Volker
  full_name: Leutnant, Volker
  last_name: Leutnant
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Leutnant V, Haeb-Umbach R. An analytic derivation of a phase-sensitive observation
    model for noise robust speech recognition. In: <i>Interspeech 2009</i>. ; 2009.'
  apa: Leutnant, V., &#38; Haeb-Umbach, R. (2009). An analytic derivation of a phase-sensitive
    observation model for noise robust speech recognition. In <i>Interspeech 2009</i>.
  bibtex: '@inproceedings{Leutnant_Haeb-Umbach_2009, title={An analytic derivation
    of a phase-sensitive observation model for noise robust speech recognition}, booktitle={Interspeech
    2009}, author={Leutnant, Volker and Haeb-Umbach, Reinhold}, year={2009} }'
  chicago: Leutnant, Volker, and Reinhold Haeb-Umbach. “An Analytic Derivation of
    a Phase-Sensitive Observation Model for Noise Robust Speech Recognition.” In <i>Interspeech
    2009</i>, 2009.
  ieee: V. Leutnant and R. Haeb-Umbach, “An analytic derivation of a phase-sensitive
    observation model for noise robust speech recognition,” in <i>Interspeech 2009</i>,
    2009.
  mla: Leutnant, Volker, and Reinhold Haeb-Umbach. “An Analytic Derivation of a Phase-Sensitive
    Observation Model for Noise Robust Speech Recognition.” <i>Interspeech 2009</i>,
    2009.
  short: 'V. Leutnant, R. Haeb-Umbach, in: Interspeech 2009, 2009.'
date_created: 2019-07-12T05:29:39Z
date_updated: 2022-01-06T06:51:11Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2009/LeHa09-2.pdf
oa: '1'
publication: Interspeech 2009
status: public
title: An analytic derivation of a phase-sensitive observation model for noise robust
  speech recognition
type: conference
user_id: '44006'
year: '2009'
...
---
_id: '11881'
abstract:
- lang: eng
  text: A combination of GPS (global positioning system) and INS (inertial navigation
    system) is known to provide high precision and highly robust vehicle localization.
    Notably during times when the GPS signal has a poor quality, e.g. due to the lack
    of a sufficiently large number of visible satellites, the INS, which may consist
    of a gyroscope and an odometer, will lead to improved positioning accuracy. In
    this paper we show how velocity information obtained from GSM (global system for
    mobile communications) signalling, rather than from a tachometer, can be used
    together with a gyroscope sensor to support localization in the presence of temporarily
    unavailable GPS data. We propose a sensor fusion system architecture and present
    simulation results that show the effectiveness of this approach.
author:
- first_name: Sven
  full_name: Peschke, Sven
  last_name: Peschke
- first_name: Maik
  full_name: Bevermeier, Maik
  last_name: Bevermeier
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Peschke S, Bevermeier M, Haeb-Umbach R. A GPS positioning approach exploiting
    GSM velocity estimates. In: <i>6th Workshop on Positioning Navigation and Communication
    (WPNC 2009)</i>. ; 2009:195-202. doi:<a href="https://doi.org/10.1109/WPNC.2009.4907827">10.1109/WPNC.2009.4907827</a>'
  apa: Peschke, S., Bevermeier, M., &#38; Haeb-Umbach, R. (2009). A GPS positioning
    approach exploiting GSM velocity estimates. In <i>6th Workshop on Positioning
    Navigation and Communication (WPNC 2009)</i> (pp. 195–202). <a href="https://doi.org/10.1109/WPNC.2009.4907827">https://doi.org/10.1109/WPNC.2009.4907827</a>
  bibtex: '@inproceedings{Peschke_Bevermeier_Haeb-Umbach_2009, title={A GPS positioning
    approach exploiting GSM velocity estimates}, DOI={<a href="https://doi.org/10.1109/WPNC.2009.4907827">10.1109/WPNC.2009.4907827</a>},
    booktitle={6th Workshop on Positioning Navigation and Communication (WPNC 2009)},
    author={Peschke, Sven and Bevermeier, Maik and Haeb-Umbach, Reinhold}, year={2009},
    pages={195–202} }'
  chicago: Peschke, Sven, Maik Bevermeier, and Reinhold Haeb-Umbach. “A GPS Positioning
    Approach Exploiting GSM Velocity Estimates.” In <i>6th Workshop on Positioning
    Navigation and Communication (WPNC 2009)</i>, 195–202, 2009. <a href="https://doi.org/10.1109/WPNC.2009.4907827">https://doi.org/10.1109/WPNC.2009.4907827</a>.
  ieee: S. Peschke, M. Bevermeier, and R. Haeb-Umbach, “A GPS positioning approach
    exploiting GSM velocity estimates,” in <i>6th Workshop on Positioning Navigation
    and Communication (WPNC 2009)</i>, 2009, pp. 195–202.
  mla: Peschke, Sven, et al. “A GPS Positioning Approach Exploiting GSM Velocity Estimates.”
    <i>6th Workshop on Positioning Navigation and Communication (WPNC 2009)</i>, 2009,
    pp. 195–202, doi:<a href="https://doi.org/10.1109/WPNC.2009.4907827">10.1109/WPNC.2009.4907827</a>.
  short: 'S. Peschke, M. Bevermeier, R. Haeb-Umbach, in: 6th Workshop on Positioning
    Navigation and Communication (WPNC 2009), 2009, pp. 195–202.'
date_created: 2019-07-12T05:30:04Z
date_updated: 2022-01-06T06:51:11Z
department:
- _id: '54'
doi: 10.1109/WPNC.2009.4907827
keyword:
- cellular radio
- distance measurement
- global positioning system
- Global Positioning System
- global system for mobile communications
- GPS positioning approach
- GSM velocity
- gyroscopes
- gyroscope sensor
- inertial navigation
- inertial navigation system
- odometer
- sensor fusion system architecture
- sensors
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2009/PeBeHa09-1.pdf
oa: '1'
page: 195-202
publication: 6th Workshop on Positioning Navigation and Communication (WPNC 2009)
status: public
title: A GPS positioning approach exploiting GSM velocity estimates
type: conference
user_id: '44006'
year: '2009'
...
---
_id: '11882'
author:
- first_name: Sven
  full_name: Peschke, Sven
  last_name: Peschke
- first_name: Maik
  full_name: Bevermeier, Maik
  last_name: Bevermeier
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Peschke S, Bevermeier M, Haeb-Umbach R. Verbesserung von GPS-basierter Ortung
    durch GSM-Geschwindigkeitsschaetzungen. In: <i>DGON Navigationskonvent 2009</i>.
    ; 2009.'
  apa: Peschke, S., Bevermeier, M., &#38; Haeb-Umbach, R. (2009). Verbesserung von
    GPS-basierter Ortung durch GSM-Geschwindigkeitsschaetzungen. In <i>DGON Navigationskonvent
    2009</i>.
  bibtex: '@inproceedings{Peschke_Bevermeier_Haeb-Umbach_2009, title={Verbesserung
    von GPS-basierter Ortung durch GSM-Geschwindigkeitsschaetzungen}, booktitle={DGON
    Navigationskonvent 2009}, author={Peschke, Sven and Bevermeier, Maik and Haeb-Umbach,
    Reinhold}, year={2009} }'
  chicago: Peschke, Sven, Maik Bevermeier, and Reinhold Haeb-Umbach. “Verbesserung
    von GPS-Basierter Ortung Durch GSM-Geschwindigkeitsschaetzungen.” In <i>DGON Navigationskonvent
    2009</i>, 2009.
  ieee: S. Peschke, M. Bevermeier, and R. Haeb-Umbach, “Verbesserung von GPS-basierter
    Ortung durch GSM-Geschwindigkeitsschaetzungen,” in <i>DGON Navigationskonvent
    2009</i>, 2009.
  mla: Peschke, Sven, et al. “Verbesserung von GPS-Basierter Ortung Durch GSM-Geschwindigkeitsschaetzungen.”
    <i>DGON Navigationskonvent 2009</i>, 2009.
  short: 'S. Peschke, M. Bevermeier, R. Haeb-Umbach, in: DGON Navigationskonvent 2009,
    2009.'
date_created: 2019-07-12T05:30:05Z
date_updated: 2022-01-06T06:51:11Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2009/PeBeHa09-2.pdf
oa: '1'
publication: DGON Navigationskonvent 2009
status: public
title: Verbesserung von GPS-basierter Ortung durch GSM-Geschwindigkeitsschaetzungen
type: conference
user_id: '44006'
year: '2009'
...
---
_id: '11937'
abstract:
- lang: eng
  text: In automatic speech recognition, hidden Markov models (HMMs) are commonly
    used for speech decoding, while switching linear dynamic models (SLDMs) can be
    employed for a preceding model-based speech feature enhancement. In this paper,
    these model types are combined in order to obtain a novel iterative speech feature
    enhancement and recognition architecture. It is shown that speech feature enhancement
    with SLDMs can be improved by feeding back information from the HMM to the enhancement
    stage. Two different feedback structures are derived. In the first, the posteriors
    of the HMM states are used to control the model probabilities of the SLDMs, while
    in the second they are employed to directly influence the estimate of the speech
    feature distribution. Both approaches lead to improvements in recognition accuracy
    both on the AURORA2 and AURORA4 databases compared to non-iterative speech feature
    enhancement with SLDMs. It is also shown that a combination with uncertainty decoding
    further enhances performance.
author:
- first_name: Stefan
  full_name: Windmann, Stefan
  last_name: Windmann
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: Windmann S, Haeb-Umbach R. Approaches to Iterative Speech Feature Enhancement
    and Recognition. <i>IEEE Transactions on Audio, Speech, and Language Processing</i>.
    2009;17(5):974-984. doi:<a href="https://doi.org/10.1109/TASL.2009.2014894">10.1109/TASL.2009.2014894</a>
  apa: Windmann, S., &#38; Haeb-Umbach, R. (2009). Approaches to Iterative Speech
    Feature Enhancement and Recognition. <i>IEEE Transactions on Audio, Speech, and
    Language Processing</i>, <i>17</i>(5), 974–984. <a href="https://doi.org/10.1109/TASL.2009.2014894">https://doi.org/10.1109/TASL.2009.2014894</a>
  bibtex: '@article{Windmann_Haeb-Umbach_2009, title={Approaches to Iterative Speech
    Feature Enhancement and Recognition}, volume={17}, DOI={<a href="https://doi.org/10.1109/TASL.2009.2014894">10.1109/TASL.2009.2014894</a>},
    number={5}, journal={IEEE Transactions on Audio, Speech, and Language Processing},
    author={Windmann, Stefan and Haeb-Umbach, Reinhold}, year={2009}, pages={974–984}
    }'
  chicago: 'Windmann, Stefan, and Reinhold Haeb-Umbach. “Approaches to Iterative Speech
    Feature Enhancement and Recognition.” <i>IEEE Transactions on Audio, Speech, and
    Language Processing</i> 17, no. 5 (2009): 974–84. <a href="https://doi.org/10.1109/TASL.2009.2014894">https://doi.org/10.1109/TASL.2009.2014894</a>.'
  ieee: S. Windmann and R. Haeb-Umbach, “Approaches to Iterative Speech Feature Enhancement
    and Recognition,” <i>IEEE Transactions on Audio, Speech, and Language Processing</i>,
    vol. 17, no. 5, pp. 974–984, 2009.
  mla: Windmann, Stefan, and Reinhold Haeb-Umbach. “Approaches to Iterative Speech
    Feature Enhancement and Recognition.” <i>IEEE Transactions on Audio, Speech, and
    Language Processing</i>, vol. 17, no. 5, 2009, pp. 974–84, doi:<a href="https://doi.org/10.1109/TASL.2009.2014894">10.1109/TASL.2009.2014894</a>.
  short: S. Windmann, R. Haeb-Umbach, IEEE Transactions on Audio, Speech, and Language
    Processing 17 (2009) 974–984.
date_created: 2019-07-12T05:31:08Z
date_updated: 2022-01-06T06:51:12Z
department:
- _id: '54'
doi: 10.1109/TASL.2009.2014894
intvolume: '        17'
issue: '5'
keyword:
- AURORA2 databases
- AURORA4 databases
- automatic speech recognition
- feedback structures
- hidden Markov models
- HMM
- iterative methods
- iterative speech feature enhancement
- model probabilities
- speech decoding
- speech enhancement
- speech feature distribution
- speech recognition
- switching linear dynamic models
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2009/WiHa09-1.pdf
oa: '1'
page: 974-984
publication: IEEE Transactions on Audio, Speech, and Language Processing
status: public
title: Approaches to Iterative Speech Feature Enhancement and Recognition
type: journal_article
user_id: '44006'
volume: 17
year: '2009'
...
---
_id: '11938'
abstract:
- lang: eng
  text: In this paper, parameter estimation of a state-space model of noise or noisy
    speech cepstra is investigated. A blockwise EM algorithm is derived for the estimation
    of the state and observation noise covariance from noise-only input data. It is
    supposed to be used during the offline training mode of a speech recognizer. Further
    a sequential online EM algorithm is developed to adapt the observation noise covariance
    on noisy speech cepstra at its input. The estimated parameters are then used in
    model-based speech feature enhancement for noise-robust automatic speech recognition.
    Experiments on the AURORA4 database lead to improved recognition results with
    a linear state model compared to the assumption of stationary noise.
author:
- first_name: Stefan
  full_name: Windmann, Stefan
  last_name: Windmann
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: Windmann S, Haeb-Umbach R. Parameter Estimation of a State-Space Model of Noise
    for Robust Speech Recognition. <i>IEEE Transactions on Audio, Speech, and Language
    Processing</i>. 2009;17(8):1577-1590. doi:<a href="https://doi.org/10.1109/TASL.2009.2023172">10.1109/TASL.2009.2023172</a>
  apa: Windmann, S., &#38; Haeb-Umbach, R. (2009). Parameter Estimation of a State-Space
    Model of Noise for Robust Speech Recognition. <i>IEEE Transactions on Audio, Speech,
    and Language Processing</i>, <i>17</i>(8), 1577–1590. <a href="https://doi.org/10.1109/TASL.2009.2023172">https://doi.org/10.1109/TASL.2009.2023172</a>
  bibtex: '@article{Windmann_Haeb-Umbach_2009, title={Parameter Estimation of a State-Space
    Model of Noise for Robust Speech Recognition}, volume={17}, DOI={<a href="https://doi.org/10.1109/TASL.2009.2023172">10.1109/TASL.2009.2023172</a>},
    number={8}, journal={IEEE Transactions on Audio, Speech, and Language Processing},
    author={Windmann, Stefan and Haeb-Umbach, Reinhold}, year={2009}, pages={1577–1590}
    }'
  chicago: 'Windmann, Stefan, and Reinhold Haeb-Umbach. “Parameter Estimation of a
    State-Space Model of Noise for Robust Speech Recognition.” <i>IEEE Transactions
    on Audio, Speech, and Language Processing</i> 17, no. 8 (2009): 1577–90. <a href="https://doi.org/10.1109/TASL.2009.2023172">https://doi.org/10.1109/TASL.2009.2023172</a>.'
  ieee: S. Windmann and R. Haeb-Umbach, “Parameter Estimation of a State-Space Model
    of Noise for Robust Speech Recognition,” <i>IEEE Transactions on Audio, Speech,
    and Language Processing</i>, vol. 17, no. 8, pp. 1577–1590, 2009.
  mla: Windmann, Stefan, and Reinhold Haeb-Umbach. “Parameter Estimation of a State-Space
    Model of Noise for Robust Speech Recognition.” <i>IEEE Transactions on Audio,
    Speech, and Language Processing</i>, vol. 17, no. 8, 2009, pp. 1577–90, doi:<a
    href="https://doi.org/10.1109/TASL.2009.2023172">10.1109/TASL.2009.2023172</a>.
  short: S. Windmann, R. Haeb-Umbach, IEEE Transactions on Audio, Speech, and Language
    Processing 17 (2009) 1577–1590.
date_created: 2019-07-12T05:31:09Z
date_updated: 2022-01-06T06:51:12Z
department:
- _id: '54'
doi: 10.1109/TASL.2009.2023172
intvolume: '        17'
issue: '8'
keyword:
- AURORA4 database
- blockwise EM algorithm
- covariance analysis
- linear state model
- noise covariance
- noise-robust automatic speech recognition
- noisy speech cepstra
- offline training mode
- parameter estimation
- speech recognition
- speech recognition equipment
- speech recognizer
- state-space methods
- state-space model
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2009/WiHa09-2.pdf
oa: '1'
page: 1577-1590
publication: IEEE Transactions on Audio, Speech, and Language Processing
status: public
title: Parameter Estimation of a State-Space Model of Noise for Robust Speech Recognition
type: journal_article
user_id: '44006'
volume: 17
year: '2009'
...
---
_id: '11900'
author:
- first_name: Joerg
  full_name: Schmalenstroeer, Joerg
  id: '460'
  last_name: Schmalenstroeer
- first_name: Volker
  full_name: Leutnant, Volker
  last_name: Leutnant
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Schmalenstroeer J, Leutnant V, Haeb-Umbach R. Audio-Visual Data Processing
    for Ambient Communication. In: <i>1st International Workshop on Distributed Computing
    in Ambient Environments within 32nd Annual Conference on Artificial Intelligence</i>.
    ; 2009.'
  apa: Schmalenstroeer, J., Leutnant, V., &#38; Haeb-Umbach, R. (2009). Audio-Visual
    Data Processing for Ambient Communication. <i>1st International Workshop on Distributed
    Computing in Ambient Environments within 32nd Annual Conference on Artificial
    Intelligence</i>.
  bibtex: '@inproceedings{Schmalenstroeer_Leutnant_Haeb-Umbach_2009, title={Audio-Visual
    Data Processing for Ambient Communication}, booktitle={1st International Workshop
    on Distributed Computing in Ambient Environments within 32nd Annual Conference
    on Artificial Intelligence}, author={Schmalenstroeer, Joerg and Leutnant, Volker
    and Haeb-Umbach, Reinhold}, year={2009} }'
  chicago: Schmalenstroeer, Joerg, Volker Leutnant, and Reinhold Haeb-Umbach. “Audio-Visual
    Data Processing for Ambient Communication.” In <i>1st International Workshop on
    Distributed Computing in Ambient Environments within 32nd Annual Conference on
    Artificial Intelligence</i>, 2009.
  ieee: J. Schmalenstroeer, V. Leutnant, and R. Haeb-Umbach, “Audio-Visual Data Processing
    for Ambient Communication,” 2009.
  mla: Schmalenstroeer, Joerg, et al. “Audio-Visual Data Processing for Ambient Communication.”
    <i>1st International Workshop on Distributed Computing in Ambient Environments
    within 32nd Annual Conference on Artificial Intelligence</i>, 2009.
  short: 'J. Schmalenstroeer, V. Leutnant, R. Haeb-Umbach, in: 1st International Workshop
    on Distributed Computing in Ambient Environments within 32nd Annual Conference
    on Artificial Intelligence, 2009.'
date_created: 2019-07-12T05:30:25Z
date_updated: 2023-11-15T15:03:08Z
ddc:
- '004'
department:
- _id: '54'
file:
- access_level: open_access
  content_type: application/pdf
  creator: schmalen
  date_created: 2023-11-15T15:02:34Z
  date_updated: 2023-11-15T15:02:34Z
  file_id: '48934'
  file_name: SchLeuHae09.pdf
  file_size: 98062
  relation: main_file
file_date_updated: 2023-11-15T15:02:34Z
has_accepted_license: '1'
language:
- iso: eng
oa: '1'
publication: 1st International Workshop on Distributed Computing in Ambient Environments
  within 32nd Annual Conference on Artificial Intelligence
quality_controlled: '1'
status: public
title: Audio-Visual Data Processing for Ambient Communication
type: conference
user_id: '460'
year: '2009'
...
---
_id: '11806'
abstract:
- lang: eng
  text: Microphone arrays represent the basis for many challenging acoustic sensing
    tasks. The accuracy of techniques like beamforming directly depends on a precise
    knowledge of the relative positions of the sensors used. Unfortunately, for certain
    use cases manually measuring the geometry of an array is not feasible due to practical
    constraints. In this paper we present an approach to unsupervised shape calibration
    of microphone array networks. We developed a hierarchical procedure that first
    performs local shape calibration based on coherence analysis and then employs
    SRP-PHAT in a network calibration method. Practical experiments demonstrate the
    effectiveness of our approach especially for highly reverberant acoustic environments.
author:
- first_name: Marius
  full_name: Hennecke, Marius
  last_name: Hennecke
- first_name: Thomas
  full_name: Ploetz, Thomas
  last_name: Ploetz
- first_name: Gernot A.
  full_name: Fink, Gernot A.
  last_name: Fink
- first_name: Joerg
  full_name: Schmalenstroeer, Joerg
  id: '460'
  last_name: Schmalenstroeer
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Hennecke M, Ploetz T, Fink GA, Schmalenstroeer J, Haeb-Umbach R. A hierarchical
    approach to unsupervised shape calibration of microphone array networks. In: <i>IEEE/SP
    15th Workshop on Statistical Signal Processing (SSP 2009)</i>. ; 2009:257-260.
    doi:<a href="https://doi.org/10.1109/SSP.2009.5278589">10.1109/SSP.2009.5278589</a>'
  apa: Hennecke, M., Ploetz, T., Fink, G. A., Schmalenstroeer, J., &#38; Haeb-Umbach,
    R. (2009). A hierarchical approach to unsupervised shape calibration of microphone
    array networks. <i>IEEE/SP 15th Workshop on Statistical Signal Processing (SSP
    2009)</i>, 257–260. <a href="https://doi.org/10.1109/SSP.2009.5278589">https://doi.org/10.1109/SSP.2009.5278589</a>
  bibtex: '@inproceedings{Hennecke_Ploetz_Fink_Schmalenstroeer_Haeb-Umbach_2009, title={A
    hierarchical approach to unsupervised shape calibration of microphone array networks},
    DOI={<a href="https://doi.org/10.1109/SSP.2009.5278589">10.1109/SSP.2009.5278589</a>},
    booktitle={IEEE/SP 15th Workshop on Statistical Signal Processing (SSP 2009)},
    author={Hennecke, Marius and Ploetz, Thomas and Fink, Gernot A. and Schmalenstroeer,
    Joerg and Haeb-Umbach, Reinhold}, year={2009}, pages={257–260} }'
  chicago: Hennecke, Marius, Thomas Ploetz, Gernot A. Fink, Joerg Schmalenstroeer,
    and Reinhold Haeb-Umbach. “A Hierarchical Approach to Unsupervised Shape Calibration
    of Microphone Array Networks.” In <i>IEEE/SP 15th Workshop on Statistical Signal
    Processing (SSP 2009)</i>, 257–60, 2009. <a href="https://doi.org/10.1109/SSP.2009.5278589">https://doi.org/10.1109/SSP.2009.5278589</a>.
  ieee: 'M. Hennecke, T. Ploetz, G. A. Fink, J. Schmalenstroeer, and R. Haeb-Umbach,
    “A hierarchical approach to unsupervised shape calibration of microphone array
    networks,” in <i>IEEE/SP 15th Workshop on Statistical Signal Processing (SSP 2009)</i>,
    2009, pp. 257–260, doi: <a href="https://doi.org/10.1109/SSP.2009.5278589">10.1109/SSP.2009.5278589</a>.'
  mla: Hennecke, Marius, et al. “A Hierarchical Approach to Unsupervised Shape Calibration
    of Microphone Array Networks.” <i>IEEE/SP 15th Workshop on Statistical Signal
    Processing (SSP 2009)</i>, 2009, pp. 257–60, doi:<a href="https://doi.org/10.1109/SSP.2009.5278589">10.1109/SSP.2009.5278589</a>.
  short: 'M. Hennecke, T. Ploetz, G.A. Fink, J. Schmalenstroeer, R. Haeb-Umbach, in:
    IEEE/SP 15th Workshop on Statistical Signal Processing (SSP 2009), 2009, pp. 257–260.'
date_created: 2019-07-12T05:28:37Z
date_updated: 2023-10-26T08:09:22Z
department:
- _id: '54'
doi: 10.1109/SSP.2009.5278589
keyword:
- acoustic sensing tasks
- array geometry
- calibration
- coherence analysis
- hierarchical procedure
- local shape calibration
- microphone array networks
- microphone arrays
- network calibration method
- sensor arrays
- SRP-PHAT
- unsupervised shape calibration
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2009/HePlFiScHa09.pdf
oa: '1'
page: 257-260
publication: IEEE/SP 15th Workshop on Statistical Signal Processing (SSP 2009)
quality_controlled: '1'
status: public
title: A hierarchical approach to unsupervised shape calibration of microphone array
  networks
type: conference
user_id: '460'
year: '2009'
...
---
_id: '11899'
abstract:
- lang: eng
  text: In this paper we present a system for identifying and localizingspeakers using
    distant microphone arrays and a steerablepan-tilt-zoom camera. Audio and video
    streams are processedin real-time to obtain the diarization information {grqq}who
    speakswhen and where'' with low latency to be used in advanced videoconferencing
    systems or user-adaptive interfaces. A key featureof the proposed system is to
    first glean information about thespeaker{\rq}s location and identity from the
    audio and visual datastreams separately and then to fuse these data in a probabilisticframework
    employing the Viterbi algorithm. Here, visual evidenceof a person is utilized
    through a priori state probabilities,while location and speaker change information
    are employedvia time-variant transition probablities. Experiments show thatvideo
    information yields a substantial improvement comparedto pure audio-based diarization.
author:
- first_name: Joerg
  full_name: Schmalenstroeer, Joerg
  id: '460'
  last_name: Schmalenstroeer
- first_name: Martin
  full_name: Kelling, Martin
  last_name: Kelling
- first_name: Volker
  full_name: Leutnant, Volker
  last_name: Leutnant
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Schmalenstroeer J, Kelling M, Leutnant V, Haeb-Umbach R. Fusing Audio and
    Video Information for Online Speaker Diarization. In: <i>Interspeech 2009</i>.
    ; 2009.'
  apa: Schmalenstroeer, J., Kelling, M., Leutnant, V., &#38; Haeb-Umbach, R. (2009).
    Fusing Audio and Video Information for Online Speaker Diarization. <i>Interspeech
    2009</i>.
  bibtex: '@inproceedings{Schmalenstroeer_Kelling_Leutnant_Haeb-Umbach_2009, title={Fusing
    Audio and Video Information for Online Speaker Diarization}, booktitle={Interspeech
    2009}, author={Schmalenstroeer, Joerg and Kelling, Martin and Leutnant, Volker
    and Haeb-Umbach, Reinhold}, year={2009} }'
  chicago: Schmalenstroeer, Joerg, Martin Kelling, Volker Leutnant, and Reinhold Haeb-Umbach.
    “Fusing Audio and Video Information for Online Speaker Diarization.” In <i>Interspeech
    2009</i>, 2009.
  ieee: J. Schmalenstroeer, M. Kelling, V. Leutnant, and R. Haeb-Umbach, “Fusing Audio
    and Video Information for Online Speaker Diarization,” 2009.
  mla: Schmalenstroeer, Joerg, et al. “Fusing Audio and Video Information for Online
    Speaker Diarization.” <i>Interspeech 2009</i>, 2009.
  short: 'J. Schmalenstroeer, M. Kelling, V. Leutnant, R. Haeb-Umbach, in: Interspeech
    2009, 2009.'
date_created: 2019-07-12T05:30:24Z
date_updated: 2023-10-26T08:10:10Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2009/ScKeLeHa09.pdf
oa: '1'
publication: Interspeech 2009
quality_controlled: '1'
status: public
title: Fusing Audio and Video Information for Online Speaker Diarization
type: conference
user_id: '460'
year: '2009'
...
---
_id: '11776'
abstract:
- lang: eng
  text: 'The term uncertainty decoding has been phrased for a class of robustness
    enhancing algorithms in automatic speech recognition that replace point estimates
    and plug-in rules by posterior densities and optimal decision rules. While uncertainty
    can be incorporated in the model domain, in the feature domain, or even in both,
    we concentrate here on feature domain approaches as they tend to be computationally
    less demanding. We derive optimal decision rules in the presence of uncertain
    observations and discuss simplifications which result in computationally efficient
    realizations. The usefulness of the presented statistical framework is then exemplified
    for two types of realworld problems: The first is improving the robustness of
    speech recognition towards incomplete or corrupted feature vectors due to a lossy
    communication link between the speech capturing front end and the backend recognition
    engine. And the second is the well-known and extensively studied issue of improving
    the robustness of the recognizer towards environmental noise.'
author:
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: Haeb-Umbach R. Uncertainty Decoding in Automatic Speech Recognition. <i>2008
    ITG Conference on Voice Communication (SprachKommunikation)</i>. 2008:1-7.
  apa: Haeb-Umbach, R. (2008). Uncertainty Decoding in Automatic Speech Recognition.
    <i>2008 ITG Conference on Voice Communication (SprachKommunikation)</i>, 1–7.
  bibtex: '@article{Haeb-Umbach_2008, title={Uncertainty Decoding in Automatic Speech
    Recognition}, journal={2008 ITG Conference on Voice Communication (SprachKommunikation)},
    author={Haeb-Umbach, Reinhold}, year={2008}, pages={1–7} }'
  chicago: Haeb-Umbach, Reinhold. “Uncertainty Decoding in Automatic Speech Recognition.”
    <i>2008 ITG Conference on Voice Communication (SprachKommunikation)</i>, 2008,
    1–7.
  ieee: R. Haeb-Umbach, “Uncertainty Decoding in Automatic Speech Recognition,” <i>2008
    ITG Conference on Voice Communication (SprachKommunikation)</i>, pp. 1–7, 2008.
  mla: Haeb-Umbach, Reinhold. “Uncertainty Decoding in Automatic Speech Recognition.”
    <i>2008 ITG Conference on Voice Communication (SprachKommunikation)</i>, 2008,
    pp. 1–7.
  short: R. Haeb-Umbach, 2008 ITG Conference on Voice Communication (SprachKommunikation)
    (2008) 1–7.
date_created: 2019-07-12T05:28:02Z
date_updated: 2022-01-06T06:51:08Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2008/Ha08.pdf
oa: '1'
page: 1-7
publication: 2008 ITG Conference on Voice Communication (SprachKommunikation)
status: public
title: Uncertainty Decoding in Automatic Speech Recognition
type: journal_article
user_id: '44006'
year: '2008'
...
---
_id: '11789'
abstract:
- lang: eng
  text: In distributed and network speech recognition the actual recognition task
    is not carried out on the user{\rq}s terminal but rather on a remote server in
    the network. While there are good reasons for doing so, a disadvantage of this
    client-server architecture is clearly that the communication medium may introduce
    errors, which then impairs speech recognition accuracy. Even sophisticated channel
    coding cannot completely prevent the occurrence of residual bit errors in the
    case of temporarily adverse channel conditions, and in packet-oriented transmission
    packets of data may arrive too late for the given real-time constraints and have
    to be declared lost. The goal of error concealment is to reduce the detrimental
    effect that such errors may induce on the recipient of the transmitted speech
    signal by exploiting residual redundancy in the bit stream at the source coder
    output. In classical speech transmission a human is the recipient, and erroneous
    data are reconstructed so as to reduce the subjectively annoying effect of corrupted
    bits or lost packets. Here, however, a statistical classifier is at the receiving
    end, which can benefit from knowledge about the quality of the reconstruction.
    In this book chapter we show how the classical Bayesian decision rule needs to
    be modified to account for uncertain features, and illustrate how the required
    feature posterior density can be estimated in the case of distributed speech recognition.
    Some other techniques for error concealment can be related to this approach. Experimental
    results are given for both a small and a medium vocabulary recognition task and
    both for a channel exhibiting bit errors and a packet erasure channel.
author:
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
- first_name: Valentin
  full_name: Ion, Valentin
  last_name: Ion
citation:
  ama: 'Haeb-Umbach R, Ion V. Error Concealement. In: Lindenberg B, Tan Z-H, eds.
    <i>Automatic Speech Recognition on Mobile Devices and over Communication Networks</i>.
    Vol Advances in Computer Vision and Pattern Recognition. Advances in Pattern Recognition.
    Springer; 2008:187-210.'
  apa: Haeb-Umbach, R., &#38; Ion, V. (2008). Error Concealement. In B. Lindenberg
    &#38; Z.-H. Tan (Eds.), <i>Automatic Speech Recognition on Mobile Devices and
    over Communication Networks</i> (Vol. Advances in Computer Vision and Pattern
    Recognition, pp. 187–210). Springer.
  bibtex: '@inbook{Haeb-Umbach_Ion_2008, series={Advances in Pattern Recognition},
    title={Error Concealement}, volume={Advances in Computer Vision and Pattern Recognition},
    booktitle={Automatic Speech Recognition on Mobile Devices and over Communication
    Networks}, publisher={Springer}, author={Haeb-Umbach, Reinhold and Ion, Valentin},
    editor={Lindenberg, Borge and Tan, Zheng-HuaEditors}, year={2008}, pages={187–210},
    collection={Advances in Pattern Recognition} }'
  chicago: Haeb-Umbach, Reinhold, and Valentin Ion. “Error Concealement.” In <i>Automatic
    Speech Recognition on Mobile Devices and over Communication Networks</i>, edited
    by Borge Lindenberg and Zheng-Hua Tan, Advances in Computer Vision and Pattern
    Recognition:187–210. Advances in Pattern Recognition. Springer, 2008.
  ieee: R. Haeb-Umbach and V. Ion, “Error Concealement,” in <i>Automatic Speech Recognition
    on Mobile Devices and over Communication Networks</i>, vol. Advances in Computer
    Vision and Pattern Recognition, B. Lindenberg and Z.-H. Tan, Eds. Springer, 2008,
    pp. 187–210.
  mla: Haeb-Umbach, Reinhold, and Valentin Ion. “Error Concealement.” <i>Automatic
    Speech Recognition on Mobile Devices and over Communication Networks</i>, edited
    by Borge Lindenberg and Zheng-Hua Tan, vol. Advances in Computer Vision and Pattern
    Recognition, Springer, 2008, pp. 187–210.
  short: 'R. Haeb-Umbach, V. Ion, in: B. Lindenberg, Z.-H. Tan (Eds.), Automatic Speech
    Recognition on Mobile Devices and over Communication Networks, Springer, 2008,
    pp. 187–210.'
date_created: 2019-07-12T05:28:17Z
date_updated: 2022-01-06T06:51:08Z
department:
- _id: '54'
editor:
- first_name: Borge
  full_name: Lindenberg, Borge
  last_name: Lindenberg
- first_name: Zheng-Hua
  full_name: Tan, Zheng-Hua
  last_name: Tan
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2008/HaIo08.pdf
oa: '1'
page: 187-210
publication: Automatic Speech Recognition on Mobile Devices and over Communication
  Networks
publisher: Springer
series_title: Advances in Pattern Recognition
status: public
title: Error Concealement
type: book_chapter
user_id: '44006'
volume: Advances in Computer Vision and Pattern Recognition
year: '2008'
...
---
_id: '11820'
abstract:
- lang: eng
  text: In this paper, we derive an uncertainty decoding rule for automatic speech
    recognition (ASR), which accounts for both corrupted observations and inter-frame
    correlation. The conditional independence assumption, prevalent in hidden Markov
    model-based ASR, is relaxed to obtain a clean speech posterior that is conditioned
    on the complete observed feature vector sequence. This is a more informative posterior
    than one conditioned only on the current observation. The novel decoding is used
    to obtain a transmission-error robust remote ASR system, where the speech capturing
    unit is connected to the decoder via an error-prone communication network. We
    show how the clean speech posterior can be computed for communication links being
    characterized by either bit errors or packet loss. Recognition results are presented
    for both distributed and network speech recognition, where in the latter case
    common voice-over-IP codecs are employed.
author:
- first_name: Valentin
  full_name: Ion, Valentin
  last_name: Ion
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: Ion V, Haeb-Umbach R. A Novel Uncertainty Decoding Rule With Applications to
    Transmission Error Robust Speech Recognition. <i>IEEE Transactions on Audio, Speech,
    and Language Processing</i>. 2008;16(5):1047-1060. doi:<a href="https://doi.org/10.1109/TASL.2008.925879">10.1109/TASL.2008.925879</a>
  apa: Ion, V., &#38; Haeb-Umbach, R. (2008). A Novel Uncertainty Decoding Rule With
    Applications to Transmission Error Robust Speech Recognition. <i>IEEE Transactions
    on Audio, Speech, and Language Processing</i>, <i>16</i>(5), 1047–1060. <a href="https://doi.org/10.1109/TASL.2008.925879">https://doi.org/10.1109/TASL.2008.925879</a>
  bibtex: '@article{Ion_Haeb-Umbach_2008, title={A Novel Uncertainty Decoding Rule
    With Applications to Transmission Error Robust Speech Recognition}, volume={16},
    DOI={<a href="https://doi.org/10.1109/TASL.2008.925879">10.1109/TASL.2008.925879</a>},
    number={5}, journal={IEEE Transactions on Audio, Speech, and Language Processing},
    author={Ion, Valentin and Haeb-Umbach, Reinhold}, year={2008}, pages={1047–1060}
    }'
  chicago: 'Ion, Valentin, and Reinhold Haeb-Umbach. “A Novel Uncertainty Decoding
    Rule With Applications to Transmission Error Robust Speech Recognition.” <i>IEEE
    Transactions on Audio, Speech, and Language Processing</i> 16, no. 5 (2008): 1047–60.
    <a href="https://doi.org/10.1109/TASL.2008.925879">https://doi.org/10.1109/TASL.2008.925879</a>.'
  ieee: V. Ion and R. Haeb-Umbach, “A Novel Uncertainty Decoding Rule With Applications
    to Transmission Error Robust Speech Recognition,” <i>IEEE Transactions on Audio,
    Speech, and Language Processing</i>, vol. 16, no. 5, pp. 1047–1060, 2008.
  mla: Ion, Valentin, and Reinhold Haeb-Umbach. “A Novel Uncertainty Decoding Rule
    With Applications to Transmission Error Robust Speech Recognition.” <i>IEEE Transactions
    on Audio, Speech, and Language Processing</i>, vol. 16, no. 5, 2008, pp. 1047–60,
    doi:<a href="https://doi.org/10.1109/TASL.2008.925879">10.1109/TASL.2008.925879</a>.
  short: V. Ion, R. Haeb-Umbach, IEEE Transactions on Audio, Speech, and Language
    Processing 16 (2008) 1047–1060.
date_created: 2019-07-12T05:28:53Z
date_updated: 2022-01-06T06:51:10Z
department:
- _id: '54'
doi: 10.1109/TASL.2008.925879
intvolume: '        16'
issue: '5'
keyword:
- automatic speech recognition
- bit errors
- codecs
- communication links
- corrupted observations
- decoding
- distributed speech recognition
- error-prone communication network
- feature vector sequence
- hidden Markov model-based ASR
- hidden Markov models
- inter-frame correlation
- Internet telephony
- network speech recognition
- packet loss
- speech posterior
- speech recognition
- transmission error robust speech recognition
- uncertainty decoding
- voice-over-IP codecs
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2008/IoHa08-1.pdf
oa: '1'
page: 1047-1060
publication: IEEE Transactions on Audio, Speech, and Language Processing
status: public
title: A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust
  Speech Recognition
type: journal_article
user_id: '44006'
volume: 16
year: '2008'
...
---
_id: '11821'
abstract:
- lang: eng
  text: This paper addresses the robustness of automatic speech recognition to environmental
    noise. In order to account for reliability of the clean feature estimate we employ
    the feature posterior density conditioned on observed noisy features to perform
    uncertainty decoding. We investigate two approaches to estimate the posterior
    using a discrete feature space, first conditioning only on the current observation,
    and second on the whole feature sequence of an utterance. Experiments with Aurora
    2 showed that the latter provides slightly better performance, as it allows for
    exploiting the temporal correlations between consecutive features.
author:
- first_name: Valentin
  full_name: Ion, Valentin
  last_name: Ion
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: Ion V, Haeb-Umbach R. Investigations into Uncertainty Decoding Employing a
    Discrete Feature Space for Noise Robust Automatic Speech Recognition. <i>2008
    ITG Conference on Voice Communication (SprachKommunikation)</i>. 2008:1-4.
  apa: Ion, V., &#38; Haeb-Umbach, R. (2008). Investigations into Uncertainty Decoding
    Employing a Discrete Feature Space for Noise Robust Automatic Speech Recognition.
    <i>2008 ITG Conference on Voice Communication (SprachKommunikation)</i>, 1–4.
  bibtex: '@article{Ion_Haeb-Umbach_2008, title={Investigations into Uncertainty Decoding
    Employing a Discrete Feature Space for Noise Robust Automatic Speech Recognition},
    journal={2008 ITG Conference on Voice Communication (SprachKommunikation)}, author={Ion,
    Valentin and Haeb-Umbach, Reinhold}, year={2008}, pages={1–4} }'
  chicago: Ion, Valentin, and Reinhold Haeb-Umbach. “Investigations into Uncertainty
    Decoding Employing a Discrete Feature Space for Noise Robust Automatic Speech
    Recognition.” <i>2008 ITG Conference on Voice Communication (SprachKommunikation)</i>,
    2008, 1–4.
  ieee: V. Ion and R. Haeb-Umbach, “Investigations into Uncertainty Decoding Employing
    a Discrete Feature Space for Noise Robust Automatic Speech Recognition,” <i>2008
    ITG Conference on Voice Communication (SprachKommunikation)</i>, pp. 1–4, 2008.
  mla: Ion, Valentin, and Reinhold Haeb-Umbach. “Investigations into Uncertainty Decoding
    Employing a Discrete Feature Space for Noise Robust Automatic Speech Recognition.”
    <i>2008 ITG Conference on Voice Communication (SprachKommunikation)</i>, 2008,
    pp. 1–4.
  short: V. Ion, R. Haeb-Umbach, 2008 ITG Conference on Voice Communication (SprachKommunikation)
    (2008) 1–4.
date_created: 2019-07-12T05:28:54Z
date_updated: 2022-01-06T06:51:10Z
department:
- _id: '54'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2008/IoHa08-2.pdf
oa: '1'
page: 1-4
publication: 2008 ITG Conference on Voice Communication (SprachKommunikation)
status: public
title: Investigations into Uncertainty Decoding Employing a Discrete Feature Space
  for Noise Robust Automatic Speech Recognition
type: journal_article
user_id: '44006'
year: '2008'
...
