---
_id: '11938'
abstract:
- lang: eng
  text: In this paper, parameter estimation of a state-space model of noise or noisy
    speech cepstra is investigated. A blockwise EM algorithm is derived for the estimation
    of the state and observation noise covariance from noise-only input data. It is
    supposed to be used during the offline training mode of a speech recognizer. Further
    a sequential online EM algorithm is developed to adapt the observation noise covariance
    on noisy speech cepstra at its input. The estimated parameters are then used in
    model-based speech feature enhancement for noise-robust automatic speech recognition.
    Experiments on the AURORA4 database lead to improved recognition results with
    a linear state model compared to the assumption of stationary noise.
author:
- first_name: Stefan
  full_name: Windmann, Stefan
  last_name: Windmann
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: Windmann S, Haeb-Umbach R. Parameter Estimation of a State-Space Model of Noise
    for Robust Speech Recognition. <i>IEEE Transactions on Audio, Speech, and Language
    Processing</i>. 2009;17(8):1577-1590. doi:<a href="https://doi.org/10.1109/TASL.2009.2023172">10.1109/TASL.2009.2023172</a>
  apa: Windmann, S., &#38; Haeb-Umbach, R. (2009). Parameter Estimation of a State-Space
    Model of Noise for Robust Speech Recognition. <i>IEEE Transactions on Audio, Speech,
    and Language Processing</i>, <i>17</i>(8), 1577–1590. <a href="https://doi.org/10.1109/TASL.2009.2023172">https://doi.org/10.1109/TASL.2009.2023172</a>
  bibtex: '@article{Windmann_Haeb-Umbach_2009, title={Parameter Estimation of a State-Space
    Model of Noise for Robust Speech Recognition}, volume={17}, DOI={<a href="https://doi.org/10.1109/TASL.2009.2023172">10.1109/TASL.2009.2023172</a>},
    number={8}, journal={IEEE Transactions on Audio, Speech, and Language Processing},
    author={Windmann, Stefan and Haeb-Umbach, Reinhold}, year={2009}, pages={1577–1590}
    }'
  chicago: 'Windmann, Stefan, and Reinhold Haeb-Umbach. “Parameter Estimation of a
    State-Space Model of Noise for Robust Speech Recognition.” <i>IEEE Transactions
    on Audio, Speech, and Language Processing</i> 17, no. 8 (2009): 1577–90. <a href="https://doi.org/10.1109/TASL.2009.2023172">https://doi.org/10.1109/TASL.2009.2023172</a>.'
  ieee: S. Windmann and R. Haeb-Umbach, “Parameter Estimation of a State-Space Model
    of Noise for Robust Speech Recognition,” <i>IEEE Transactions on Audio, Speech,
    and Language Processing</i>, vol. 17, no. 8, pp. 1577–1590, 2009.
  mla: Windmann, Stefan, and Reinhold Haeb-Umbach. “Parameter Estimation of a State-Space
    Model of Noise for Robust Speech Recognition.” <i>IEEE Transactions on Audio,
    Speech, and Language Processing</i>, vol. 17, no. 8, 2009, pp. 1577–90, doi:<a
    href="https://doi.org/10.1109/TASL.2009.2023172">10.1109/TASL.2009.2023172</a>.
  short: S. Windmann, R. Haeb-Umbach, IEEE Transactions on Audio, Speech, and Language
    Processing 17 (2009) 1577–1590.
date_created: 2019-07-12T05:31:09Z
date_updated: 2022-01-06T06:51:12Z
department:
- _id: '54'
doi: 10.1109/TASL.2009.2023172
intvolume: '        17'
issue: '8'
keyword:
- AURORA4 database
- blockwise EM algorithm
- covariance analysis
- linear state model
- noise covariance
- noise-robust automatic speech recognition
- noisy speech cepstra
- offline training mode
- parameter estimation
- speech recognition
- speech recognition equipment
- speech recognizer
- state-space methods
- state-space model
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2009/WiHa09-2.pdf
oa: '1'
page: 1577-1590
publication: IEEE Transactions on Audio, Speech, and Language Processing
status: public
title: Parameter Estimation of a State-Space Model of Noise for Robust Speech Recognition
type: journal_article
user_id: '44006'
volume: 17
year: '2009'
...
---
_id: '11931'
abstract:
- lang: eng
  text: The paper is concerned with binaural signal processing for a bimodal human-robot
    interface with hearing and vision. The two microphone signals are processed to
    obtain an enhanced single-channel input signal for the subsequent speech recognizer
    and to localize the acoustic source, an important information for establishing
    a natural human-robot communication. We utilize a robust adaptive algorithm for
    filter-and-sum beamforming (FSB) and extract speaker direction information from
    the resulting FIR filter coefficients. Further, particle filtering is applied
    which conducts a nonlinear Bayesian tracking of speaker movement. Good location
    accuracy can be achieved even in highly reverberant environments. The results
    obtained outperform the conventional generalized cross correlation (GCC) method.
author:
- first_name: Ernst
  full_name: Warsitz, Ernst
  last_name: Warsitz
- first_name: Reinhold
  full_name: Haeb-Umbach, Reinhold
  id: '242'
  last_name: Haeb-Umbach
citation:
  ama: 'Warsitz E, Haeb-Umbach R. Robust speaker direction estimation with particle
    filtering. In: <i>IEEE Workshop on Multimedia Signal Processing (MMSP 2004)</i>.
    ; 2004:367-370. doi:<a href="https://doi.org/10.1109/MMSP.2004.1436569">10.1109/MMSP.2004.1436569</a>'
  apa: Warsitz, E., &#38; Haeb-Umbach, R. (2004). Robust speaker direction estimation
    with particle filtering. In <i>IEEE Workshop on Multimedia Signal Processing (MMSP
    2004)</i> (pp. 367–370). <a href="https://doi.org/10.1109/MMSP.2004.1436569">https://doi.org/10.1109/MMSP.2004.1436569</a>
  bibtex: '@inproceedings{Warsitz_Haeb-Umbach_2004, title={Robust speaker direction
    estimation with particle filtering}, DOI={<a href="https://doi.org/10.1109/MMSP.2004.1436569">10.1109/MMSP.2004.1436569</a>},
    booktitle={IEEE Workshop on Multimedia Signal Processing (MMSP 2004)}, author={Warsitz,
    Ernst and Haeb-Umbach, Reinhold}, year={2004}, pages={367–370} }'
  chicago: Warsitz, Ernst, and Reinhold Haeb-Umbach. “Robust Speaker Direction Estimation
    with Particle Filtering.” In <i>IEEE Workshop on Multimedia Signal Processing
    (MMSP 2004)</i>, 367–70, 2004. <a href="https://doi.org/10.1109/MMSP.2004.1436569">https://doi.org/10.1109/MMSP.2004.1436569</a>.
  ieee: E. Warsitz and R. Haeb-Umbach, “Robust speaker direction estimation with particle
    filtering,” in <i>IEEE Workshop on Multimedia Signal Processing (MMSP 2004)</i>,
    2004, pp. 367–370.
  mla: Warsitz, Ernst, and Reinhold Haeb-Umbach. “Robust Speaker Direction Estimation
    with Particle Filtering.” <i>IEEE Workshop on Multimedia Signal Processing (MMSP
    2004)</i>, 2004, pp. 367–70, doi:<a href="https://doi.org/10.1109/MMSP.2004.1436569">10.1109/MMSP.2004.1436569</a>.
  short: 'E. Warsitz, R. Haeb-Umbach, in: IEEE Workshop on Multimedia Signal Processing
    (MMSP 2004), 2004, pp. 367–370.'
date_created: 2019-07-12T05:31:01Z
date_updated: 2022-01-06T06:51:12Z
department:
- _id: '54'
doi: 10.1109/MMSP.2004.1436569
keyword:
- bimodal human-robot interface
- binaural signal processing
- enhanced single-channel input signal
- filter-and-sum beamforming
- filtering theory
- FIR filter coefficient
- generalized cross correlation method
- microphones
- microphone signal
- nonlinear Bayesian tracking
- particle filtering
- robust adaptive algorithm
- robust speaker direction estimation
- signal processing
- speech enhancement
- speech recognition
- speech recognizer
- user interfaces
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://groups.uni-paderborn.de/nt/pubs/2004/WaHa04.pdf
oa: '1'
page: 367-370
publication: IEEE Workshop on Multimedia Signal Processing (MMSP 2004)
status: public
title: Robust speaker direction estimation with particle filtering
type: conference
user_id: '44006'
year: '2004'
...