<?xml version="1.0" encoding="UTF-8"?>

<modsCollection xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://www.loc.gov/mods/v3" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-3.xsd">
<mods version="3.3">

<genre>article</genre>

<titleInfo><title>Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition</title></titleInfo>





<name type="personal">
  <namePart type="given">Volker</namePart>
  <namePart type="family">Leutnant</namePart>
  <role><roleTerm type="text">author</roleTerm> </role></name>
<name type="personal">
  <namePart type="given">Alexander</namePart>
  <namePart type="family">Krueger</namePart>
  <role><roleTerm type="text">author</roleTerm> </role></name>
<name type="personal">
  <namePart type="given">Reinhold</namePart>
  <namePart type="family">Haeb-Umbach</namePart>
  <role><roleTerm type="text">author</roleTerm> </role><identifier type="local">242</identifier></name>







<name type="corporate">
  <namePart></namePart>
  <identifier type="local">54</identifier>
  <role>
    <roleTerm type="text">department</roleTerm>
  </role>
</name>








<abstract lang="eng">In this contribution we extend a previously proposed Bayesian approach for the enhancement of reverberant logarithmic mel power spectral coefficients for robust automatic speech recognition to the additional compensation of background noise. A recently proposed observation model is employed whose time-variant observation error statistics are obtained as a side product of the inference of the a posteriori probability density function of the clean speech feature vectors. Further a reduction of the computational effort and the memory requirements are achieved by using a recursive formulation of the observation model. The performance of the proposed algorithms is first experimentally studied on a connected digits recognition task with artificially created noisy reverberant data. It is shown that the use of the time-variant observation error model leads to a significant error rate reduction at low signal-to-noise ratios compared to a time-invariant model. Further experiments were conducted on a 5000 word task recorded in a reverberant and noisy environment. A significant word error rate reduction was obtained demonstrating the effectiveness of the approach on real-world data.</abstract>

<originInfo><dateIssued encoding="w3cdtf">2013</dateIssued>
</originInfo>
<language><languageTerm authority="iso639-2b" type="code">eng</languageTerm>
</language>

<subject><topic>Bayes methods</topic><topic>compensation</topic><topic>error statistics</topic><topic>reverberation</topic><topic>speech recognition</topic><topic>Bayesian feature enhancement</topic><topic>background noise</topic><topic>clean speech feature vectors</topic><topic>compensation</topic><topic>connected digits recognition task</topic><topic>error statistics</topic><topic>memory requirements</topic><topic>noisy reverberant data</topic><topic>posteriori probability density function</topic><topic>recursive formulation</topic><topic>reverberant logarithmic mel power spectral coefficients</topic><topic>robust automatic speech recognition</topic><topic>signal-to-noise ratios</topic><topic>time-variant observation</topic><topic>word error rate reduction</topic><topic>Robust automatic speech recognition</topic><topic>model-based Bayesian feature enhancement</topic><topic>observation model for reverberant and noisy speech</topic><topic>recursive observation model</topic>
</subject>


<relatedItem type="host"><titleInfo><title>IEEE Transactions on Audio, Speech, and Language Processing</title></titleInfo><identifier type="doi">10.1109/TASL.2013.2258013</identifier>
<part><detail type="volume"><number>21</number></detail><detail type="issue"><number>8</number></detail><extent unit="pages">1640-1652</extent>
</part>
</relatedItem>


<extension>
<bibliographicCitation>
<ama>Leutnant V, Krueger A, Haeb-Umbach R. Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition. &lt;i&gt;IEEE Transactions on Audio, Speech, and Language Processing&lt;/i&gt;. 2013;21(8):1640-1652. doi:&lt;a href=&quot;https://doi.org/10.1109/TASL.2013.2258013&quot;&gt;10.1109/TASL.2013.2258013&lt;/a&gt;</ama>
<ieee>V. Leutnant, A. Krueger, and R. Haeb-Umbach, “Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition,” &lt;i&gt;IEEE Transactions on Audio, Speech, and Language Processing&lt;/i&gt;, vol. 21, no. 8, pp. 1640–1652, 2013.</ieee>
<chicago>Leutnant, Volker, Alexander Krueger, and Reinhold Haeb-Umbach. “Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition.” &lt;i&gt;IEEE Transactions on Audio, Speech, and Language Processing&lt;/i&gt; 21, no. 8 (2013): 1640–52. &lt;a href=&quot;https://doi.org/10.1109/TASL.2013.2258013&quot;&gt;https://doi.org/10.1109/TASL.2013.2258013&lt;/a&gt;.</chicago>
<bibtex>@article{Leutnant_Krueger_Haeb-Umbach_2013, title={Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition}, volume={21}, DOI={&lt;a href=&quot;https://doi.org/10.1109/TASL.2013.2258013&quot;&gt;10.1109/TASL.2013.2258013&lt;/a&gt;}, number={8}, journal={IEEE Transactions on Audio, Speech, and Language Processing}, author={Leutnant, Volker and Krueger, Alexander and Haeb-Umbach, Reinhold}, year={2013}, pages={1640–1652} }</bibtex>
<mla>Leutnant, Volker, et al. “Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition.” &lt;i&gt;IEEE Transactions on Audio, Speech, and Language Processing&lt;/i&gt;, vol. 21, no. 8, 2013, pp. 1640–52, doi:&lt;a href=&quot;https://doi.org/10.1109/TASL.2013.2258013&quot;&gt;10.1109/TASL.2013.2258013&lt;/a&gt;.</mla>
<short>V. Leutnant, A. Krueger, R. Haeb-Umbach, IEEE Transactions on Audio, Speech, and Language Processing 21 (2013) 1640–1652.</short>
<apa>Leutnant, V., Krueger, A., &amp;#38; Haeb-Umbach, R. (2013). Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition. &lt;i&gt;IEEE Transactions on Audio, Speech, and Language Processing&lt;/i&gt;, &lt;i&gt;21&lt;/i&gt;(8), 1640–1652. &lt;a href=&quot;https://doi.org/10.1109/TASL.2013.2258013&quot;&gt;https://doi.org/10.1109/TASL.2013.2258013&lt;/a&gt;</apa>
</bibliographicCitation>
</extension>
<recordInfo><recordIdentifier>11862</recordIdentifier><recordCreationDate encoding="w3cdtf">2019-07-12T05:29:42Z</recordCreationDate><recordChangeDate encoding="w3cdtf">2022-01-06T06:51:11Z</recordChangeDate>
</recordInfo>
</mods>
</modsCollection>
