<?xml version="1.0" encoding="UTF-8"?>

<modsCollection xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://www.loc.gov/mods/v3" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-3.xsd">
<mods version="3.3">

<genre>conference paper</genre>

<titleInfo><title>A Comparison and Combination of Unsupervised Blind Source Separation Techniques</title></titleInfo>





<name type="personal">
  <namePart type="given">Christoph</namePart>
  <namePart type="family">Boeddeker</namePart>
  <role><roleTerm type="text">author</roleTerm> </role><identifier type="local">40767</identifier></name>
<name type="personal">
  <namePart type="given">Frederik</namePart>
  <namePart type="family">Rautenberg</namePart>
  <role><roleTerm type="text">author</roleTerm> </role><identifier type="local">72602</identifier></name>
<name type="personal">
  <namePart type="given">Reinhold</namePart>
  <namePart type="family">Haeb-Umbach</namePart>
  <role><roleTerm type="text">author</roleTerm> </role><identifier type="local">242</identifier></name>







<name type="corporate">
  <namePart></namePart>
  <identifier type="local">54</identifier>
  <role>
    <roleTerm type="text">department</roleTerm>
  </role>
</name>



<name type="conference">
  <namePart>ITG Conference on Speech Communication</namePart>
</name>






<abstract lang="eng">Unsupervised blind source separation methods do not require a training phase
and thus cannot suffer from a train-test mismatch, which is a common concern in
neural-network-based source separation. The unsupervised techniques can be
categorized into two classes: those building upon the sparsity of speech in the
short-time Fourier transform domain and those exploiting non-Gaussianity or
non-stationarity of the source signals. In this contribution, spatial mixture
models, which fall into the first category, and independent vector analysis (IVA),
as a representative of the second category, are compared with respect to their
separation performance and the performance of a downstream speech recognizer on a
reverberant dataset of reasonable size. Furthermore, we introduce a serial
concatenation of the two, where the result of the mixture model serves as the
initialization of IVA, which achieves a significantly better word error rate (WER)
than each algorithm individually and even approaches the performance of a much more
complex neural-network-based technique.</abstract>

<relatedItem type="constituent">
  <location>
    <url displayLabel="2106.05627.pdf">https://ris.uni-paderborn.de/download/44843/44856/2106.05627.pdf</url>
  </location>
  <physicalDescription><internetMediaType>application/pdf</internetMediaType></physicalDescription><accessCondition type="restrictionOnAccess">no</accessCondition>
</relatedItem>
<originInfo><dateIssued encoding="w3cdtf">2021</dateIssued><place><placeTerm type="text">Kiel</placeTerm></place>
</originInfo>
<language><languageTerm authority="iso639-2b" type="code">eng</languageTerm>
</language>



<relatedItem type="host"><titleInfo><title>ITG Conference on Speech Communication</title></titleInfo>
  <identifier type="arXiv">2106.05627</identifier>
<part>
</part>
</relatedItem>


<extension>
<bibliographicCitation>
<chicago>Boeddeker, Christoph, Frederik Rautenberg, and Reinhold Haeb-Umbach. “A Comparison and Combination of Unsupervised Blind Source Separation Techniques.” In &lt;i&gt;ITG Conference on Speech Communication&lt;/i&gt;, 2021.</chicago>
<ieee>C. Boeddeker, F. Rautenberg, and R. Haeb-Umbach, “A Comparison and Combination of Unsupervised Blind Source Separation Techniques,” presented at the ITG Conference on Speech Communication, Kiel, 2021.</ieee>
<ama>Boeddeker C, Rautenberg F, Haeb-Umbach R. A Comparison and Combination of Unsupervised Blind Source Separation Techniques. In: &lt;i&gt;ITG Conference on Speech Communication&lt;/i&gt;. ; 2021.</ama>
<bibtex>@inproceedings{Boeddeker_Rautenberg_Haeb-Umbach_2021, title={A Comparison and Combination of Unsupervised Blind Source Separation Techniques}, booktitle={ITG Conference on Speech Communication}, author={Boeddeker, Christoph and Rautenberg, Frederik and Haeb-Umbach, Reinhold}, year={2021} }</bibtex>
<short>C. Boeddeker, F. Rautenberg, R. Haeb-Umbach, in: ITG Conference on Speech Communication, 2021.</short>
<mla>Boeddeker, Christoph, et al. “A Comparison and Combination of Unsupervised Blind Source Separation Techniques.” &lt;i&gt;ITG Conference on Speech Communication&lt;/i&gt;, 2021.</mla>
<apa>Boeddeker, C., Rautenberg, F., &amp;#38; Haeb-Umbach, R. (2021). A Comparison and Combination of Unsupervised Blind Source Separation Techniques. &lt;i&gt;ITG Conference on Speech Communication&lt;/i&gt;. ITG Conference on Speech Communication, Kiel.</apa>
</bibliographicCitation>
</extension>
<recordInfo><recordIdentifier>44843</recordIdentifier><recordCreationDate encoding="w3cdtf">2023-05-15T07:59:33Z</recordCreationDate><recordChangeDate encoding="w3cdtf">2023-11-15T15:29:32Z</recordChangeDate>
</recordInfo>
</mods>
</modsCollection>
