<?xml version="1.0" encoding="UTF-8"?>

<modsCollection xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://www.loc.gov/mods/v3" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-3.xsd">
<mods version="3.3">

<genre>conference paper</genre>

<titleInfo><title>Towards Online Source Counting in Speech Mixtures Applying a Variational EM for Complex Watson Mixture Models</title></titleInfo>





<name type="personal">
  <namePart type="given">Lukas</namePart>
  <namePart type="family">Drude</namePart>
  <role><roleTerm type="text">author</roleTerm> </role><identifier type="local">11213</identifier></name>
<name type="personal">
  <namePart type="given">Aleksej</namePart>
  <namePart type="family">Chinaev</namePart>
  <role><roleTerm type="text">author</roleTerm> </role></name>
<name type="personal">
  <namePart type="given">Dang Hai</namePart>
  <namePart type="family">Tran Vu</namePart>
  <role><roleTerm type="text">author</roleTerm> </role></name>
<name type="personal">
  <namePart type="given">Reinhold</namePart>
  <namePart type="family">Haeb-Umbach</namePart>
  <role><roleTerm type="text">author</roleTerm> </role><identifier type="local">242</identifier></name>







<name type="corporate">
  <namePart></namePart>
  <identifier type="local">54</identifier>
  <role>
    <roleTerm type="text">department</roleTerm>
  </role>
</name>








<abstract lang="eng">This contribution describes a step-wise source counting algorithm to determine the number of speakers in an offline scenario. Each speaker is identified by a variational expectation maximization (VEM) algorithm for complex Watson mixture models and therefore directly yields beamforming vectors for a subsequent speech separation process. An observation selection criterion is proposed which improves the robustness of the source counting in noise. The algorithm is compared to an alternative VEM approach with Gaussian mixture models based on directions of arrival and shown to deliver improved source counting accuracy. The article concludes by extending the offline algorithm towards a low-latency online estimation of the number of active sources from the streaming input data.</abstract>

<originInfo><dateIssued encoding="w3cdtf">2014</dateIssued>
</originInfo>
<language><languageTerm authority="iso639-2b" type="code">eng</languageTerm>
</language>

<subject><topic>Accuracy</topic><topic>Acoustics</topic><topic>Estimation</topic><topic>Mathematical model</topic><topic>Soruce separation</topic><topic>Speech</topic><topic>Vectors</topic><topic>Bayes methods</topic><topic>Blind source separation</topic><topic>Directional statistics</topic><topic>Number of speakers</topic><topic>Speaker diarization</topic>
</subject>


<relatedItem type="host"><titleInfo><title>14th International Workshop on Acoustic Signal Enhancement (IWAENC 2014)</title></titleInfo>
<part><extent unit="pages">213-217</extent>
</part>
</relatedItem>


<relatedItem type="Supplementary material">
  <location>
  
     <url>https://groups.uni-paderborn.de/nt/pubs/2014/DrChTrHaeb14_Poster.pdf</url>
  
  </location>
</relatedItem>

<extension>
<bibliographicCitation>
<mla>Drude, Lukas, et al. “Towards Online Source Counting in Speech Mixtures Applying a Variational EM for Complex Watson Mixture Models.” &lt;i&gt;14th International Workshop on Acoustic Signal Enhancement (IWAENC 2014)&lt;/i&gt;, 2014, pp. 213–17.</mla>
<bibtex>@inproceedings{Drude_Chinaev_Tran Vu_Haeb-Umbach_2014, title={Towards Online Source Counting in Speech Mixtures Applying a Variational EM for Complex Watson Mixture Models}, booktitle={14th International Workshop on Acoustic Signal Enhancement (IWAENC 2014)}, author={Drude, Lukas and Chinaev, Aleksej and Tran Vu, Dang Hai and Haeb-Umbach, Reinhold}, year={2014}, pages={213–217} }</bibtex>
<ama>Drude L, Chinaev A, Tran Vu DH, Haeb-Umbach R. Towards Online Source Counting in Speech Mixtures Applying a Variational EM for Complex Watson Mixture Models. In: &lt;i&gt;14th International Workshop on Acoustic Signal Enhancement (IWAENC 2014)&lt;/i&gt;. ; 2014:213-217.</ama>
<ieee>L. Drude, A. Chinaev, D. H. Tran Vu, and R. Haeb-Umbach, “Towards Online Source Counting in Speech Mixtures Applying a Variational EM for Complex Watson Mixture Models,” in &lt;i&gt;14th International Workshop on Acoustic Signal Enhancement (IWAENC 2014)&lt;/i&gt;, 2014, pp. 213–217.</ieee>
<apa>Drude, L., Chinaev, A., Tran Vu, D. H., &amp;#38; Haeb-Umbach, R. (2014). Towards Online Source Counting in Speech Mixtures Applying a Variational EM for Complex Watson Mixture Models. In &lt;i&gt;14th International Workshop on Acoustic Signal Enhancement (IWAENC 2014)&lt;/i&gt; (pp. 213–217).</apa>
<chicago>Drude, Lukas, Aleksej Chinaev, Dang Hai Tran Vu, and Reinhold Haeb-Umbach. “Towards Online Source Counting in Speech Mixtures Applying a Variational EM for Complex Watson Mixture Models.” In &lt;i&gt;14th International Workshop on Acoustic Signal Enhancement (IWAENC 2014)&lt;/i&gt;, 213–17, 2014.</chicago>
<short>L. Drude, A. Chinaev, D.H. Tran Vu, R. Haeb-Umbach, in: 14th International Workshop on Acoustic Signal Enhancement (IWAENC 2014), 2014, pp. 213–217.</short>
</bibliographicCitation>
</extension>
<recordInfo><recordIdentifier>11753</recordIdentifier><recordCreationDate encoding="w3cdtf">2019-07-12T05:27:35Z</recordCreationDate><recordChangeDate encoding="w3cdtf">2022-01-06T06:51:08Z</recordChangeDate>
</recordInfo>
</mods>
</modsCollection>
