<?xml version="1.0" encoding="UTF-8"?>

<modsCollection xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://www.loc.gov/mods/v3" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-3.xsd">
<mods version="3.3">

<genre>conference paper</genre>

<titleInfo><title>Loose Coupling of Spectral and Spatial Models for Multi-Channel Diarization and Enhancement of Meetings in Dynamic Environments</title></titleInfo>





<name type="personal">
  <namePart type="given">Adrian Tobias</namePart>
  <namePart type="family">Meise</namePart>
  <role><roleTerm type="text">author</roleTerm> </role><identifier type="local">79268</identifier></name>
<name type="personal">
  <namePart type="given">Tobias</namePart>
  <namePart type="family">Cord-Landwehr</namePart>
  <role><roleTerm type="text">author</roleTerm> </role><identifier type="local">44393</identifier></name>
<name type="personal">
  <namePart type="given">Christoph</namePart>
  <namePart type="family">Boeddeker</namePart>
  <role><roleTerm type="text">author</roleTerm> </role><identifier type="local">40767</identifier></name>
<name type="personal">
  <namePart type="given">Marc</namePart>
  <namePart type="family">Delcroix</namePart>
  <role><roleTerm type="text">author</roleTerm> </role></name>
<name type="personal">
  <namePart type="given">Tomohiro</namePart>
  <namePart type="family">Nakatani</namePart>
  <role><roleTerm type="text">author</roleTerm> </role></name>
<name type="personal">
  <namePart type="given">Reinhold</namePart>
  <namePart type="family">Haeb-Umbach</namePart>
  <role><roleTerm type="text">author</roleTerm> </role><identifier type="local">242</identifier></name>







<name type="corporate">
  <namePart></namePart>
  <identifier type="local">54</identifier>
  <role>
    <roleTerm type="text">department</roleTerm>
  </role>
</name>



<name type="conference">
  <namePart> 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) </namePart>
</name>






<abstract lang="eng">Sound capture by microphone arrays opens the possibility to exploit spatial, in addition to spectral, information for diarization and signal enhancement, two important tasks in meeting transcription. However, there is no one-to-one mapping of positions in space to speakers if speakers move. Here, we address this by proposing a novel joint spatial and spectral mixture model, whose two submodels are loosely coupled by modeling the relationship between speaker and position index probabilistically. Thus, spatial and spectral information can be jointly exploited, while at the same time allowing for speakers speaking from different positions. Experiments on the LibriCSS data set with simulated speaker position changes show great improvements over tightly coupled subsystems.</abstract>

<originInfo><publisher>IEEE</publisher><dateIssued encoding="w3cdtf">2026</dateIssued><place><placeTerm type="text">Barcelona</placeTerm></place>
</originInfo>
<language><languageTerm authority="iso639-2b" type="code">eng</languageTerm>
</language>

<subject><topic>mixture models</topic><topic>meeting processing</topic><topic>diarization</topic><topic>source separation</topic>
</subject>


<relatedItem type="host"><titleInfo><title>ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</title></titleInfo>
  <identifier type="arXiv">https://arxiv.org/abs/2601.16077</identifier><identifier type="doi">10.1109/icassp55912.2026.11463540</identifier>
<part>
</part>
</relatedItem>


<extension>
<bibliographicCitation>
<ieee>A. T. Meise, T. Cord-Landwehr, C. Boeddeker, M. Delcroix, T. Nakatani, and R. Haeb-Umbach, “Loose Coupling of Spectral and Spatial Models for Multi-Channel Diarization and Enhancement of Meetings in Dynamic Environments,” presented at the  2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Barcelona, 2026, doi: &lt;a href=&quot;https://doi.org/10.1109/icassp55912.2026.11463540&quot;&gt;10.1109/icassp55912.2026.11463540&lt;/a&gt;.</ieee>
<chicago>Meise, Adrian Tobias, Tobias Cord-Landwehr, Christoph Boeddeker, Marc Delcroix, Tomohiro Nakatani, and Reinhold Haeb-Umbach. “Loose Coupling of Spectral and Spatial Models for Multi-Channel Diarization and Enhancement of Meetings in Dynamic Environments.” In &lt;i&gt;ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)&lt;/i&gt;. IEEE, 2026. &lt;a href=&quot;https://doi.org/10.1109/icassp55912.2026.11463540&quot;&gt;https://doi.org/10.1109/icassp55912.2026.11463540&lt;/a&gt;.</chicago>
<ama>Meise AT, Cord-Landwehr T, Boeddeker C, Delcroix M, Nakatani T, Haeb-Umbach R. Loose Coupling of Spectral and Spatial Models for Multi-Channel Diarization and Enhancement of Meetings in Dynamic Environments. In: &lt;i&gt;ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)&lt;/i&gt;. IEEE; 2026. doi:&lt;a href=&quot;https://doi.org/10.1109/icassp55912.2026.11463540&quot;&gt;10.1109/icassp55912.2026.11463540&lt;/a&gt;</ama>
<apa>Meise, A. T., Cord-Landwehr, T., Boeddeker, C., Delcroix, M., Nakatani, T., &amp;#38; Haeb-Umbach, R. (2026). Loose Coupling of Spectral and Spatial Models for Multi-Channel Diarization and Enhancement of Meetings in Dynamic Environments. &lt;i&gt;ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)&lt;/i&gt;.  2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Barcelona. &lt;a href=&quot;https://doi.org/10.1109/icassp55912.2026.11463540&quot;&gt;https://doi.org/10.1109/icassp55912.2026.11463540&lt;/a&gt;</apa>
<short>A.T. Meise, T. Cord-Landwehr, C. Boeddeker, M. Delcroix, T. Nakatani, R. Haeb-Umbach, in: ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2026.</short>
<bibtex>@inproceedings{Meise_Cord-Landwehr_Boeddeker_Delcroix_Nakatani_Haeb-Umbach_2026, title={Loose Coupling of Spectral and Spatial Models for Multi-Channel Diarization and Enhancement of Meetings in Dynamic Environments}, DOI={&lt;a href=&quot;https://doi.org/10.1109/icassp55912.2026.11463540&quot;&gt;10.1109/icassp55912.2026.11463540&lt;/a&gt;}, booktitle={ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)}, publisher={IEEE}, author={Meise, Adrian Tobias and Cord-Landwehr, Tobias and Boeddeker, Christoph and Delcroix, Marc and Nakatani, Tomohiro and Haeb-Umbach, Reinhold}, year={2026} }</bibtex>
<mla>Meise, Adrian Tobias, et al. “Loose Coupling of Spectral and Spatial Models for Multi-Channel Diarization and Enhancement of Meetings in Dynamic Environments.” &lt;i&gt;ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)&lt;/i&gt;, IEEE, 2026, doi:&lt;a href=&quot;https://doi.org/10.1109/icassp55912.2026.11463540&quot;&gt;10.1109/icassp55912.2026.11463540&lt;/a&gt;.</mla>
</bibliographicCitation>
</extension>
<recordInfo><recordIdentifier>65606</recordIdentifier><recordCreationDate encoding="w3cdtf">2026-05-11T14:20:48Z</recordCreationDate><recordChangeDate encoding="w3cdtf">2026-05-11T15:37:36Z</recordChangeDate>
</recordInfo>
</mods>
</modsCollection>
