---
_id: '56273'
abstract:
- lang: eng
  text: "This paper presents the CHiME-8 DASR challenge, which carries on from the\r\nprevious
    edition CHiME-7 DASR (C7DASR) and the past CHiME-6 challenge. It\r\nfocuses on
    joint multi-channel distant automatic speech recognition (DASR) and\r\ndiarization with
    one or more, possibly heterogeneous, devices. The main goal is\r\nto spur research
    towards meeting transcription approaches that can generalize\r\nacross an arbitrary
    number of speakers, diverse settings (formal vs. informal\r\nconversations), meeting
    duration, a wide variety of acoustic scenarios and\r\ndifferent recording configurations.
    Novelties with respect to C7DASR include:\r\ni) the addition of NOTSOFAR-1, an
    office/corporate meeting scenario,\r\nii) a manually corrected Mixer
    6 development set, iii) a new track in which we\r\nallow the use of large language
    models (LLMs), and iv) a jury award mechanism to\r\nencourage participants to also
    explore more practical and innovative solutions.\r\nTo lower the entry barrier for
    participants, we provide a standalone toolkit\r\nfor downloading and preparing
    the datasets, as well as performing text\r\nnormalization and scoring their submissions.
    Furthermore, this year we also\r\nprovide two baseline systems, one directly inherited
    from C7DASR and based on\r\nESPnet, and another developed on NeMo and based
    on the NeMo team's submission to\r\nlast year's C7DASR. Baseline system results suggest
    that the addition of the\r\nNOTSOFAR-1 scenario significantly increases the task's
    difficulty due to its\r\nhigh number of speakers and very short duration."
author:
- first_name: Samuele
  full_name: Cornell, Samuele
  last_name: Cornell
- first_name: Taejin
  full_name: Park, Taejin
  last_name: Park
- first_name: Steve
  full_name: Huang, Steve
  last_name: Huang
- first_name: Christoph
  full_name: Boeddeker, Christoph
  id: '40767'
  last_name: Boeddeker
- first_name: Xuankai
  full_name: Chang, Xuankai
  last_name: Chang
- first_name: Matthew
  full_name: Maciejewski, Matthew
  last_name: Maciejewski
- first_name: Matthew
  full_name: Wiesner, Matthew
  last_name: Wiesner
- first_name: Paola
  full_name: Garcia, Paola
  last_name: Garcia
- first_name: Shinji
  full_name: Watanabe, Shinji
  last_name: Watanabe
citation:
  ama: Cornell S, Park T, Huang S, et al. The CHiME-8 DASR Challenge for Generalizable
    and Array Agnostic Distant Automatic Speech Recognition and Diarization. <i>arXiv:2407.16447</i>.
    Published online 2024.
  apa: Cornell, S., Park, T., Huang, S., Boeddeker, C., Chang, X., Maciejewski, M.,
    Wiesner, M., Garcia, P., &#38; Watanabe, S. (2024). The CHiME-8 DASR Challenge
    for Generalizable and Array Agnostic Distant Automatic Speech Recognition and
    Diarization. In <i>arXiv:2407.16447</i>.
  bibtex: '@article{Cornell_Park_Huang_Boeddeker_Chang_Maciejewski_Wiesner_Garcia_Watanabe_2024,
    title={The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant 
    Automatic Speech Recognition and Diarization}, journal={arXiv:2407.16447}, author={Cornell,
    Samuele and Park, Taejin and Huang, Steve and Boeddeker, Christoph and Chang,
    Xuankai and Maciejewski, Matthew and Wiesner, Matthew and Garcia, Paola and Watanabe,
    Shinji}, year={2024} }'
  chicago: Cornell, Samuele, Taejin Park, Steve Huang, Christoph Boeddeker, Xuankai
    Chang, Matthew Maciejewski, Matthew Wiesner, Paola Garcia, and Shinji Watanabe.
    “The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic
    Speech Recognition and Diarization.” <i>ArXiv:2407.16447</i>, 2024.
  ieee: S. Cornell <i>et al.</i>, “The CHiME-8 DASR Challenge for Generalizable and
    Array Agnostic Distant Automatic Speech Recognition and Diarization,” <i>arXiv:2407.16447</i>.
    2024.
  mla: Cornell, Samuele, et al. “The CHiME-8 DASR Challenge for Generalizable and
    Array Agnostic Distant Automatic Speech Recognition and Diarization.” <i>ArXiv:2407.16447</i>,
    2024.
  short: S. Cornell, T. Park, S. Huang, C. Boeddeker, X. Chang, M. Maciejewski, M.
    Wiesner, P. Garcia, S. Watanabe, ArXiv:2407.16447 (2024).
date_created: 2024-09-30T08:08:46Z
date_updated: 2024-09-30T08:09:40Z
department:
- _id: '54'
external_id:
  arxiv:
  - '2407.16447'
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: https://arxiv.org/pdf/2407.16447
oa: '1'
publication: arXiv:2407.16447
status: public
title: The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic
  Speech Recognition and Diarization
type: preprint
user_id: '40767'
year: '2024'
...
