{"department":[{"_id":"54"}],"date_created":"2024-09-30T08:08:46Z","citation":{"chicago":"Cornell, Samuele, Taejin Park, Steve Huang, Christoph Boeddeker, Xuankai Chang, Matthew Maciejewski, Matthew Wiesner, Paola Garcia, and Shinji Watanabe. “The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant  Automatic Speech Recognition and Diarization.” ArXiv:2407.16447, 2024.","apa":"Cornell, S., Park, T., Huang, S., Boeddeker, C., Chang, X., Maciejewski, M., Wiesner, M., Garcia, P., & Watanabe, S. (2024). The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant  Automatic Speech Recognition and Diarization. In arXiv:2407.16447.","short":"S. Cornell, T. Park, S. Huang, C. Boeddeker, X. Chang, M. Maciejewski, M. Wiesner, P. Garcia, S. Watanabe, ArXiv:2407.16447 (2024).","bibtex":"@article{Cornell_Park_Huang_Boeddeker_Chang_Maciejewski_Wiesner_Garcia_Watanabe_2024, title={The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant  Automatic Speech Recognition and Diarization}, journal={arXiv:2407.16447}, author={Cornell, Samuele and Park, Taejin and Huang, Steve and Boeddeker, Christoph and Chang, Xuankai and Maciejewski, Matthew and Wiesner, Matthew and Garcia, Paola and Watanabe, Shinji}, year={2024} }","mla":"Cornell, Samuele, et al. “The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant  Automatic Speech Recognition and Diarization.” ArXiv:2407.16447, 2024.","ama":"Cornell S, Park T, Huang S, et al. The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant  Automatic Speech Recognition and Diarization. arXiv:240716447. Published online 2024.","ieee":"S. Cornell et al., “The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant  Automatic Speech Recognition and Diarization,” arXiv:2407.16447. 2024."},"status":"public","author":[{"first_name":"Samuele","last_name":"Cornell","full_name":"Cornell, Samuele"},{"first_name":"Taejin","last_name":"Park","full_name":"Park, Taejin"},{"last_name":"Huang","first_name":"Steve","full_name":"Huang, Steve"},{"first_name":"Christoph","last_name":"Boeddeker","full_name":"Boeddeker, Christoph","id":"40767"},{"full_name":"Chang, Xuankai","first_name":"Xuankai","last_name":"Chang"},{"first_name":"Matthew","last_name":"Maciejewski","full_name":"Maciejewski, Matthew"},{"full_name":"Wiesner, Matthew","last_name":"Wiesner","first_name":"Matthew"},{"last_name":"Garcia","first_name":"Paola","full_name":"Garcia, Paola"},{"full_name":"Watanabe, Shinji","last_name":"Watanabe","first_name":"Shinji"}],"user_id":"40767","language":[{"iso":"eng"}],"main_file_link":[{"url":"https://arxiv.org/pdf/2407.16447","open_access":"1"}],"abstract":[{"lang":"eng","text":"This paper presents the CHiME-8 DASR challenge which carries on from the\r\nprevious edition CHiME-7 DASR (C7DASR) and the past CHiME-6 challenge. It\r\nfocuses on joint multi-channel distant speech recognition (DASR) and\r\ndiarization with one or more, possibly heterogeneous, devices. The main goal is\r\nto spur research towards meeting transcription approaches that can generalize\r\nacross arbitrary number of speakers, diverse settings (formal vs. informal\r\nconversations), meeting duration, wide-variety of acoustic scenarios and\r\ndifferent recording configurations. Novelties with respect to C7DASR include:\r\ni) the addition of NOTSOFAR-1, an additional office/corporate meeting scenario,\r\nii) a manually corrected Mixer 6 development set, iii) a new track in which we\r\nallow the use of large-language models (LLM) iv) a jury award mechanism to\r\nencourage participants to explore also more practical and innovative solutions.\r\nTo lower the entry barrier for participants, we provide a standalone toolkit\r\nfor downloading and preparing such datasets as well as performing text\r\nnormalization and scoring their submissions. Furthermore, this year we also\r\nprovide two baseline systems, one directly inherited from C7DASR and based on\r\nESPnet and another one developed on NeMo and based on NeMo team submission in\r\nlast year C7DASR. Baseline system results suggest that the addition of the\r\nNOTSOFAR-1 scenario significantly increases the task's difficulty due to its\r\nhigh number of speakers and very short duration."}],"oa":"1","_id":"56273","publication":"arXiv:2407.16447","external_id":{"arxiv":["2407.16447"]},"date_updated":"2024-09-30T08:09:40Z","year":"2024","title":"The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization","type":"preprint"}