CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings

S. Watanabe, M. Mandel, J. Barker, E. Vincent, A. Arora, X. Chang, S. Khudanpur, V. Manohar, D. Povey, D. Raj, D. Snyder, A.S. Subramanian, J. Trmal, B.B. Yair, C. Boeddeker, Z. Ni, Y. Fujita, S. Horiguchi, N. Kanda, T. Yoshioka, N. Ryant, ArXiv:2004.09249 (2020).

Preprint | English
Author
Watanabe, Shinji; Mandel, Michael; Barker, Jon; Vincent, Emmanuel; Arora, Ashish; Chang, Xuankai; Khudanpur, Sanjeev; Manohar, Vimal; Povey, Daniel; Raj, Desh; Snyder, David; Subramanian, Aswin Shanmugam
Abstract
Following the success of the 1st, 2nd, 3rd, 4th and 5th CHiME challenges, we organize the 6th CHiME Speech Separation and Recognition Challenge (CHiME-6). The new challenge revisits the previous CHiME-5 challenge and further considers the problem of distant multi-microphone conversational speech diarization and recognition in everyday home environments. The speech material is the same as the previous CHiME-5 recordings, except that the arrays are now accurately synchronized. The material was elicited using a dinner party scenario, with efforts taken to capture data that is representative of natural conversational speech. This paper provides a baseline description of the CHiME-6 challenge for both segmented multispeaker speech recognition (Track 1) and unsegmented multispeaker speech recognition (Track 2). Of note, Track 2 is the first challenge activity in the community to tackle an unsegmented multispeaker speech recognition scenario with a complete set of reproducible open source baselines providing speech enhancement, speaker diarization, and speech recognition modules.
Publishing Year
2020
Journal Title
arXiv:2004.09249

Cite this

Watanabe S, Mandel M, Barker J, et al. CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings. arXiv:2004.09249. Published online 2020.
Watanabe, S., Mandel, M., Barker, J., Vincent, E., Arora, A., Chang, X., Khudanpur, S., Manohar, V., Povey, D., Raj, D., Snyder, D., Subramanian, A. S., Trmal, J., Yair, B. B., Boeddeker, C., Ni, Z., Fujita, Y., Horiguchi, S., Kanda, N., … Ryant, N. (2020). CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings. In arXiv:2004.09249.
@article{Watanabe_Mandel_Barker_Vincent_Arora_Chang_Khudanpur_Manohar_Povey_Raj_et al._2020, title={CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings}, journal={arXiv:2004.09249}, author={Watanabe, Shinji and Mandel, Michael and Barker, Jon and Vincent, Emmanuel and Arora, Ashish and Chang, Xuankai and Khudanpur, Sanjeev and Manohar, Vimal and Povey, Daniel and Raj, Desh and et al.}, year={2020} }
Watanabe, Shinji, Michael Mandel, Jon Barker, Emmanuel Vincent, Ashish Arora, Xuankai Chang, Sanjeev Khudanpur, et al. “CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings.” ArXiv:2004.09249, 2020.
S. Watanabe et al., “CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings,” arXiv:2004.09249. 2020.
Watanabe, Shinji, et al. “CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings.” ArXiv:2004.09249, 2020.
