{"abstract":[{"lang":"eng","text":"We present a multi-channel database of overlapping speech for training, evaluation, and detailed analysis of source separation and extraction algorithms: SMS-WSJ -- Spatialized Multi-Speaker Wall Street Journal. It consists of artificially mixed speech taken from the WSJ database, but unlike earlier databases we consider all WSJ0+1 utterances and take care of strictly separating the speaker sets present in the training, validation and test sets. When spatializing the data we ensure a high degree of randomness w.r.t. room size, array center and rotation, as well as speaker position. Furthermore, this paper offers a critical assessment of recently proposed measures of source separation performance. Alongside the code to generate the database we provide a source separation baseline and a Kaldi recipe with competitive word error rates to provide common ground for evaluation."}],"oa":"1","author":[{"first_name":"Lukas","full_name":"Drude, Lukas","last_name":"Drude"},{"last_name":"Heitkaemper","id":"27643","full_name":"Heitkaemper, Jens","first_name":"Jens"},{"last_name":"Boeddeker","id":"40767","full_name":"Boeddeker, Christoph","first_name":"Christoph"},{"first_name":"Reinhold","id":"242","last_name":"Haeb-Umbach","full_name":"Haeb-Umbach, Reinhold"}],"year":"2019","status":"public","has_accepted_license":"1","project":[{"_id":"52","name":"Computing Resources Provided by the Paderborn Center for Parallel Computing"}],"date_created":"2020-09-16T07:59:46Z","user_id":"40767","citation":{"chicago":"Drude, Lukas, Jens Heitkaemper, Christoph Boeddeker, and Reinhold Haeb-Umbach. “SMS-WSJ: Database, Performance Measures, and Baseline Recipe for Multi-Channel Source Separation and Recognition.” ArXiv E-Prints, 2019.","apa":"Drude, L., Heitkaemper, J., Boeddeker, C., & Haeb-Umbach, R. (2019). SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition. ArXiv E-Prints.","short":"L. Drude, J. Heitkaemper, C. Boeddeker, R. Haeb-Umbach, ArXiv E-Prints (2019).","ieee":"L. Drude, J. Heitkaemper, C. Boeddeker, and R. Haeb-Umbach, “SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition,” ArXiv e-prints, 2019.","bibtex":"@article{Drude_Heitkaemper_Boeddeker_Haeb-Umbach_2019, title={SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition}, journal={ArXiv e-prints}, author={Drude, Lukas and Heitkaemper, Jens and Boeddeker, Christoph and Haeb-Umbach, Reinhold}, year={2019} }","mla":"Drude, Lukas, et al. “SMS-WSJ: Database, Performance Measures, and Baseline Recipe for Multi-Channel Source Separation and Recognition.” ArXiv E-Prints, 2019.","ama":"Drude L, Heitkaemper J, Boeddeker C, Haeb-Umbach R. SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition. ArXiv e-prints. 2019."},"title":"SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition","_id":"19446","department":[{"_id":"54"}],"file_date_updated":"2020-12-11T12:22:31Z","ddc":["000"],"language":[{"iso":"eng"}],"type":"journal_article","date_updated":"2022-01-06T06:54:04Z","publication":"ArXiv e-prints","file":[{"file_size":288594,"creator":"huesera","date_updated":"2020-12-11T12:22:31Z","date_created":"2020-09-16T08:00:56Z","relation":"main_file","content_type":"application/pdf","access_level":"open_access","file_name":"ArXiv_2019_Drude.pdf","file_id":"19448"}]}