{"department":[{"_id":"54"}],"user_id":"59789","citation":{"short":"S. Watanabe, T. Hori, S. Karita, T. Hayashi, J. Nishitoba, Y. Unno, N. Enrique Yalta Soplin, J. Heymann, M. Wiesner, N. Chen, A. Renduchintala, T. Ochiai, in: INTERSPEECH 2018, Hyderabad, India, 2018, pp. 2207–2211.","bibtex":"@inproceedings{Watanabe_Hori_Karita_Hayashi_Nishitoba_Unno_Enrique Yalta Soplin_Heymann_Wiesner_Chen_et al._2018, title={ESPnet: End-to-End Speech Processing Toolkit}, DOI={10.21437/Interspeech.2018-1456}, booktitle={INTERSPEECH 2018, Hyderabad, India}, author={Watanabe, Shinji and Hori, Takaaki and Karita, Shigeki and Hayashi, Tomoki and Nishitoba, Jiro and Unno, Yuya and Enrique Yalta Soplin, Nelson and Heymann, Jahn and Wiesner, Matthew and Chen, Nanxin and et al.}, year={2018}, pages={2207–2211} }","chicago":"Watanabe, Shinji, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, et al. “ESPnet: End-to-End Speech Processing Toolkit.” In INTERSPEECH 2018, Hyderabad, India, 2207–2211, 2018. https://doi.org/10.21437/Interspeech.2018-1456.","ama":"Watanabe S, Hori T, Karita S, et al. ESPnet: End-to-End Speech Processing Toolkit. In: INTERSPEECH 2018, Hyderabad, India. ; 2018:2207–2211. doi:10.21437/Interspeech.2018-1456","mla":"Watanabe, Shinji, et al. “ESPnet: End-to-End Speech Processing Toolkit.” INTERSPEECH 2018, Hyderabad, India, 2018, pp. 2207–2211, doi:10.21437/Interspeech.2018-1456.","apa":"Watanabe, S., Hori, T., Karita, S., Hayashi, T., Nishitoba, J., Unno, Y., Enrique Yalta Soplin, N., Heymann, J., Wiesner, M., Chen, N., Renduchintala, A., & Ochiai, T. (2018). ESPnet: End-to-End Speech Processing Toolkit. INTERSPEECH 2018, Hyderabad, India, 2207–2211. https://doi.org/10.21437/Interspeech.2018-1456","ieee":"S. Watanabe et al., “ESPnet: End-to-End Speech Processing Toolkit,” in INTERSPEECH 2018, Hyderabad, India, 2018, pp. 2207–2211, doi: 10.21437/Interspeech.2018-1456."},"title":"ESPnet: End-to-End Speech Processing Toolkit","has_accepted_license":"1","date_updated":"2023-01-11T11:23:19Z","language":[{"iso":"eng"}],"ddc":["000"],"abstract":[{"lang":"eng","text":"This paper introduces a new open source platform for end-toend speech processing named ESPnet. ESPnet mainly focuses on end-to-end automatic speech recognition (ASR), and adopts widely-used dynamic neural network toolkits, Chainer and Py-Torch, as a main deep learning engine. ESPnet also follows the Kaldi ASR toolkit style for data processing, feature extraction/format, and recipes to provide a complete setup for speech recognition and other speech processing experiments. This paper explains a major architecture of this software platform, several important functionalities, which differentiate ESPnet from other open source ASR toolkits, and experimental results with\r\nmajor ASR benchmarks."}],"doi":"10.21437/Interspeech.2018-1456","file":[{"date_created":"2022-02-23T08:03:13Z","file_id":"29954","access_level":"open_access","relation":"main_file","content_type":"application/pdf","creator":"huesera","file_name":"INTERSPEECH_2018_Heymann_Paper.pdf","file_size":288907,"date_updated":"2022-02-23T08:03:13Z"}],"oa":"1","file_date_updated":"2022-02-23T08:03:13Z","type":"conference","page":"2207–2211","date_created":"2022-02-21T10:34:37Z","publication":"INTERSPEECH 2018, Hyderabad, India","author":[{"first_name":"Shinji","last_name":"Watanabe","full_name":"Watanabe, Shinji"},{"first_name":"Takaaki","last_name":"Hori","full_name":"Hori, Takaaki"},{"last_name":"Karita","full_name":"Karita, Shigeki","first_name":"Shigeki"},{"last_name":"Hayashi","full_name":"Hayashi, Tomoki","first_name":"Tomoki"},{"full_name":"Nishitoba, Jiro","last_name":"Nishitoba","first_name":"Jiro"},{"first_name":"Yuya","full_name":"Unno, Yuya","last_name":"Unno"},{"full_name":"Enrique Yalta Soplin, Nelson","last_name":"Enrique Yalta Soplin","first_name":"Nelson"},{"id":"9168","first_name":"Jahn","last_name":"Heymann","full_name":"Heymann, Jahn"},{"first_name":"Matthew","full_name":"Wiesner, Matthew","last_name":"Wiesner"},{"full_name":"Chen, Nanxin","last_name":"Chen","first_name":"Nanxin"},{"last_name":"Renduchintala","full_name":"Renduchintala, Adithya","first_name":"Adithya"},{"full_name":"Ochiai, Tsubasa","last_name":"Ochiai","first_name":"Tsubasa"}],"status":"public","year":"2018","_id":"29923"}