Improving CTC Using Stimulated Learning for Sequence Modeling
J. Heymann, B.L. Khe Chai Sim, in: ICASSP 2019, Brighton, UK, 2019.
Download
ICASSP_2019_Heymann_1_Paper.pdf
239.66 KB
Conference Paper
| English
Author
Heymann, Jahn;
Khe Chai Sim, Bo Li
Abstract
Connectionist temporal classification (CTC) is a sequence-level loss that has been successfully applied to train recurrent neural network (RNN) models for automatic speech recognition. However, one major weakness of CTC is the conditional independence assumption that makes it difficult for the model to learn label dependencies. In this paper, we propose stimulated CTC, which uses stimulated learning to help CTC models learn label dependencies implicitly by using an auxiliary RNN to generate the appropriate stimuli. This stimuli comes in the form of an additional stimulation loss term which encourages the model to learn said label dependencies. The auxiliary network is only used during training and the inference model has the same structure as a standard CTC model. The proposed stimulated CTC model achieves about 35% relative character error rate improvements on a synthetic gesture keyboard recognition task and over 30% relative word error rate improvements on the Librispeech automatic speech recognition tasks over a baseline model trained with CTC only.
Publishing Year
Proceedings Title
ICASSP 2019, Brighton, UK
LibreCat-ID
Cite this
Heymann J, Khe Chai Sim BL. Improving CTC Using Stimulated Learning for Sequence Modeling. In: ICASSP 2019, Brighton, UK. ; 2019.
Heymann, J., & Khe Chai Sim, B. L. (2019). Improving CTC Using Stimulated Learning for Sequence Modeling. In ICASSP 2019, Brighton, UK.
@inproceedings{Heymann_Khe Chai Sim_2019, title={Improving CTC Using Stimulated Learning for Sequence Modeling}, booktitle={ICASSP 2019, Brighton, UK}, author={Heymann, Jahn and Khe Chai Sim, Bo Li}, year={2019} }
Heymann, Jahn, and Bo Li Khe Chai Sim. “Improving CTC Using Stimulated Learning for Sequence Modeling.” In ICASSP 2019, Brighton, UK, 2019.
J. Heymann and B. L. Khe Chai Sim, “Improving CTC Using Stimulated Learning for Sequence Modeling,” in ICASSP 2019, Brighton, UK, 2019.
Heymann, Jahn, and Bo Li Khe Chai Sim. “Improving CTC Using Stimulated Learning for Sequence Modeling.” ICASSP 2019, Brighton, UK, 2019.
All files available under the following license(s):
Creative Commons Public Domain Dedication (CC0 1.0):
Main File(s)
File Name
ICASSP_2019_Heymann_1_Paper.pdf
239.66 KB
Access Level
Open Access
Last Uploaded
2020-02-06T07:24:26Z