Improving CTC Using Stimulated Learning for Sequence Modeling

J. Heymann, B.L. Khe Chai Sim, in: ICASSP 2019, Brighton, UK, 2019.

Download
OA 239.66 KB
Conference Paper | English
Author
;
Abstract
Connectionist temporal classification (CTC) is a sequence-level loss that has been successfully applied to train recurrent neural network (RNN) models for automatic speech recognition. However, one major weakness of CTC is the conditional independence assumption that makes it difficult for the model to learn label dependencies. In this paper, we propose stimulated CTC, which uses stimulated learning to help CTC models learn label dependencies implicitly by using an auxiliary RNN to generate the appropriate stimuli. This stimuli comes in the form of an additional stimulation loss term which encourages the model to learn said label dependencies. The auxiliary network is only used during training and the inference model has the same structure as a standard CTC model. The proposed stimulated CTC model achieves about 35% relative character error rate improvements on a synthetic gesture keyboard recognition task and over 30% relative word error rate improvements on the Librispeech automatic speech recognition tasks over a baseline model trained with CTC only.
Publishing Year
Proceedings Title
ICASSP 2019, Brighton, UK
LibreCat-ID

Cite this

Heymann J, Khe Chai Sim BL. Improving CTC Using Stimulated Learning for Sequence Modeling. In: ICASSP 2019, Brighton, UK. ; 2019.
Heymann, J., & Khe Chai Sim, B. L. (2019). Improving CTC Using Stimulated Learning for Sequence Modeling. In ICASSP 2019, Brighton, UK.
@inproceedings{Heymann_Khe Chai Sim_2019, title={Improving CTC Using Stimulated Learning for Sequence Modeling}, booktitle={ICASSP 2019, Brighton, UK}, author={Heymann, Jahn and Khe Chai Sim, Bo Li}, year={2019} }
Heymann, Jahn, and Bo Li Khe Chai Sim. “Improving CTC Using Stimulated Learning for Sequence Modeling.” In ICASSP 2019, Brighton, UK, 2019.
J. Heymann and B. L. Khe Chai Sim, “Improving CTC Using Stimulated Learning for Sequence Modeling,” in ICASSP 2019, Brighton, UK, 2019.
Heymann, Jahn, and Bo Li Khe Chai Sim. “Improving CTC Using Stimulated Learning for Sequence Modeling.” ICASSP 2019, Brighton, UK, 2019.
All files available under the following license(s):
Creative Commons License:
CC0Creative Commons Public Domain Dedication (CC0 1.0)
Main File(s)
Access Level
OA Open Access
Last Uploaded
2020-02-06T07:24:26Z


Export

Marked Publications

Open Data LibreCat

Search this title in

Google Scholar