papers AI Learner
The Github is limit! Click to go to the new site.

Curriculum Learning and Minibatch Bucketing in Neural Machine Translation

2017-07-29
Tom Kocmi, Ondrej Bojar

Abstract

We examine the effects of particular orderings of sentence pairs on the on-line training of neural machine translation (NMT). We focus on two types of such orderings: (1) ensuring that each minibatch contains sentences similar in some aspect and (2) gradual inclusion of some sentence types as the training progresses (so called “curriculum learning”). In our English-to-Czech experiments, the internal homogeneity of minibatches has no effect on the training but some of our “curricula” achieve a small improvement over the baseline.

Abstract (translated by Google)
URL

https://arxiv.org/abs/1707.09533

PDF

https://arxiv.org/pdf/1707.09533


Comments

Content