papers AI Learner
The Github is limit! Click to go to the new site.

Using Artificial Tokens to Control Languages for Multilingual Image Caption Generation

2017-06-20
Satoshi Tsutsui, David Crandall

Abstract

Recent work in computer vision has yielded impressive results in automatically describing images with natural language. Most of these systems generate captions in a sin- gle language, requiring multiple language-specific models to build a multilingual captioning system. We propose a very simple technique to build a single unified model across languages, using artificial tokens to control the language, making the captioning system more compact. We evaluate our approach on generating English and Japanese captions, and show that a typical neural captioning architecture is capable of learning a single model that can switch between two different languages.

Abstract (translated by Google)
URL

https://arxiv.org/abs/1706.06275

PDF

https://arxiv.org/pdf/1706.06275


Similar Posts

上一篇 Dualing GANs

Comments