Abstract
In this paper we describe the Microsoft COCO Caption dataset and evaluation server. When completed, the dataset will contain over one and a half million captions describing over 330,000 images. For the training and validation images, five independent human generated captions will be provided. To ensure consistency in evaluation of automatic caption generation algorithms, an evaluation server is used. The evaluation server receives candidate captions and scores them using several popular metrics, including BLEU, METEOR, ROUGE and CIDEr. Instructions for using the evaluation server are provided.
Abstract (translated by Google)
在本文中,我们描述了Microsoft COCO Caption数据集和评估服务器。完成后,数据集将包含超过一百五十万个字幕,描述超过33万张图像。对于训练和验证图像,将提供五个独立的人造字幕。为了确保自动字幕生成算法的评估的一致性,使用评估服务器。评估服务器接收候选字幕并使用多种常用指标(包括BLEU,METEOR,ROUGE和CIDEr)对其进行评分。提供了使用评估服务器的说明。
URL
https://arxiv.org/abs/1504.00325