Challenges and Prospects in Vision and Language Research

2019-04-19

Kushal Kafle, Robik Shrestha, Christopher Kanan

arXiv_CV

Abstract
Abstract (translated by Google)
URL
PDF

Abstract

Language grounded image understanding tasks have often been proposed as a method for evaluating progress in artificial intelligence. Ideally, these tasks should test a plethora of capabilities that integrate computer vision, reasoning, and natural language understanding. However, rather than behaving as visual Turing tests, recent studies have demonstrated state-of-the-art systems are achieving good performance through flaws in datasets and evaluation procedures. We review the current state of affairs and outline a path forward.

Abstract (translated by Google)

URL

http://arxiv.org/abs/1904.09317

PDF

http://arxiv.org/pdf/1904.09317

Challenges and Prospects in Vision and Language Research

Abstract

Abstract (translated by Google)

URL

PDF

Similar Posts

Comments