papers AI Learner
The Github is limit! Click to go to the new site.

VQD: Visual Query Detection in Natural Scenes

2019-04-04
Manoj Acharya, Karan Jariwala, Christopher Kanan

Abstract

We propose Visual Query Detection (VQD), a new visual grounding task. In VQD, a system is guided by natural language to localize a \emph{variable} number of objects in an image. VQD is related to visual referring expression recognition, where the task is to localize only \emph{one} object. We describe the first dataset for VQD and we propose baseline algorithms that demonstrate the difficulty of the task compared to referring expression recognition.

Abstract (translated by Google)
URL

https://arxiv.org/abs/1904.02794

PDF

https://arxiv.org/pdf/1904.02794


Similar Posts

Comments