VQD: Visual Query Detection in Natural Scenes

2019-04-04

Manoj Acharya, Karan Jariwala, Christopher Kanan

arXiv_CV

arXiv_CV Detection Recognition

Abstract
Abstract (translated by Google)
URL
PDF

Abstract

We propose Visual Query Detection (VQD), a new visual grounding task. In VQD, a system is guided by natural language to localize a \emph{variable} number of objects in an image. VQD is related to visual referring expression recognition, where the task is to localize only \emph{one} object. We describe the first dataset for VQD and we propose baseline algorithms that demonstrate the difficulty of the task compared to referring expression recognition.

Abstract (translated by Google)

URL

https://arxiv.org/abs/1904.02794

PDF

https://arxiv.org/pdf/1904.02794

VQD: Visual Query Detection in Natural Scenes

Abstract

Abstract (translated by Google)

URL

PDF

Similar Posts

Comments