Misleading Failures of Partial-input Baselines

2019-05-14

Shi Feng, Eric Wallace, Jordan Boyd-Graber

arXiv_AI

arXiv_AI Inference

Abstract
Abstract (translated by Google)
URL
PDF

Abstract

Recent work establishes dataset difficulty and removes annotation artifacts via partial-input baselines (e.g., hypothesis-only or image-only models). While the success of a partial-input baseline indicates a dataset is cheatable, our work cautions the converse is not necessarily true. Using artificial datasets, we illustrate how the failure of a partial-input baseline might shadow more trivial patterns that are only visible in the full input. We also identify such artifacts in real natural language inference datasets. Our work provides an alternative view on the use of partial-input baselines in future dataset creation.

Abstract (translated by Google)

URL

http://arxiv.org/abs/1905.05778

PDF

http://arxiv.org/pdf/1905.05778

Misleading Failures of Partial-input Baselines

Abstract

Abstract (translated by Google)

URL

PDF

Similar Posts

Comments