Probing What Different NLP Tasks Teach Machines about Function Word Comprehension

Abstract

We introduce a set of nine challenge tasks that test for the understanding of function words. These tasks are created by structurally mutating sentences from existing datasets to target the comprehension of specific types of function words (e.g., prepositions, wh-words). Using these probing tasks, we explore the effects of various pretraining objectives for sentence encoders (e.g., language modeling, CCG supertagging and natural language inference (NLI)) on the learned representations. Our results show that pretraining on CCG—our most syntactic objective—performs the best on average across our probing tasks, suggesting that syntactic knowledge helps function word comprehension. Language modeling also shows strong performance, supporting its widespread use for pretraining state-of-the-art NLP models. Overall, no pretraining objective dominates across the board, and our function word probing tasks highlight several intuitive differences between pretraining objectives, e.g., that NLI helps the comprehension of negation.

URL

http://arxiv.org/abs/1904.11544

PDF

http://arxiv.org/pdf/1904.11544
