LS-Tree: Model Interpretation When the Data Are Linguistic

Abstract
Abstract (translated by Google)
URL
PDF

Abstract

We study the problem of interpreting trained classification models in the setting of linguistic data sets. Leveraging a parse tree, we propose to assign least-squares based importance scores to each word of an instance by exploiting syntactic constituency structure. We establish an axiomatic characterization of these importance scores by relating them to the Banzhaf value in coalitional game theory. Based on these importance scores, we develop a principled method for detecting and quantifying interactions between words in a sentence. We demonstrate that the proposed method can aid in interpretability and diagnostics for several widely-used language models.

Abstract (translated by Google)

URL

http://arxiv.org/abs/1902.04187

PDF

http://arxiv.org/pdf/1902.04187

LS-Tree: Model Interpretation When the Data Are Linguistic

Abstract

Abstract (translated by Google)

URL

PDF

Similar Posts

Comments