Evaluating KGR10 Polish word embeddings in the recognition of temporal expressions using BiLSTM-CRF

Abstract

The article introduces a new set of Polish word embeddings built from the KGR10 corpus, which contains more than 4 billion words. The embeddings are evaluated on the task of recognizing temporal expressions (timexes) in Polish. We describe how the KGR10 corpus was created and present a new approach to timex recognition that uses a Bidirectional Long Short-Term Memory (BiLSTM) network with an additional CRF layer, where specific embeddings are essential. We report the experiments and the conclusions drawn from them.

URL

http://arxiv.org/abs/1904.04055

PDF

http://arxiv.org/pdf/1904.04055
