papers AI Learner
The Github is limit! Click to go to the new site.

UniSent: Universal Adaptable Sentiment Lexica for 1000+ Languages

2019-04-21
Ehsaneddin Asgari, Fabienne Braune, Christoph Ringlstetter, Mohammad R.K. Mofrad

Abstract

In this paper, we introduce UniSent a universal sentiment lexica for 1000 languages created using an English sentiment lexicon and a massively parallel corpus in the Bible domain. To the best of our knowledge, UniSent is the largest sentiment resource to date in terms of number of covered languages, including many low resource languages. To create UniSent, we propose Adapted Sentiment Pivot, a novel method that combines annotation projection, vocabulary expansion, and unsupervised domain adaptation. We evaluate the quality of UniSent for Macedonian, Czech, German, Spanish, and French and show that its quality is comparable to manually or semi-manually created sentiment resources. With the publication of this paper, we release UniSent lexica as well as Adapted Sentiment Pivot related codes. method.

Abstract (translated by Google)
URL

http://arxiv.org/abs/1904.09678

PDF

http://arxiv.org/pdf/1904.09678


Comments

Content