papers AI Learner
The Github is limit! Click to go to the new site.

Microblog Hashtag Generation via Encoding Conversation Contexts

2019-05-18
Yue Wang, Jing Li, Irwin King, Michael R. Lyu, Shuming Shi

Abstract

Automatic hashtag annotation plays an important role in content understanding for microblog posts. To date, progress made in this field has been restricted to phrase selection from limited candidates, or word-level hashtag discovery using topic models. Different from previous work considering hashtags to be inseparable, our work is the first effort to annotate hashtags with a novel sequence generation framework via viewing the hashtag as a short sequence of words. Moreover, to address the data sparsity issue in processing short microblog posts, we propose to jointly model the target posts and the conversation contexts initiated by them with bidirectional attention. Extensive experimental results on two large-scale datasets, newly collected from English Twitter and Chinese Weibo, show that our model significantly outperforms state-of-the-art models based on classification. Further studies demonstrate our ability to effectively generate rare and even unseen hashtags, which is however not possible for most existing methods.

Abstract (translated by Google)
URL

http://arxiv.org/abs/1905.07584

PDF

http://arxiv.org/pdf/1905.07584


Similar Posts

Comments