Interactive Area Topics Extraction with Policy Gradient

Extracting representative topics and improving the extraction performance is rather challenging. In this work, we formulate a novel problem, called Interactive Area Topics Extraction, and propose a learning interactive topics extraction (LITE) model to regard this problem as a sequential decision making process and construct an end-to-end framework to use interaction with users. In particular, we use recurrent neural network (RNN) decoder to address the problem and policy gradient method to tune the model parameters considering user feedback. Experimental result has shown the effectiveness of the proposed framework.

[1]  Timothy Baldwin,et al.  Automatic Labelling of Topic Models , 2011, ACL.

[2]  C. Lee Giles,et al.  Extracting Semantic Relations for Scholarly Knowledge Base Construction , 2018, 2018 IEEE 12th International Conference on Semantic Computing (ICSC).

[3]  Estevam R. Hruschka,et al.  Toward an Architecture for Never-Ending Language Learning , 2010, AAAI.

[4]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[5]  Paolo Cremonesi,et al.  Letting Users Assist What to Watch: An Interactive Query-by-Example Movie Recommendation System , 2017, IIR.

[6]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[7]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[8]  Yishay Mansour,et al.  Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[9]  Vincent Ng,et al.  Automatic Keyphrase Extraction: A Survey of the State of the Art , 2014, ACL.

[10]  Fang Zhang,et al.  Fast Top-k Area Topics Extraction with Knowledge Base , 2018, 2018 IEEE Third International Conference on Data Science in Cyberspace (DSC).

[11]  Sebastian Ruder,et al.  An overview of gradient descent optimization algorithms , 2016, Vestnik komp'iuternykh i informatsionnykh tekhnologii.

[12]  Ron Kohavi,et al.  Controlled experiments on the web: survey and practical guide , 2009, Data Mining and Knowledge Discovery.

[13]  ChengXiang Zhai,et al.  Automatic labeling of multinomial topic models , 2007, KDD '07.

[14]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[15]  Jie Tang,et al.  Beyond Query: Interactive User Intention Understanding , 2015, 2015 IEEE International Conference on Data Mining.

[16]  Yi Zhang,et al.  On the Transitivity of Hypernym-Hyponym Relations in Data-Driven Lexical Taxonomies , 2017, AAAI.

[17]  Xin Jiang,et al.  A ranking approach to keyphrase extraction , 2009, SIGIR.

[18]  Hanan Samet,et al.  The Design and Analysis of Spatial Data Structures , 1989 .

[19]  Aoying Zhou,et al.  Predicting hypernym–hyponym relations for Chinese taxonomy learning , 2018, Knowledge and Information Systems.

[20]  Simone Paolo Ponzetto,et al.  WikiTaxonomy: A Large Scale Knowledge Resource , 2008, ECAI.