A framework for semantic connection based topic evolution with DeepWalk

One of the prevalent studies on Topic Detection and Tracking (TDT) is topic evolution. With the emergence of Internet data, there is a clear need for intuitive and adaptive methods to analyze a series of evolutions. Most approaches completely depend on topic models and only focus on whether topics are changed while ignoring the degree of changes, resulting in poor quality topics and insensitivity of changes. In this paper, we propose a framework of topic evolution based on semantic connections which not only indicates the content similarity between documents but also shows the time decay for an adaptive number of topics and rapid responses to the changes of contents. Additionally, semantic connection features can be used to visualize topic evolution, which makes the analyses much easier. For empirical studies, three data sets in real applications are chosen to prove the effectiveness of our method, and the results show that our method has a better performance in reducing redundant topics, avoiding topic suppression, and discerning the vanishment of old topics and the appearance of new topics.

[1]  Yuan Zuo,et al.  Word network topic model: a simple but general solution for short and imbalanced texts , 2014, Knowledge and Information Systems.

[2]  Qingming Huang,et al.  Cross-media topic detection: A multi-modality fusion framework , 2013, 2013 IEEE International Conference on Multimedia and Expo (ICME).

[3]  Bin Li,et al.  Modeling the evolution of development topics using Dynamic Topic Models , 2015, 2015 IEEE 22nd International Conference on Software Analysis, Evolution, and Reengineering (SANER).

[4]  Wei Xu,et al.  Discovering Topic Evolution Topology in a Microblog Corpus , 2015, 2015 Third International Conference on Advanced Cloud and Big Data.

[5]  Daniel Barbará,et al.  On-line LDA: Adaptive Topic Models for Mining Text Streams with Applications to Topic Detection and Tracking , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[6]  Bingbing Ni,et al.  Facilitating Image Search With a Scalable and Compact Semantic Mapping , 2015, IEEE Transactions on Cybernetics.

[7]  Meng Wang,et al.  Beyond Object Proposals: Random Crop Pooling for Multi-Label Image Recognition , 2016, IEEE Transactions on Image Processing.

[8]  Jian Wang,et al.  News Topic Evolution Tracking by Incorporating Temporal Information , 2014, NLPCC.

[9]  Bin Li,et al.  Explore the evolution of development topics via on-line LDA , 2015, 2015 IEEE 22nd International Conference on Software Analysis, Evolution, and Reengineering (SANER).

[10]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[11]  Henry Lieberman,et al.  You Too?! Mixed-Initiative LDA Story Matching to Help Teens in Distress , 2012, ICWSM.

[12]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[13]  Qingming Huang,et al.  Cross media topic analytics based on synergetic content and user behavior modeling , 2014, 2014 IEEE International Conference on Multimedia and Expo (ICME).

[14]  Alessandro Laio,et al.  Clustering by fast search and find of density peaks , 2014, Science.

[15]  Naonori Ueda,et al.  Sequential Modeling of Topic Dynamics with Multiple Timescales , 2012, TKDD.

[16]  Li Lin,et al.  Mapping the topic evolution using citation-topic model and social network analysis , 2015, 2015 12th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD).

[17]  Yong Wang,et al.  Topic Detection and Evolution Analysis on Microblog , 2014, Intelligent Information Processing.

[18]  Jinyun Yan,et al.  LASTA: large scale topic assignment on multiple social networks , 2014, KDD.

[19]  Chang-Tien Lu,et al.  Topical Analysis of Interactions Between News and Social Media , 2016, AAAI.

[20]  Dinh Q. Phung,et al.  Hierarchical Dirichlet Process for Tracking Complex Topical Structure Evolution and Its Application to Autism Research Literature , 2015, PAKDD.

[21]  Lei Li,et al.  Two Phase Based Spammer Detection in Weibo , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[22]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[23]  Liang Zheng LDA-based Model for Online Topic Evolution Mining , 2010 .

[24]  Qingqiang Wu,et al.  LDA-based model for topic evolution mining on text , 2011, 2011 6th International Conference on Computer Science & Education (ICCSE).

[25]  Lenhart K. Schubert Semantic Representation , 2015, AAAI.

[26]  Mitsunori Ogihara,et al.  Analysis of the Evolution of Research Groups and Topics in the ISMIR Conference , 2015, ISMIR.

[27]  Cao Jian Bi-path Evolution Model for Online Topic Model Based on LDA , 2014 .

[28]  Gert R. G. Lanckriet,et al.  Leveraging Social Context for Modeling Topic Evolution , 2015, KDD.

[29]  Shuai Wang,et al.  Targeted Topic Modeling for Focused Analysis , 2016, KDD.

[30]  Hui Xiong,et al.  Tracking the evolution of social emotions with topic models , 2016, Knowledge and Information Systems.

[31]  Ashish Sureka,et al.  Mining issue tracking systems using topic models for trend analysis, corpus exploration, and understanding evolution , 2014, RAISE 2014.

[32]  Huaqing Min,et al.  Discovering Event Evolution Graphs Based on News Articles Relationships , 2014, 2014 IEEE 11th International Conference on e-Business Engineering.

[33]  Tianqing Zhu,et al.  Privacy-preserving topic model for tagging recommender systems , 2015, Knowledge and Information Systems.

[34]  Michael W. Godfrey,et al.  What's hot and what's not: Windowed developer topic analysis , 2009, 2009 IEEE International Conference on Software Maintenance.

[35]  Yoshua Bengio,et al.  A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res..

[36]  Eric P. Xing,et al.  Dynamic Non-Parametric Mixture Models and the Recurrent Chinese Restaurant Process: with Applications to Evolutionary Clustering , 2008, SDM.

[37]  Sara Elena Garza Villarreal,et al.  Local bilateral clustering for identifying research topics and groups from bibliographical data , 2015, Knowledge and Information Systems.

[38]  Stefan Feuerriegel,et al.  Analysis of How Underlying Topics in Financial News Affect Stock Prices Using Latent Dirichlet Allocation , 2016, 2016 49th Hawaii International Conference on System Sciences (HICSS).

[39]  Cai-Nicolas Ziegler,et al.  Tracking Topic Evolution in News Environments , 2008, 2008 10th IEEE Conference on E-Commerce Technology and the Fifth IEEE Conference on Enterprise Computing, E-Commerce and E-Services.

[40]  Sabine Loudcher,et al.  A Joint Model for Topic-Sentiment Evolution over Time , 2014, 2014 IEEE International Conference on Data Mining.

[41]  John D. Lafferty,et al.  Dynamic topic models , 2006, ICML.

[42]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[43]  Meng Wang,et al.  Trust Agent-Based Behavior Induction in Social Networks , 2016, IEEE Intelligent Systems.