Research on Micro-Blog Information Perception and Mining Platform

To predict the tendency of Micro-blog information dissemination, provide the early warning of the Internet emergencies, and contribute to the content security of micro-blog, the paper offers a platform for Micro-blog information perceiving and mining. This platform is an integration of Micro-blog data collection and processing module, topic detection and tracking module, user behavior analysis module, trend prediction module, etc. It could access and analyze micro-blog information automatically, leading a positive significance to grasp the emergencies on micro-blog. This paper puts forward methods based on the Latent Dirichlet Allocation (LDA) document clustering and hot topics prediction, which could analysis and predict the micro-blog data effectively, avoiding the problems in the traditional algorithm. Also, these methods have a higher accuracy for clustering and prediction.

[1]  Rui Wang,et al.  An Empirical Study on the Relationship between the Followers' Number and Influence of Microblogging , 2010, 2010 International Conference on E-Business and E-Government.

[2]  Yuefeng Li,et al.  Hot Topic Detection in Professional Blogs , 2011, AMT.

[3]  Yun Liu,et al.  Hot Post Prediction in BBS Forums Based on Multifactor Fusion , 2012 .

[4]  Qiudan Li,et al.  QuestionHolic: Hot topic discovery and trend analysis in community question answering systems , 2011, Expert Syst. Appl..

[5]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[6]  Ying Zhang,et al.  Statistically Modeling the Effectiveness of Disaster Information in Social Media , 2011, 2011 IEEE Global Humanitarian Technology Conference.

[7]  Bruno Pouliquen,et al.  Multilingual and cross-lingual news topic tracking , 2004, COLING.

[8]  Richard M. Schwartz,et al.  Topic tracking for radio, TV broadcast, and newswire , 1999, EUROSPEECH.

[9]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[10]  Ruixia-Han The influence of microblogging on personal public participation , 2010 .

[11]  Hector Garcia-Molina,et al.  Overview of multidatabase transaction management , 2005, The VLDB Journal.

[12]  Yiming Yang,et al.  A study of retrospective and on-line event detection , 1998, SIGIR '98.

[13]  Daniel Gatica-Perez,et al.  Discovering routines from large-scale human locations using probabilistic topic models , 2011, TIST.

[14]  Fazli Can,et al.  Concepts and effectiveness of the cover-coefficient-based clustering methodology for text databases , 1990, TODS.