A Topic Detection Method Based on KeyGraph and Community Partition

More and more media stream data is created on the Internet every day. It's more difficult for persons to obtain valuable information due to information overload. Topic detection is the method that extracts valuable hot topics from media stream data. It is the tool to help to solve the problem of overload information. The topic positive accuracy of cluster method is very low. In this paper, we proposed one topic detection method based on KeyGraph to improve the positive accuracy, and took experiments compared with baseline method on corpus marked by graduate students. In the result, the positive accuracy of KeyGraph method reaches 88.48% with great improvement. The result verified the effectiveness of our proposed method.

[1]  Bo Hu,et al.  An Improved Single-Pass Algorithm for Chinese Microblog Topic Detection and Tracking , 2016, 2016 IEEE International Congress on Big Data (BigData Congress).

[2]  Hendri Murfi,et al.  The K-means with mini batch algorithm for topics detection on online news , 2016, 2016 4th International Conference on Information and Communication Technology (ICoICT).

[3]  Chen Zhang,et al.  A hybrid term-term relations analysis approach for topic detection , 2016, Knowl. Based Syst..

[4]  Xiaohui Hu,et al.  IdeaGraph: A Graph-Based Algorithm of Mining Latent Information for Human Cognition , 2013, 2013 IEEE International Conference on Systems, Man, and Cybernetics.

[5]  Yiannis Kompatsiaris,et al.  Sensing Trending Topics in Twitter , 2013, IEEE Transactions on Multimedia.

[6]  Louiqa Raschid,et al.  A Graph Analytical Approach for Topic Detection , 2013, TOIT.

[7]  Kasun Perera,et al.  KeyGraph and WordNet hypernyms for topic detection , 2015, 2015 12th International Joint Conference on Computer Science and Software Engineering (JCSSE).

[8]  Shie-Jue Lee,et al.  A Similarity Measure for Text Classification and Clustering , 2014, IEEE Transactions on Knowledge and Data Engineering.

[9]  Yukio Ohsawa,et al.  KeyGraph: automatic indexing by co-occurrence graph based on building construction metaphor , 1998, Proceedings IEEE International Forum on Research and Technology Advances in Digital Libraries -ADL'98-.

[10]  Jie Cao,et al.  A pattern-based topic detection and analysis system on Chinese tweets , 2017, J. Comput. Sci..

[11]  Desheng Dash Wu,et al.  Using text mining and sentiment analysis for online forums hotspot detection and forecast , 2010, Decis. Support Syst..

[12]  Sarunas Girdzijauskas,et al.  DeGPar: Large Scale Topic Detection Using Node-Cut Partitioning on Dense Weighted Graphs , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[13]  Fanjiang Xu,et al.  An improved ideagraph algorithm for discovering important rare events , 2014, 2014 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[14]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[15]  Xin Guo,et al.  Semantic-based topic detection using Markov decision processes , 2017, Neurocomputing.

[16]  Xiao Hu News hotspots detection and tracking based on LDA topic model , 2016, 2016 International Conference on Progress in Informatics and Computing (PIC).