A Fast Hierarchical Topic Detection Method

Facing to huge network text data, how to analyze Internet news reports timely and effectively is more and more concerned. A fast hierarchical topic detection method is proposed with following two improvements. On the one hand, based on the traditional topic detection method of K-means, the new method improves the detection process by using a new parameter of news’s contribution for topics to have better adaptability of hierarchical topics. The experimental results also present that this new method has better detection performance, especially for those hierarchical topics. On the other hand, based on the above-mentioned method, an improved hierarchical clustering algorithm is further explored. The result demonstrates that different aspects of hierarchical topics could be fully described with low time processing complexity.

[1]  Qiudan Li,et al.  QuestionHolic: Hot topic discovery and trend analysis in community question answering systems , 2011, Expert Syst. Appl..

[2]  Bing He,et al.  Modeling topic and community structure in social tagging: The TTR-LDA-Community model , 2011, J. Assoc. Inf. Sci. Technol..

[3]  Fabio Crestani,et al.  Construction of topics and clusters in Topic Detection and Tracking tasks , 2011, 2011 International Conference on Semantic Technology and Information Retrieval.

[4]  Hao Sun,et al.  Unsupervised video-based lane detection using location-enhanced topic models , 2010 .

[5]  A. Aydin Alatan,et al.  Dominant sets based movie scene detection , 2012, Signal Process..

[6]  Holger Hintelmann,et al.  A novel approach for determining total titanium from titanium dioxide nanoparticles suspended in water and biosolids by digestion with ammonium persulfate. , 2012, Analytica chimica acta.

[7]  Long Zhi-yi Online Topic Detection Algorithm for Internet News , 2009 .

[8]  Shigeo Kawata,et al.  PSE Park: Framework for Problem Solving Environments , 2010, J. Convergence Inf. Technol..

[9]  Jianping Zeng,et al.  Variable space hidden Markov model for topic detection and analysis , 2007, Knowl. Based Syst..

[10]  Martin Oberhofer,et al.  Knowledge Discovery in the Blogosphere: Approaches and Challenges , 2010, IEEE Internet Computing.

[11]  Igor Santos,et al.  Enhanced Topic-based Vector Space Model for semantics-aware spam filtering , 2012, Expert Syst. Appl..

[12]  Leila Parsa,et al.  Recent Advances in Modeling and Online Detection of Stator Interturn Faults in Electrical Motors , 2011, IEEE Transactions on Industrial Electronics.

[13]  Frank Elberzhager,et al.  A systematic mapping study on the combination of static and dynamic quality assurance techniques , 2012, Inf. Softw. Technol..

[14]  Azzam Sleit,et al.  A Cluster-Based Approach for Supporting QoS in Mobile Ad Hoc Networks , 2011 .

[15]  Özcan Özyurt,et al.  Chat mining: Automatically determination of chat conversations' topic in Turkish text based chat mediums , 2010, Expert Syst. Appl..

[16]  Dan Zhang,et al.  Topic detection based on K-means , 2011, 2011 International Conference on Electronics, Communications and Control (ICECC).

[17]  Mike Thelwall,et al.  Sentiment strength detection for the social web , 2012, J. Assoc. Inf. Sci. Technol..

[18]  Yong Hu,et al.  The application of data mining techniques in financial fraud detection: A classification framework and an academic review of literature , 2011, Decis. Support Syst..

[19]  Qi He,et al.  Keep It Simple with Time: A Reexamination of Probabilistic Topic Detection Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Armando J. Pinho,et al.  An efficient omnidirectional vision system for soccer robots: From calibration to object detection , 2011 .