Topic Information Collection Based on the Hidden Markov Model

The topic information collection algorithm is widely used for its accuracy. The Hidden Markov Model (HMM) is used to learn and judge the relevance between the Uniform Resource Locator and the topic information. The Rocchio method is used to construct the prototype vectors relevant to the topic information, and the HMM is used to learn the preferred browsing paths. The concept maps including the semantics of the webpage are constructed and the web’s link structures can be decided. The validity of the algorithm is proved by the experiment at last. Comparing with the Best-First algorithm, this algorithm can get more information pages and has higher precision ratio.

[1]  Thomas L. Griffiths,et al.  Learning author-topic models from text corpora , 2010, TOIS.

[2]  Mansur R. Kabuka,et al.  Ontology matching with semantic verification , 2009, J. Web Semant..

[3]  Wang Ke Image Denoising Based on Wavelet Domain Markov Random Field Prior Model , 2006 .

[4]  Adel Torkaman Rahmani,et al.  Focused Crawling Using Vision-Based Page Segmentation , 2012, ICISTM.

[5]  Jiang Xiao-yi Research Advancement of Hidden Markov Model and Its Application in Management , 2012 .

[6]  Hou Yan-ping Approach of face recognition based on continuous hidden Markov model , 2008 .

[7]  Xin Chen,et al.  An Improved Hidden Markov Model for Literature Metadata Extraction , 2010, ICIC.

[8]  Javaan Singh Chahl,et al.  Innovations in intelligent machines , 2007 .

[9]  Yuan Li-chi A speech recognition method based on improved hidden Markov model , 2008 .

[10]  He Li Prediction Method of User Browsing Behaviors Based on Classified Markov Chain , 2008 .

[11]  Prasenjit Mitra,et al.  Clustering-based incremental web crawling , 2010, TOIS.

[12]  Tang Chaojing Technique of Protocol Identification Using Profile Hidden Markov Model , 2008 .

[13]  Yu Yi-biao Digital recognition based on acoustic universal structure , 2012 .

[14]  Guo Hong-zhi,et al.  Adaptive On-Line Web Topic Detection Method for Web News Recommendation System , 2010 .

[15]  Kweku-Muata Osei-Bryson,et al.  Using ontologies to facilitate post-processing of association rules by domain experts , 2011, Inf. Sci..

[16]  Teruaki Kitasuka,et al.  An Effectively Focused Crawling System , 2012 .

[17]  Zhao Jie,et al.  Facial Expression Recognition Using an Improved Embedded HMM , 2008 .