Paper Information - A Novel Method for Crawl Forum Threads

A Novel Method for Crawl Forum Threads

An internet crawler is a program that visits websites and reads their pages and other information in order to create entries for a search engine index. We introduce a forum crawler, which traverses web forums. Existing crawlers, however, do not deliver relevant content to forum users, and they suffer from the URL type recognition problem. To overcome these problems, we present Forum Crawler Under Supervision, which crawls relevant forum content from the web with less overhead.

Chandra Sekhara Rao | Ch V N Krishna Murthy
