Method and system for fetching news comment page
暂无分享,去创建一个
The invention discloses a method and a system for fetching a news comment page, and belongs to the technical field of information retrieval and data integration. The method comprises the following steps of: performing breadth traversal on the pages from an initial page of a news website, and acquiring page information meeting depth limitation in the traversal process; then calculating the characteristic values of the pages, and identifying the news comment page from the pages according to the size relationship between the characteristic values and a preset threshold value; and finally, acquiring a page turning link of the news comment page, and acquiring other news comment pages according to the page turning link. The method and the system can automatically fetch the news comment page from the web pages of the news website, the fetching speed is high, and the fetched news comment page is comprehensive.