WSO-LINK: Algorithm to Eliminate Web Structure Outliers in Web Pages
暂无分享,去创建一个
Web Mining is specialized field of Data Mining which deals with the methods and techniques of data mining to extract useful patterns from the web data that is available in web server logs/databases. Web content mining is one of the classifications of web mining which extracts information from the web documents containing texts, links, videos and multimedia data available in World Wide Web databases. Further, web structure mining is a kind of web content mining which extracts patterns and meaningful information from the structure of hyperlinks contained in web documents having the same domain. The hyperlinks which are not related to content or the invalid ones are called web structure outliers. In this paper the basic aim is to find out these web structure outliers. Keywords- Outliers, web outlier mining, web structure mining, Web mining, web structure documents.
[1] Theodore Johnson,et al. Fast Computation of 2-Dimensional Depth Contours , 1998, KDD.
[2] Kevin Chen-Chuan Chang,et al. Editorial: special issue on web content mining , 2004, SKDD.
[3] Xia Huosong,et al. Chinese Web Text Outlier Mining Based on Domain Knowledge , 2010, 2010 Second WRI Global Congress on Intelligent Systems.
[4] Hongqi Li,et al. Research on the Techniques for Effectively Searching and Retrieving Information from Internet , 2008, 2008 International Symposium on Electronic Commerce and Security.