Architecture for a Parallel Focused Crawler for Clickstream Analysis
暂无分享,去创建一个
[1] Mehran Sahami,et al. Text Mining: Classification, Clustering, and Applications , 2009 .
[2] Jon Kleinberg,et al. Authoritative sources in a hyperlinked environment , 1999, SODA '98.
[3] Jon M. Kleinberg,et al. Automatic Resource Compilation by Analyzing Hyperlink Structure and Associated Text , 1998, Comput. Networks.
[4] Marco Gori,et al. Focused Crawling Using Context Graphs , 2000, VLDB.
[5] Martin van den Berg,et al. Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery , 1999, Comput. Networks.
[6] Soumen Chakrabarti,et al. Integrating the document object model with hyperlinks for enhanced topic distillation and information extraction , 2001, WWW '01.
[7] Jon M. Kleinberg,et al. Mining the Web's Link Structure , 1999, Computer.
[8] Fatemeh Ahmadi-Abkenari,et al. Application of clickstream analysis as Web page importance metric in parallel crawlers , 2010, 2010 International Symposium on Information Technology.
[9] Sergey Brin,et al. The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.
[10] Soumen Chakrabarti,et al. Mining the web - discovering knowledge from hypertext data , 2002 .
[11] Hector Garcia-Molina,et al. Parallel crawlers , 2002, WWW.
[12] Filippo Menczer,et al. Topical web crawlers: Evaluating adaptive algorithms , 2004, TOIT.
[13] Hector Garcia-Molina,et al. Efficient Crawling Through URL Ordering , 1998, Comput. Networks.