News Page Discovery Policy for Instant Crawlers
[1] Marc Najork, et al. A large-scale study of the evolution of Web pages, 2003, WWW '03.
[2] Filippo Menczer, et al. Topical web crawlers: Evaluating adaptive algorithms, 2004, TOIT.
[3] Serge Abiteboul, et al. Adaptive on-line page importance computation, 2003, WWW '03.
[4] Hector Garcia-Molina, et al. Efficient Crawling Through URL Ordering, 1998, Comput. Networks.
[5] J. Curran, et al. Domain-specific Web site identification: the CROSSMARC focused Web crawler, 2003.
[6] George Cybenko, et al. How dynamic is the Web?, 2000, Comput. Networks.
[7] Kevin S. McCurley, et al. Ranking the web frontier, 2004, WWW '04.
[8] Kevin S. McCurley, et al. Locality, Hierarchy, and Bidirectionality in the Web, 2003.
[9] Filippo Menczer, et al. Topical Crawling for Business Intelligence, 2003, ECDL.
[10] Hector Garcia-Molina, et al. Effective page refresh policies for Web crawlers, 2003, TODS.
[11] Filippo Menczer, et al. Adaptive Retrieval Agents: Internalizing Local Context and Scaling up to the Web, 2000, Machine Learning.
[12] Torsten Suel, et al. Design and implementation of a high-performance distributed Web crawler, 2002, Proceedings 18th International Conference on Data Engineering.
[13] Stuart Macdonald, et al. User Engagement in Research Data Curation, 2009, ECDL.
[14] Ana Carolina Salgado, et al. Looking at both the present and the past to efficiently update replicas of web content, 2005, WIDM '05.
[15] Filippo Menczer, et al. Topic-Driven Crawlers: Machine Learning Issues, 2002.