Babouk: Focused Web Crawling for Corpus Compilation and Automatic Terminology Extraction
[1] Martin van den Berg, et al. Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery, 1999, Comput. Networks.
[2] Adam Kilgarriff, et al. Introduction to the Special Issue on the Web as Corpus, 2003, CL.
[3] Serge Abiteboul, et al. Adaptive on-line page importance computation, 2003, WWW '03.
[4] Hector Garcia-Molina, et al. Efficient Crawling Through URL Ordering, 1998, Comput. Networks.
[5] Marco Baroni, et al. Building general- and special-purpose corpora by Web crawling, 2006.
[6] Silvia Bernardini, et al. BootCaT: Bootstrapping Corpora and Terms from the Web, 2004, LREC.