Mining Web informative structures and contents based on entropy analysis
暂无分享,去创建一个
Ming-Syan Chen | Jan-Ming Ho | Hung-Yu Kao | Shian-Hua Lin | Ming-Syan Chen | Jan-Ming Ho | Hung-Yu Kao | Shian-Hua Lin | Hung-Yu kao
[1] Gerard Salton,et al. Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .
[2] Ramana Rao,et al. Silk from a sow's ear: extracting usable structures from the Web , 1996, CHI.
[3] Claire Cardie,et al. Empirical Methods in Information Extraction , 1997, AI Mag..
[4] Lee-Feng Chien,et al. PAT-tree-based keyword extraction for Chinese information retrieval , 1997, SIGIR '97.
[5] Geoffrey Zweig,et al. Syntactic Clustering of the Web , 1997, Comput. Networks.
[6] Filippo Neri,et al. Machine Learning for Information Extraction , 1997, SCIE.
[7] Nicholas Kushmerick,et al. Wrapper Induction for Information Extraction , 1997, IJCAI.
[8] Sergey Brin,et al. The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.
[9] Krishna Bharat,et al. Improved algorithms for topic distillation in a hyperlinked environment , 1998, SIGIR '98.
[10] Philip S. Yu,et al. Efficient Data Mining for Path Traversal Patterns , 1998, IEEE Trans. Knowl. Data Eng..
[11] Chun-Nan Hsu,et al. Generating Finite-State Transducers for Semi-Structured Data Extraction from the Web , 1998, Inf. Syst..
[12] Jon M. Kleinberg,et al. Automatic Resource Compilation by Analyzing Hyperlink Structure and Associated Text , 1998, Comput. Networks.
[13] M. KleinbergJon. Authoritative sources in a hyperlinked environment , 1999 .
[14] Craig A. Knoblock,et al. A hierarchical approach to wrapper induction , 1999, AGENTS '99.
[15] Nicholas Kushmerick,et al. Learning to remove Internet advertisements , 1999, AGENTS '99.
[16] Andrei Z. Broder,et al. Mirror, Mirror on the Web: A Study of Host Pairs with Replicated Content , 1999, Comput. Networks.
[17] Jon M. Kleinberg,et al. Mining the Web's Link Structure , 1999, Computer.
[18] Andrei Z. Broder,et al. A Comparison of Techniques to Find Mirrored Hosts on the WWW , 2000, IEEE Data Eng. Bull..
[19] Ke Wang,et al. Discovering Structural Association of Semistructured Data , 2000, IEEE Trans. Knowl. Data Eng..
[20] Loren G. Terveen,et al. Does “authority” mean quality? predicting expert quality ratings of Web documents , 2000, SIGIR '00.
[21] Shlomo Moran,et al. The stochastic approach for link-structure analysis (SALSA) and the TKC effect , 2000, Comput. Networks.
[22] Brian D. Davison. Recognizing Nepotistic Links on the Web , 2000 .
[23] Andrei Z. Broder,et al. Graph structure in the Web , 2000, Comput. Networks.
[24] Boris Chidlovskii. Wrapper generation by -reversible grammar induction , 2000 .
[25] Maarten de Rijke,et al. Wrapper Generation via Grammar Induction , 2000, ECML.
[26] Allan Borodin,et al. Finding authorities and hubs from link structures on the World Wide Web , 2001, WWW '01.
[27] Valter Crescenzi,et al. RoadRunner: Towards Automatic Data Extraction from Large Web Sites , 2001, VLDB.
[28] Soumen Chakrabarti,et al. Enhanced topic distillation using text, markup tags, and hyperlinks , 2001, SIGIR '01.
[29] Soumen Chakrabarti,et al. Integrating the document object model with hyperlinks for enhanced topic distillation and information extraction , 2001, WWW '01.
[30] Wen-Syan Li,et al. Constructing multi-granular and topic-focused web site maps , 2001, WWW '01.
[31] Joel C. Miller,et al. Modifications of Kleinberg's HITS algorithm using matrix exponentiation and web log records , 2001, SIGIR '01.
[32] Chia-Hui Chang,et al. IEPAD: information extraction based on pattern discovery , 2001, WWW '01.
[33] C. Nédellec. Machine Learning for Information Extraction , 2001 .
[34] Jan-Ming Ho,et al. Discovering informative content blocks from Web documents , 2002, KDD.
[35] Ming-Syan Chen,et al. Entropy-based link analysis for mining web informative structures , 2002, CIKM '02.
[36] Sang Joon Kim,et al. A Mathematical Theory of Communication , 2006 .
[37] Robert Richards,et al. Document Object Model (DOM) , 2006 .