Eliminating noisy information in Web pages for data mining
[1] Michael R. Anderberg,et al. Cluster Analysis for Applications , 1973 .
[2] William A. Gale,et al. A sequential algorithm for training text classifiers , 1994, SIGIR '94.
[3] John D. Lafferty,et al. A Model of Lexical Attraction and Repulsion , 1997, ACL.
[4] Geoffrey Zweig,et al. Syntactic Clustering of the Web , 1997, Comput. Networks.
[5] Yiming Yang,et al. A Comparative Study on Feature Selection in Text Categorization , 1997, ICML.
[6] Andrew McCallum,et al. A comparison of event models for Naive Bayes text classification , 1998, AAAI 1998.
[7] Jon M. Kleinberg. Authoritative sources in a hyperlinked environment , 1999 .
[8] Nicholas Kushmerick,et al. Learning to remove Internet advertisements , 1999, AGENTS '99.
[9] Brian D. Davison. Recognizing Nepotistic Links on the Web , 2000 .
[10] Tok Wang Ling,et al. IntelliClean: a knowledge-based intelligent data cleaner , 2000, KDD '00.
[11] Un Yong Nahm, Mikhail Bilenko, Raymond J. Mooney. Two Approaches to Handling Noisy Variation in Text Mining , 2002 .
[12] Jan-Ming Ho,et al. Discovering informative content blocks from Web documents , 2002, KDD.
[13] Ming-Syan Chen,et al. Entropy-based link analysis for mining web informative structures , 2002, CIKM '02.
[14] Jiawei Han,et al. Data Mining for Web Intelligence , 2002, Computer.
[15] Ziv Bar-Yossef,et al. Template detection via data mining and its applications , 2002, WWW.
[16] Alejandro A. Vaisman,et al. Enhancing Web access using data mining techniques , 2003, 14th International Workshop on Database and Expert Systems Applications, 2003. Proceedings.
[17] Chaomei Chen,et al. Mining the Web: Discovering knowledge from hypertext data , 2004, J. Assoc. Inf. Sci. Technol.
[18] Dik Lun Lee,et al. Clustering search engine query log containing noisy clickthroughs , 2004, 2004 International Symposium on Applications and the Internet. Proceedings.
[19] John D. Lafferty,et al. Statistical Models for Text Segmentation , 1999, Machine Learning.