Analyzing the Impact of Unbalanced Data on Web Spam Classification
暂无分享,去创建一个
Florentino Fernández Riverola | José Ramon Méndez | Rosalía Laza | Reyes Pavón | David Ruano-Ordás | Jorge Fdez-Glez | J. R. Méndez | F. F. Riverola | David Ruano-Ordás | Rosalía Laza | Reyes Pavón | Jorge Fdez-Glez
[1] Florentino Fernández Riverola,et al. Assessing the Suitability of MeSH Ontology for Classifying Medline Documents , 2011, PACBB.
[2] Kumar Chellapilla,et al. Fourth international workshop on adversarial information retrieval on the web (AIRWeb 2008) , 2008, WWW.
[3] Francisco Herrera,et al. Evolutionary-based selection of generalized instances for imbalanced classification , 2012, Knowl. Based Syst..
[4] Haibo He,et al. Learning from Imbalanced Data , 2009, IEEE Transactions on Knowledge and Data Engineering.
[5] Carlos Castillo,et al. Web spam identification through content and hyperlinks , 2008, AIRWeb '08.
[6] Ludovic Denoyer,et al. Web spam challenge 2008 , 2008, AIRWeb 2008.
[7] Marc Najork,et al. Detecting phrase-level duplication on the world wide web , 2005, SIGIR '05.
[8] András A. Benczúr,et al. Web spam classification: a few features worth more , 2011, WebQuality '11.
[9] András A. Benczúr,et al. SpamRank -- Fully Automatic Link Spam Detection , 2005, AIRWeb.
[10] Gustavo E. A. P. A. Batista,et al. A study of the behavior of several methods for balancing machine learning training data , 2004, SKDD.
[11] Robert C. Holte,et al. C4.5, Class Imbalance, and Cost Sensitivity: Why Under-Sampling beats Over-Sampling , 2003 .
[12] Chunheng Wang,et al. Boosting the Performance of Web Spam Detection with Ensemble Under-Sampling Classification , 2007, Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007).
[13] Marc Najork,et al. Detecting spam web pages through content analysis , 2006, WWW '06.
[14] Hector Garcia-Molina,et al. Link spam detection based on mass estimation , 2006, VLDB.
[15] Taghi M. Khoshgoftaar,et al. Experimental perspectives on learning from imbalanced data , 2007, ICML '07.
[16] Jaber Karimpour,et al. Web Spam Detection by Learning from Small Labeled Samples , 2012 .
[17] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.