The Effect of Oversampling and Undersampling on Classifying Imbalanced Text Datasets
暂无分享,去创建一个
[1] Peter E. Hart,et al. Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.
[2] Andrew McCallum,et al. A comparison of event models for naive bayes text classification , 1998, AAAI 1998.
[3] Charles X. Ling,et al. Data Mining for Direct Marketing: Problems and Solutions , 1998, KDD.
[4] Thorsten Joachims,et al. Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.
[5] B. Schölkopf,et al. Advances in kernel methods: support vector learning , 1999 .
[6] Nathalie Japkowicz,et al. Concept-Learning in the Presence of Between-Class and Within-Class Imbalances , 2001, Canadian Conference on AI.
[7] Evangelos E. Milios,et al. Using Unsupervised Learning to Guide Resampling in Imbalanced Data Sets , 2001, AISTATS.
[8] JapkowiczNathalie,et al. The class imbalance problem: A systematic study , 2002 .
[9] Nitesh V. Chawla,et al. SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..
[10] R. Barandelaa,et al. Strategies for learning in class imbalance problems , 2003, Pattern Recognit..
[11] R. Srihari,et al. Optimally Combining Positive and Negative Features for Text Categorization , 2003 .
[12] Robert C. Holte,et al. C4.5, Class Imbalance, and Cost Sensitivity: Why Under-Sampling beats Over-Sampling , 2003 .
[13] Marko Grobelnik,et al. Training text classifiers with SVM on very few positive examples , 2003 .
[14] Edward Y. Chang,et al. Class-Boundary Alignment for Imbalanced Dataset Learning , 2003 .
[15] Shi Zhong,et al. A Comparative Study of Generative Models for Document Clustering , 2003 .
[16] M. Maloof. Learning When Data Sets are Imbalanced and When Costs are Unequal and Unknown , 2003 .
[17] Foster J. Provost,et al. Learning When Training Data are Costly: The Effect of Class Distribution on Tree Induction , 2003, J. Artif. Intell. Res..
[18] Taeho Jo,et al. Class imbalances versus small disjuncts , 2004, SKDD.
[19] M. Dolores del Castillo,et al. A multistrategy approach for digital text categorization from imbalanced documents , 2004, SKDD.
[20] Gustavo E. A. P. A. Batista,et al. A study of the behavior of several methods for balancing machine learning training data , 2004, SKDD.
[21] Rohini K. Srihari,et al. Feature selection for text categorization on imbalanced data , 2004, SKDD.