论文信息 - Imbalanced Datasets: From Sampling to Classifiers - 字舞流文

Imbalanced Datasets: From Sampling to Classifiers

Yunqian Ma | Haibo He | Haibo He | Yunqian Ma

[1] Mehmet Kuzu,et al. Dynamic planning approach to automated web service composition , 2010, Applied Intelligence.

[2] Chumphol Bunkhumpornpat,et al. Safe-Level-SMOTE: Safe-Level-Synthetic Minority Over-Sampling TEchnique for Handling the Class Imbalanced Problem , 2009, PAKDD.

[3] Hisashi Kashima,et al. Roughly balanced bagging for imbalanced data , 2009, Stat. Anal. Data Min..

[4] David A. Cieslak,et al. Automatically countering imbalance and its empirical relationship to cost , 2008, Data Mining and Knowledge Discovery.

[5] David A. Cieslak,et al. Learning Decision Trees for Unbalanced Data , 2008, ECML/PKDD.

[6] Zhi-Hua Zhou,et al. Exploratory Under-Sampling for Class-Imbalance Learning , 2006, Sixth International Conference on Data Mining (ICDM'06).

[7] Mark Goadrich,et al. The relationship between Precision-Recall and ROC curves , 2006, ICML.

[8] Hui Han,et al. Borderline-SMOTE: A New Over-Sampling Method in Imbalanced Data Sets Learning , 2005, ICIC.

[9] Leo Breiman,et al. Bagging Predictors , 1996, Machine Learning.

[10] Nitesh V. Chawla,et al. Classification and knowledge discovery in protein databases , 2004, J. Biomed. Informatics.

[11] Taeho Jo,et al. Class imbalances versus small disjuncts , 2004, SKDD.

[12] Herna L. Viktor,et al. Learning from imbalanced data sets with boosting and data generation: the DataBoost-IM approach , 2004, SKDD.

[13] Gustavo E. A. P. A. Batista,et al. A study of the behavior of several methods for balancing machine learning training data , 2004, SKDD.

[14] Pedro M. Domingos,et al. Tree Induction for Probability-Based Ranking , 2003, Machine Learning.

[15] David J. Hand,et al. A Simple Generalisation of the Area Under the ROC Curve for Multiple Class Classification Problems , 2001, Machine Learning.

[16] Leo Breiman,et al. Random Forests , 2001, Machine Learning.

[17] Tom Fawcett,et al. Robust Classification for Imprecise Environments , 2000, Machine Learning.

[18] Eric Bauer,et al. An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants , 1999, Machine Learning.

[19] Nitesh V. Chawla,et al. SMOTEBoost: Improving Prediction of the Minority Class in Boosting , 2003, PKDD.

[20] Nitesh V. Chawla,et al. SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[21] Jorma Laurikkala,et al. Improving Identification of Difficult Small Classes by Balancing Class Distribution , 2001, AIME.

[22] Kai Ming Ting,et al. A Comparative Study of Cost-Sensitive Boosting Algorithms , 2000, ICML.

[23] Thomas G. Dietterich. Ensemble Methods in Machine Learning , 2000, Multiple Classifier Systems.

[24] An Empirical Study of MetaCost Using Boosting Algorithms , 2000, ECML.

[25] N. Japkowicz. Learning from Imbalanced Data Sets: A Comparison of Various Strategies * , 2000 .

[26] Tin Kam Ho,et al. The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[27] Andrew P. Bradley,et al. The use of the area under the ROC curve in the evaluation of machine learning algorithms , 1997, Pattern Recognit..

[28] Stan Matwin,et al. Addressing the Curse of Imbalanced Training Sets: One-Sided Selection , 1997, ICML.

[29] Yoav Freund,et al. Experiments with a New Boosting Algorithm , 1996, ICML.

[30] Fredric C. Gey,et al. The Relationship between Recall and Precision , 1994, J. Am. Soc. Inf. Sci..

[31] J. Ross Quinlan,et al. C4.5: Programs for Machine Learning , 1992 .

[32] Lars Kai Hansen,et al. Neural Network Ensembles , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[33] J A Swets,et al. Measuring the accuracy of diagnostic systems. , 1988, Science.

[34] James P. Egan,et al. Signal detection theory and ROC analysis , 1975 .

[35] Nils J. Nilsson,et al. A Formal Basis for the Heuristic Determination of Minimum Cost Paths , 1968, IEEE Trans. Syst. Sci. Cybern..