Bias analysis in text classification for highly skewed data
暂无分享,去创建一个
[1] J. Ross Quinlan,et al. C4.5: Programs for Machine Learning , 1992 .
[2] Ian H. Witten,et al. Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.
[3] Malik Yousef,et al. One-Class SVMs for Document Classification , 2002, J. Mach. Learn. Res..
[4] Rohini K. Srihari,et al. Feature selection for text categorization on imbalanced data , 2004, SKDD.
[5] Foster J. Provost,et al. Learning When Training Data are Costly: The Effect of Class Distribution on Tree Induction , 2003, J. Artif. Intell. Res..
[6] Edward Y. Chang,et al. Adaptive Feature-Space Conformal Transformation for Imbalanced-Data Learning , 2003, ICML.
[7] William A. Gale,et al. A sequential algorithm for training text classifiers , 1994, SIGIR '94.
[8] Edward Y. Chang,et al. Class-Boundary Alignment for Imbalanced Dataset Learning , 2003 .
[9] Tom Fawcett,et al. Robust Classification for Imprecise Environments , 2000, Machine Learning.
[10] Salvatore J. Stolfo,et al. Toward Scalable Learning with Non-Uniform Class and Cost Distributions: A Case Study in Credit Card Fraud Detection , 1998, KDD.
[11] Bernhard Schölkopf,et al. Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.
[12] Nathalie Japkowicz,et al. The class imbalance problem: A systematic study , 2002, Intell. Data Anal..
[13] John Shawe-Taylor,et al. Optimizing Classifers for Imbalanced Training Sets , 1998, NIPS.
[14] Herna L. Viktor,et al. Learning from imbalanced data sets with boosting and data generation: the DataBoost-IM approach , 2004, SKDD.
[15] Andrew McCallum,et al. A comparison of event models for naive bayes text classification , 1998, AAAI 1998.
[16] Damminda Alahakoon,et al. Minority report in fraud detection: classification of skewed data , 2004, SKDD.
[17] Nitesh V. Chawla,et al. SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..
[18] M. Maloof. Learning When Data Sets are Imbalanced and When Costs are Unequal and Unknown , 2003 .
[19] Tat-Seng Chua,et al. A maximal figure-of-merit learning approach to text categorization , 2003, SIGIR.
[20] Vipin Kumar,et al. Mining needle in a haystack: classifying rare classes via two-phase rule induction , 2001, SIGMOD '01.
[21] Gary M. Weiss. Mining with rarity: a unifying framework , 2004, SKDD.
[22] Charles Elkan,et al. The Foundations of Cost-Sensitive Learning , 2001, IJCAI.
[23] Yiming Yang,et al. A re-examination of text categorization methods , 1999, SIGIR '99.
[24] George Forman,et al. An Extensive Empirical Study of Feature Selection Metrics for Text Classification , 2003, J. Mach. Learn. Res..
[25] Gustavo E. A. P. A. Batista,et al. A study of the behavior of several methods for balancing machine learning training data , 2004, SKDD.
[26] Dunja Mladenic,et al. Feature Selection for Unbalanced Class Distribution and Naive Bayes , 1999, ICML.
[27] Stan Matwin,et al. Machine Learning for the Detection of Oil Spills in Satellite Radar Images , 1998, Machine Learning.
[28] Stan Matwin,et al. Addressing the Curse of Imbalanced Training Sets: One-Sided Selection , 1997, ICML.
[29] Charles X. Ling,et al. Data Mining for Direct Marketing: Problems and Solutions , 1998, KDD.
[30] Yiming Yang,et al. A Comparative Study on Feature Selection in Text Categorization , 1997, ICML.
[31] Thorsten Joachims,et al. Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.