The impact of class imbalance techniques on crashing fault residence prediction models

Zhou Xu | J. Keung | T. Zhang | Kunsong Zhao | Lei Xue | Meng Yan | Ming Fan

[1] Zhou Xu,et al. Effort-Aware Just-in-Time Bug Prediction for Mobile Apps Via Cross-Triplet Deep Feature Embedding , 2022, IEEE Transactions on Reliability.

[2] Zhou Xu,et al. A comprehensive investigation of the impact of feature selection techniques on crashing fault residence prediction models , 2021, Inf. Softw. Technol.

[3] Zhou Xu,et al. Predicting Crash Fault Residence via Simplified Deep Forest Based on A Reduced Feature Set , 2021, 2021 IEEE/ACM 29th International Conference on Program Comprehension (ICPC).

[4] Tao Zhang,et al. Simplified Deep Forest Model based Just-In-Time Defect Prediction for Android Mobile Apps , 2020, 2020 IEEE 20th International Conference on Software Quality, Reliability and Security (QRS).

[5] Xiaohong Zhang,et al. Imbalanced metric learning for crashing fault residence prediction , 2020, J. Syst. Softw.

[6] Xin Wang,et al. Detecting and Explaining Self-Admitted Technical Debts with Attention-based Neural Networks , 2020, 2020 35th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[7] Qingkai Shi,et al. Functional code clone detection with syntax and semantics fusion learning , 2020, ISSTA.

[8] Kay Chen Tan,et al. Understanding the Automated Parameter Optimization on Transfer Learning for Cross-Project Defect Prediction: An Empirical Study , 2020, 2020 IEEE/ACM 42nd International Conference on Software Engineering (ICSE).

[9] David Lo,et al. Chaff from the Wheat: Characterizing and Determining Valid Bug Reports , 2020, IEEE Transactions on Software Engineering.

[10] Qinbao Song,et al. A Comprehensive Investigation of the Role of Imbalanced Learning for Software Defect Prediction , 2019, IEEE Transactions on Software Engineering.

[11] Xiapu Luo,et al. LDFR: Learning deep feature representation for software defect prediction , 2019, J. Syst. Softw.

[12] Jin Liu,et al. Identifying Crashing Fault Residence Based on Cross Project Model , 2019, 2019 IEEE 30th International Symposium on Software Reliability Engineering (ISSRE).

[13] Tie-Yan Liu,et al. Self-paced Ensemble for Highly Imbalanced Massive Data Classification , 2019, 2020 IEEE 36th International Conference on Data Engineering (ICDE).

[14] Mozhan Soltani,et al. A benchmark-based evaluation of search-based crash reproduction , 2019, Empirical Software Engineering.

[15] J. Grundy,et al. Neural Network-based Detection of Self-Admitted Technical Debt , 2019, ACM Transactions on Software Engineering and Methodology.

[16] Leandro L. Minku,et al. Class Imbalance Evolution and Verification Latency in Just-in-Time Software Defect Prediction , 2019, 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE).

[17] Gemma Catolino,et al. Cross-Project Just-in-Time Bug Prediction for Mobile Apps: An Empirical Assessment , 2019, 2019 IEEE/ACM 6th International Conference on Mobile Software Engineering and Systems (MOBILESoft).

[18] Hongyu Zhang,et al. Does the fault reside in a stack trace? Assisting crash localization by predicting crashing fault residence , 2019, J. Syst. Softw.

[19] Shi Ying,et al. EH-Recommender: Recommending Exception Handling Strategies Based on Program Context , 2018, 2018 23rd International Conference on Engineering of Complex Computer Systems (ICECCS).

[20] Akito Monden,et al. On the relative value of data resampling approaches for software defect prediction , 2018, Empirical Software Engineering.

[21] Ming Wen,et al. ChangeLocator: locate crash-inducing changes based on crash reports , 2018, 2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE).

[22] Ahmed E. Hassan,et al. The Impact of Class Rebalancing Techniques on the Performance and Interpretation of Defect Prediction Models , 2018, IEEE Transactions on Software Engineering.

[23] Steffen Herbold,et al. Comments on ScottKnottESD in Response to “An Empirical Comparison of Model Validation Techniques for Defect Prediction Models” , 2017, IEEE Transactions on Software Engineering.

[24] Gemma Catolino,et al. Just-In-Time Bug Prediction in Mobile Applications: The Domain Matters! , 2017, 2017 IEEE/ACM 4th International Conference on Mobile Software Engineering and Systems (MOBILESoft).

[25] A. Panichella,et al. A guided genetic algorithm for automated crash reproduction , 2017, ICSE 2017.

[26] Tim Menzies,et al. Is "Better Data" Better Than "Better Data Miners"? , 2017, 2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE).

[27] A. Hamou-Lhadj,et al. A bug reproduction approach based on directed model checking and crash traces , 2017, J. Softw. Evol. Process.

[28] A. Hassan,et al. Studying just-in-time defect prediction using cross-project models , 2016, Empirical Software Engineering.

[29] Renaud Pawlak,et al. SPOON: A library for implementing analyses and transformations of Java source code , 2016, Softw. Pract. Exp.

[30] Luís Torgo,et al. A Survey of Predictive Modeling on Imbalanced Domains , 2016, ACM Comput. Surv.

[31] Yuan Yu,et al. TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[32] Martin Monperrus,et al. Crash reproduction via test case mutation: let existing test cases help , 2015, ESEC/SIGSOFT FSE.

[33] Baowen Xu,et al. Heterogeneous cross-company defect prediction by unified metric representation and CCA-based transfer learning , 2015, ESEC/SIGSOFT FSE.

[34] Sashank Dara,et al. Online Defect Prediction for Imbalanced Data , 2015, 2015 IEEE/ACM 37th IEEE International Conference on Software Engineering.

[35] Ning Chen,et al. STAR: Stack Trace Based Automatic Crash Reproduction via Symbolic Execution , 2015, IEEE Transactions on Software Engineering.

[36] Andrian Marcus,et al. On the Use of Stack Traces to Improve Text Retrieval-Based Bug Localization , 2014, 2014 IEEE International Conference on Software Maintenance and Evolution.

[37] Lu Zhang,et al. Boosting Bug-Report-Oriented Fault Localization with Segmentation and Stack-Trace Analysis , 2014, 2014 IEEE International Conference on Software Maintenance and Evolution.

[38] Rongxin Wu,et al. CrashLocator: locating crashing faults based on crash stacks , 2014, ISSTA 2014.

[39] Tony R. Martinez,et al. An instance level analysis of data complexity , 2014, Machine Learning.

[40] Liang Gong,et al. Locating Crashing Faults based on Crash Stack Traces , 2014, ArXiv.

[41] Audris Mockus,et al. A large-scale empirical study of just-in-time quality assurance , 2013, IEEE Transactions on Software Engineering.

[42] Sinno Jialin Pan,et al. Transfer defect learning , 2013, 2013 35th International Conference on Software Engineering (ICSE).

[43] Xin Yao,et al. Using Class Imbalance Learning for Software Defect Prediction , 2013, IEEE Transactions on Reliability.

[44] Gilles Louppe,et al. Ensembles on Random Patches , 2012, ECML/PKDD.

[45] Chih-Jen Lin,et al. Dual coordinate descent methods for logistic regression and maximum entropy models , 2011, Machine Learning.

[46] Foutse Khomh,et al. Classifying field crash reports for fixing bugs: A case study of Mozilla Firefox , 2011, 2011 27th IEEE International Conference on Software Maintenance (ICSM).

[47] Rahul Premraj,et al. Do stack traces help developers fix bugs? , 2010, 2010 7th IEEE Working Conference on Mining Software Repositories (MSR 2010).

[48] Hien M. Nguyen,et al. Borderline over-sampling for imbalanced data classification , 2009, Int. J. Knowl. Eng. Soft Data Paradigms.

[49] Xin Yao,et al. Diversity analysis on imbalanced data sets by using ensemble models , 2009, 2009 IEEE Symposium on Computational Intelligence and Data Mining.

[50] Haibo He,et al. ADASYN: Adaptive synthetic sampling approach for imbalanced learning , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[51] Friedrich Leisch,et al. A toolbox for K-centroids cluster analysis , 2006.

[52] Hui Han,et al. Borderline-SMOTE: A New Over-Sampling Method in Imbalanced Data Sets Learning , 2005, ICIC.

[53] Gustavo E. A. P. A. Batista,et al. A study of the behavior of several methods for balancing machine learning training data , 2004, SIGKDD Explorations.

[54] Nitesh V. Chawla,et al. SMOTEBoost: Improving Prediction of the Minority Class in Boosting , 2003, PKDD.

[55] Leo Breiman,et al. Random Forests , 2001, Machine Learning.

[56] Jorma Laurikkala,et al. Improving Identification of Difficult Small Classes by Balancing Class Distribution , 2001, AIME.

[57] Paul A. Viola,et al. Fast and Robust Classification using Asymmetric AdaBoost and a Detector Cascade , 2001, NIPS.

[58] Salvatore J. Stolfo,et al. AdaCost: Misclassification Cost-Sensitive Boosting , 1999, ICML.

[59] Johannes Fürnkranz,et al. Separate-and-Conquer Rule Learning , 1999, Artificial Intelligence Review.

[60] John Shawe-Taylor,et al. Optimizing Classifers for Imbalanced Training Sets , 1998, NIPS.

[61] Tin Kam Ho,et al. The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell.

[62] Yoav Freund,et al. A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[63] David W. Opitz,et al. An Empirical Evaluation of Bagging and Boosting , 1997, AAAI/IAAI.

[64] Geoffrey E. Hinton. Connectionist Learning Procedures , 1989, Artif. Intell.

[65] Dennis L. Wilson,et al. Asymptotic Properties of Nearest Neighbor Rules Using Edited Data , 1972, IEEE Trans. Syst. Man Cybern.

[66] Peter E. Hart,et al. The condensed nearest neighbor rule (Corresp.) , 1968, IEEE Trans. Inf. Theory.

[67] Seetha Hari,et al. Learning From Imbalanced Data , 2019, Advances in Computer and Electrical Engineering.

[68] Zhenchang Xing,et al. Neural Network-based Detection of Self-Admitted Technical Debt: From Performance to Explainability , 2019, ACM Trans. Softw. Eng. Methodol.

[69] Shane McIntosh,et al. An Empirical Comparison of Model Validation Techniques for Defect Prediction Models , 2017, IEEE Transactions on Software Engineering.

[70] Wei-Yin Loh,et al. Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov.

[71] Taghi M. Khoshgoftaar,et al. RUSBoost: A Hybrid Approach to Alleviating Class Imbalance , 2010, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[72] Zhi-Hua Zhou,et al. Exploratory Undersampling for Class-Imbalance Learning , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[73] Chao Chen,et al. Using Random Forest to Learn Imbalanced Data , 2004.

[74] Leo Breiman,et al. Bagging Predictors , 1996, Machine Learning.

[75] Ana L. C. Bazzan,et al. Balancing Training Data for Automated Annotation of Keywords: a Case Study , 2003, WOB.

[76] Nitesh V. Chawla,et al. SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res.

[77] John C. Platt,et al. Probabilistic Outputs for Support Vector Machines and Comparisons to Regularized Likelihood Methods , 1999.

[78] Stan Matwin,et al. Addressing the Curse of Imbalanced Training Sets: One-Sided Selection , 1997, ICML.

[79] S. Yitzhaki,et al. A note on the calculation and interpretation of the Gini index , 1984.

[80] I. Tomek. An Experiment with the Edited Nearest-Neighbor Rule , 1976.

[81] I. Tomek,et al. Two Modifications of CNN , 1976.

[82] Peter E. Hart,et al. Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.