The Prevalence of Errors in Machine Learning Experiments
暂无分享,去创建一个
Allan Tucker | Ning Li | Steve Counsell | Andrea Capiluppi | Martin Shepperd | Yuchen Guo | Mahir Arzoky | Giuseppe Destefanis | Stephen Swift | Leila Yousefi | M. Shepperd | S. Counsell | A. Tucker | A. Capiluppi | Leila Yousefi | S. Swift | Mahir Arzoky | Giuseppe Destefanis | Yuchen Guo | Ning Li
[1] Pearl Brereton,et al. Evidence-Based Software Engineering and Systematic Reviews , 2015 .
[2] Denis Borenstein,et al. Is predatory publishing a real threat? Evidence from a large database study , 2018, Scientometrics.
[3] James A. J. Heathers,et al. The GRIM Test , 2017 .
[4] Banu Diri,et al. A systematic review of software fault prediction studies , 2009, Expert Syst. Appl..
[5] J. Brooks. Why most published research findings are false: Ioannidis JP, Department of Hygiene and Epidemiology, University of Ioannina School of Medicine, Ioannina, Greece , 2008 .
[6] Tracy Hall,et al. DConfusion: a technique to allow cross study performance evaluation of fault prediction studies , 2013, Automated Software Engineering.
[7] S. Lange,et al. Adjusting for multiple testing--when and how? , 2001, Journal of clinical epidemiology.
[8] Tracy Hall,et al. A Systematic Literature Review on Fault Prediction Performance in Software Engineering , 2012, IEEE Transactions on Software Engineering.
[9] David Colquhoun,et al. An investigation of the false discovery rate and the misinterpretation of p-values , 2014, Royal Society Open Science.
[10] Michèle B. Nuijten,et al. The prevalence of statistical reporting errors in psychology (1985–2013) , 2015, Behavior Research Methods.
[11] D. Trafimow,et al. Replication, falsification, and the crisis of confidence in social psychology , 2015, Front. Psychol..
[12] Marco Zaffalon,et al. Time for a change: a tutorial for comparing multiple classifiers through Bayesian analysis , 2016, J. Mach. Learn. Res..
[13] Janez Demsar,et al. Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..
[14] John P. A. Ioannidis,et al. A manifesto for reproducible science , 2017, Nature Human Behaviour.
[15] Ning Li,et al. A Systematic Review of Unsupervised Learning Techniques for Software Defect Prediction , 2019, Inf. Softw. Technol..
[16] Tracy Hall,et al. Researcher Bias: The Use of Machine Learning in Software Defect Prediction , 2014, IEEE Transactions on Software Engineering.
[17] Y. Benjamini,et al. Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .