Evaluation of Classification Methods

24.

[1]  Robert P. W. Duin,et al.  Efficient Multiclass ROC Approximation by Decomposition via Confusion Matrix Perturbation Analysis , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Y. Hochberg A sharper Bonferroni procedure for multiple tests of significance , 1988 .

[3]  W. W. Daniel,et al.  Applied Nonparametric Statistics , 1978 .

[4]  J. L. Hodges,et al.  Rank Methods for Combination of Independent Experiments in Analysis of Variance , 1962 .

[5]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[6]  B. Holland,et al.  An Improved Sequentially Rejective Bonferroni Test Procedure , 1987 .

[7]  K. Doksum Robust Procedures for Some Linear Models with one Observation per Cell , 1967 .

[8]  M. Friedman A Comparison of Alternative Tests of Significance for the Problem of $m$ Rankings , 1940 .

[9]  Yi-Zeng Liang,et al.  Monte Carlo cross validation , 2001 .

[10]  R. Iman,et al.  Approximations of the critical region of the fbietkan statistic , 1980 .

[11]  G. Hommel A stagewise rejective multiple test procedure based on a modified Bonferroni test , 1988 .

[12]  Francisco Herrera,et al.  Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: Experimental analysis of power , 2010, Inf. Sci..

[13]  Francisco Herrera,et al.  A unifying view on dataset shift in classification , 2012, Pattern Recognit..

[14]  G Hommel,et al.  A rapid algorithm and a computer program for multiple test procedures using logical structures of hypotheses. , 1994, Computer methods and programs in biomedicine.

[15]  D. Rom A sequentially rejective test procedure based on a modified Bonferroni inequality , 1990 .

[16]  José Hernández-Orallo,et al.  Volume under the ROC Surface for Multi-class Problems , 2003, ECML.

[17]  R. Tibshirani,et al.  Improvements on Cross-Validation: The 632+ Bootstrap Method , 1997 .

[18]  M. Friedman The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance , 1937 .

[19]  Z. Šidák Rectangular Confidence Regions for the Means of Multivariate Normal Distributions , 1967 .

[20]  N. A. Diamantidis,et al.  Unsupervised stratification of cross-validation for accuracy estimation , 2000, Artif. Intell..

[21]  Francisco Herrera,et al.  A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms , 2011, Swarm Evol. Comput..

[22]  J. Shaffer Modified Sequentially Rejective Multiple Test Procedures , 1986 .

[23]  D. Mossman Three-way ROCs , 1999, Medical decision making : an international journal of the Society for Medical Decision Making.

[24]  Thomas G. Dietterich Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[25]  D. Wolpert The Supervised Learning No-Free-Lunch Theorems , 2002 .

[26]  Francisco Herrera,et al.  A study on the use of statistical tests for experimentation with neural networks: Analysis of parametric test conditions and non-parametric tests , 2007, Expert Syst. Appl..

[27]  S. Larson The shrinkage of the coefficient of multiple correlation. , 1931 .

[28]  Luc Devroye,et al.  Distribution-free performance bounds for potential function rules , 1979, IEEE Trans. Inf. Theory.

[29]  Roger E. Kirk,et al.  Experimental design: Procedures for the behavioral sciences (3rd ed.). , 1995 .

[30]  E. Edgington,et al.  Randomization Tests (3rd ed.) , 1998 .

[31]  M. Keuls,et al.  The use of the „studentized range” in connection with an analysis of variance , 1952, Euphytica.

[32]  Angus M. Brown A new software for carrying out one-way ANOVA post hoc tests , 2005, Comput. Methods Programs Biomed..

[33]  Pavel Paclík,et al.  The ROC skeleton for multiclass ROC estimation , 2010, Pattern Recognit. Lett..

[34]  Stéphane Robin,et al.  Nonparametric density estimation by exact leave-p-out cross-validation , 2008, Comput. Stat. Data Anal..

[35]  Jianjun Li A two-step rejection procedure for testing multiple hypotheses , 2008 .

[36]  James Bailey,et al.  A Novel Scalable Multi-class ROC for Effective Visualization and Computation , 2010, PAKDD.

[37]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[38]  Robert P. W. Duin,et al.  Approximating the multiclass ROC by pairwise analysis , 2007, Pattern Recognit. Lett..

[39]  Francisco Herrera,et al.  Study on the Impact of Partition-Induced Dataset Shift on $k$-Fold Cross-Validation , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[40]  H. Finner On a Monotonicity Problem in Step-Down Multiple Test Procedures , 1993 .

[41]  Tony R. Martinez,et al.  Distribution-balanced stratified cross-validation for accuracy estimation , 2000, J. Exp. Theor. Artif. Intell..

[42]  David J. Hand,et al.  A Simple Generalisation of the Area Under the ROC Curve for Multiple Class Classification Problems , 2001, Machine Learning.

[43]  O. J. Dunn Multiple Comparisons among Means , 1961 .

[44]  H. Shimodaira,et al.  Improving predictive inference under covariate shift by weighting the log-likelihood function , 2000 .

[45]  Arie Ben-David,et al.  Comparison of classification accuracy using Cohen's Weighted Kappa , 2008, Expert Syst. Appl..

[46]  Zoran Bosnic,et al.  ROC analysis of classifiers in machine learning: A survey , 2013, Intell. Data Anal..

[47]  øöö Blockinøø Well-Trained PETs : Improving Probability Estimation , 2000 .

[48]  R. G. D. Steel,et al.  Tables for a Treatments Versus Control Multiple Comparisons Sign Test , 1965 .

[49]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[50]  Robert P. W. Duin,et al.  A simplified extension of the Area under the ROC to the multiclass domain , 2006 .

[51]  H. Scheffé A METHOD FOR JUDGING ALL CONTRASTS IN THE ANALYSIS OF VARIANCE , 1953 .

[52]  David J. Sheskin,et al.  Handbook of Parametric and Nonparametric Statistical Procedures , 1997 .

[53]  Thomas E. Nichols,et al.  Controlling the familywise error rate in functional neuroimaging: a comparative review , 2003, Statistical methods in medical research.

[54]  M. Stone Cross‐Validatory Choice and Assessment of Statistical Predictions , 1976 .

[55]  D. Quade Using Weighted Rankings in the Analysis of Complete Blocks with Additive Block Effects , 1979 .