Upgrading Model Selection Criteria with Goodness of Fit Tests for Practical Applications

The Bayesian information criterion (BIC), the Akaike information criterion (AIC), and some other indicators derived from them are widely used for model selection. In their original form, they contain the likelihood of the data given the models. Unfortunately, in many applications, it is practically impossible to calculate the likelihood, and, therefore, the criteria have been reformulated in terms of descriptive statistics of the residual distribution: the variance and the mean-squared error of the residuals. These alternative versions are strictly valid only in the presence of additive noise of Gaussian distribution, not a completely satisfactory assumption in many applications in science and engineering. Moreover, the variance and the mean-squared error are quite crude statistics of the residual distributions. More sophisticated statistical indicators, capable of better quantifying how close the residual distribution is to the noise, can be profitably used. In particular, specific goodness of fit tests have been included in the expressions of the traditional criteria and have proved to be very effective in improving their discriminating capability. These improved performances have been demonstrated with a systematic series of simulations using synthetic data for various classes of functions and different noise statistics.

[1]  A Murari,et al.  Mutual interaction of Faraday rotation and Cotton–Mouton phase shift in JET polarimetric measurements. , 2010, The Review of scientific instruments.

[2]  K. Pearson On the Criterion that a Given System of Deviations from the Probable in the Case of a Correlated System of Variables is Such that it Can be Reasonably Supposed to have Arisen from Random Sampling , 1900 .

[3]  Michela Gelfusa,et al.  Non-power law scaling for access to the H-mode in tokamaks via symbolic regression , 2013 .

[4]  Gregory W. Corder,et al.  Nonparametric Statistics for Non-Statisticians: A Step-by-Step Approach , 2009 .

[5]  Karl Pearson F.R.S. X. On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling , 2009 .

[6]  Carl Tim Kelley,et al.  Iterative methods for optimization , 1999, Frontiers in applied mathematics.

[7]  M. Bonamente Statistics and Analysis of Scientific Data , 2013, Graduate Texts in Physics.

[8]  P. Arena,et al.  Overview of the JET results , 2015 .

[9]  A. Murari,et al.  New frontiers of Forest Fire Protection : A portable Laser System (FfED). , 2013 .

[10]  Jet Efda Contributors,et al.  Development of an efficient real-time disruption predictor from scratch on JET and implications for ITER , 2013 .

[11]  T. W. Anderson,et al.  Asymptotic Theory of Certain "Goodness of Fit" Criteria Based on Stochastic Processes , 1952 .

[12]  M. Lungaroni,et al.  On the Use of Entropy to Improve Model Selection Criteria , 2019, Entropy.

[13]  Hartmut Bossel,et al.  Modeling and simulation , 1994 .

[14]  Michela Gelfusa,et al.  Extensive statistical analysis of ELMs on JET with a carbon wall , 2014 .

[15]  H. Akaike A new look at the statistical model identification , 1974 .

[16]  E. A. Milne,et al.  Physics and Philosophy , 2015, The Medico-chirurgical review.

[17]  Alex Karagrigoriou,et al.  Robust Model Selection Criteria Based on Pseudodistances , 2020, Entropy.

[18]  Maria Richetta,et al.  Design and development of a compact lidar/DIAL system for aerial surveillance of urban areas , 2013, Remote Sensing.

[19]  Michela Gelfusa,et al.  A statistical methodology to derive the scaling law for the H-mode power threshold using a large multi-machine database , 2012 .

[20]  A. Murari,et al.  Application of symbolic regression to the derivation of scaling laws for tokamak energy confinement time in terms of dimensionless quantities , 2016 .

[21]  A. Murari,et al.  Symbolic regression via genetic programming for data driven derivation of confinement scaling laws without any assumption on their mathematical form , 2014 .

[22]  Shun-ichi Amari,et al.  Methods of information geometry , 2000 .

[23]  G. Marsaglia,et al.  Evaluating Kolmogorov's distribution , 2003 .

[24]  Emmanuele Peluso,et al.  Maximum likelihood bolometric tomography for the determination of the uncertainties in the radiation emission on JET TOKAMAK. , 2018, The Review of scientific instruments.

[25]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[26]  M. Peruggia Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach (2nd ed.) , 2003 .

[27]  Michela Gelfusa,et al.  Clustering based on the geodesic distance on Gaussian manifolds for the automatic classification of disruptions , 2013 .

[28]  D. E. Coleman How To Test Normality And Other Distributional Assumptions , 1986 .

[29]  P. C. de Vries,et al.  Towards the realization on JET of an integrated H-mode scenario for ITER , 2002 .

[30]  G. Claeskens Statistical Model Choice , 2016 .

[31]  Andrea Murari,et al.  Geodesic distance on Gaussian manifolds for the robust identification of chaotic systems , 2016 .

[32]  A. Murari,et al.  A comparison of four reconstruction methods for JET neutron and gamma tomography , 2009 .