Pauci ex tanto numero: reduce redundancy in multi-model ensembles

Abstract. We explicitly address the fundamental issue of member diversity in multi-model ensembles. To date, no attempts in this direction have been documented within the air quality (AQ) community despite the extensive use of ensembles in this field. Common biases and redundancy are the two issues directly deriving from lack of independence, undermining the significance of a multi-model ensemble, and are the subject of this study. Shared, dependant biases among models do not cancel out but will instead determine a biased ensemble. Redundancy derives from having too large a portion of common variance among the members of the ensemble, producing overconfidence in the predictions and underestimation of the uncertainty. The two issues of common biases and redundancy are analysed in detail using the AQMEII ensemble of AQ model results for four air pollutants in two European regions. We show that models share large portions of bias and variance, extending well beyond those induced by common inputs. We make use of several techniques to further show that subsets of models can explain the same amount of variance as the full ensemble with the advantage of being poorly correlated. Selecting the members for generating skilful, non-redundant ensembles from such subsets proved, however, non-trivial. We propose and discuss various methods of member selection and rate the ensemble performance they produce. In most cases, the full ensemble is outscored by the reduced ones. We conclude that, although independence of outputs may not always guarantee enhancement of scores (but this depends upon the skill being investigated), we discourage selecting the members of the ensemble simply on the basis of scores; that is, independence and skills need to be considered disjointly.

[1]  U. Grömping Estimators of Relative Importance in Linear Regression Based on Variance Decomposition , 2007 .

[2]  Zachary Pirtle,et al.  What does it mean when climate models agree? A case for assessing independence among general circulation models. , 2010 .

[3]  Peter Tiño,et al.  Managing Diversity in Regression Ensembles , 2005, J. Mach. Learn. Res..

[4]  Ludmila I. Kuncheva,et al.  Moderate diversity for better cluster ensembles , 2006, Inf. Fusion.

[5]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Douglas W. Nychka,et al.  Local eigenvalue analysis of CMIP3 climate model errors , 2008 .

[7]  R. Cattell The Scree Test For The Number Of Factors. , 1966, Multivariate behavioral research.

[8]  J. Elashoff,et al.  On the choice of variables in classification problems with dichotomous variables. , 1967, Biometrika.

[9]  Vivien Mallet,et al.  Automatic calibration of an ensemble for uncertainty estimation and probabilistic forecast: Application to air quality , 2011 .

[10]  Reto Knutti,et al.  Challenges in Combining Projections from Multiple Climate Models , 2010 .

[11]  Charles Doutriaux,et al.  Performance metrics for climate models , 2008 .

[12]  Michael D. Moran,et al.  Evaluating the capability of regional-scale air quality models to capture the vertical distribution of pollutants , 2013 .

[13]  S. Kotz,et al.  On a Relation between Principal Components and Regression Analysis , 1999 .

[14]  Thomas Reichler,et al.  On the Effective Number of Climate Models , 2011 .

[15]  Reto Knutti,et al.  The use of the multi-model ensemble in probabilistic climate projections , 2007, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[16]  Philippe Thunis,et al.  Skill and uncertainty of a regional air quality model ensemble , 2009 .

[17]  B. Booth,et al.  Selecting Ensemble Members to Provide Regional Climate Change Information , 2012 .

[18]  B. Santer,et al.  Selecting global climate models for regional climate change studies , 2009, Proceedings of the National Academy of Sciences.

[19]  Efisio Solazzo,et al.  E pluribus unum *: ensemble air quality predictions , 2013 .

[20]  Reto Knutti,et al.  The end of model democracy? , 2010 .

[21]  Angelo Ciaramella,et al.  On the systematic reduction of data complexity in multimodel atmospheric dispersion ensemble modeling , 2012 .

[22]  Xin Yao,et al.  Ensemble learning via negative correlation , 1999, Neural Networks.

[23]  Thomas M. Cover,et al.  The Best Two Independent Measurements Are Not the Two Best , 1974, IEEE Trans. Syst. Man Cybern..

[24]  Efisio Solazzo,et al.  ENSEMBLE and AMET: Two systems and approaches to a harmonized, simplified and efficient facility for air quality models development and evaluation , 2012 .

[25]  Tatsuya Akutsu,et al.  Efficient determination of cluster boundaries for analysis of gene expression profile data using hierarchical clustering and wavelet transform. , 2005, Genome informatics. International Conference on Genome Informatics.

[26]  H. Kaiser The Application of Electronic Computers to Factor Analysis , 1960 .

[27]  William J. Collins,et al.  Multimodel estimates of intercontinental source-receptor relationships for ozone pollution , 2008 .

[28]  A. Staniforth,et al.  The Operational CMC–MRB Global Environmental Multiscale (GEM) Model. Part I: Design Considerations and Formulation , 1998 .

[29]  Michael D. Moran,et al.  Evaluation of the meteorological forcing used for the Air Quality Model Evaluation International Initiative (AQMEII) air quality simulations , 2012 .

[30]  A. Nenes,et al.  ISORROPIA: A New Thermodynamic Equilibrium Model for Multiphase Multicomponent Inorganic Aerosols , 1998 .

[31]  Chris H. Q. Ding,et al.  Minimum redundancy feature selection from microarray gene expression data , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[32]  D. Stewart,et al.  A general canonical correlation index. , 1968, Psychological bulletin.

[33]  Slawomir Potempski,et al.  Multi-model ensemble analysis of the ETEX-2 experiment , 2008 .

[34]  Patrick J. F. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 2003 .

[35]  Chris H. Q. Ding,et al.  K-means clustering via principal component analysis , 2004, ICML.

[36]  Leiming Zhang,et al.  A size-segregated particle dry deposition scheme for an atmospheric aerosol module , 2001 .

[37]  Howard E. A. Tinsley,et al.  Multivariate Statistics and Mathematical Modeling , 2000 .

[38]  L. Guttman Some necessary conditions for common-factor analysis , 1954 .

[39]  John Irwin,et al.  A framework for evaluating regional-scale numerical photochemical modeling systems , 2010, Environmental fluid mechanics.

[40]  Carla E. Brodley,et al.  Solving cluster ensemble problems by bipartite graph partitioning , 2004, ICML.

[42]  Douglas W. Nychka,et al.  Local eigenvalue analysis of CMIP 3 climate model errors , 2007 .

[43]  Stefano Galmarini,et al.  Est modus in rebus : analytical properties of multi-model ensembles , 2009 .

[44]  P. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 1999 .

[45]  James D. Annan,et al.  Understanding the CMIP3 Multimodel Ensemble , 2011 .

[46]  C. Bretherton,et al.  The Effective Number of Spatial Degrees of Freedom of a Time-Varying Field , 1999 .

[47]  C. N. Hewitt,et al.  Biogenic emissions in Europe: 1. Estimates and uncertainties , 1995 .

[48]  H. Gunshin,et al.  A review of independent component analysis application to microarray gene expression data. , 2008, BioTechniques.

[49]  Patrick R. Zimmerman,et al.  Natural volatile organic compound emission rate estimates for U.S. woodland landscapes , 1994 .

[50]  K. Strimmer,et al.  Statistical Applications in Genetics and Molecular Biology High-Dimensional Regression and Variable Selection Using CAR Scores , 2011 .

[51]  Philippe Thunis,et al.  Evaluation of long-term ozone simulations from seven regional air quality models and their ensemble , 2007 .

[52]  J. C. McConnell,et al.  GEM-AQ, an on-line global multiscale chemical weather modelling system: model description and evaluation of gas phase chemistry processes , 2007 .

[53]  Sejong Yoon,et al.  Mutual information-based SVM-RFE for diagnostic classification of digitized mammograms , 2009, Pattern Recognit. Lett..

[54]  Michael D. Moran,et al.  Model evaluation and ensemble modelling of surface-level ozone in Europe and North America in the context of AQMEII , 2012 .

[55]  I. Jolliffe Principal Component Analysis , 2002 .

[56]  Gilbert Saporta,et al.  Comparing partitions of two sets of units based on the same variables , 2010, Adv. Data Anal. Classif..

[57]  Stefano Galmarini,et al.  Air Quality Model Evaluation International Initiative (AQMEII): Advancing the State of the Science in Regional Photochemical Modeling and Its Applications , 2011 .

[58]  Godfried T. Toussaint,et al.  Note on optimal selection of independent binary-valued features for pattern recognition (Corresp.) , 1971, IEEE Trans. Inf. Theory.

[59]  J. Annan,et al.  Reliability of the CMIP3 ensemble , 2010 .

[60]  Rohit Mathur,et al.  Air Quality Model Evaluation International Initiative (AQMEII): A Two-Continent Effort for the Evaluation of Regional Air Quality Models , 2014 .

[61]  G. Abramowitz Model independence in multi-model ensemble prediction , 2010 .

[62]  Steven D. Brown,et al.  Handbook of applied multivariate statistics and mathematical modeling , 2000 .