From Complex System Analysis to Pattern Recognition: Experimental Assessment of an Unsupervised Feature Extraction Method Based on the Relevance Index Metrics

The so-called Relevance Index (RI) metrics are a set of recently-introduced indicators based on information theory principles that can be used to analyze complex systems by detecting the main interacting structures within them. Such structures can be described as subsets of the variables which describe the system status that are strongly statistically correlated with one another and mostly independent of the rest of the system. The goal of the work described in this paper is to apply the same principles to pattern recognition and check whether the RI metrics can also identify, in a high-dimensional feature space, attribute subsets from which it is possible to build new features which can be effectively used for classification. Preliminary results indicating that this is possible have been obtained using the RI metrics in a supervised way, i.e., by separately applying such metrics to homogeneous datasets comprising data instances which all belong to the same class, and iterating the procedure over all possible classes taken into consideration. In this work, we checked whether this would also be possible in a totally unsupervised way, i.e., by considering all data available at the same time, independently of the class to which they belong, under the hypothesis that the peculiarities of the variable sets that the RI metrics can identify correspond to the peculiarities by which data belonging to a certain class are distinguishable from data belonging to different classes. The results we obtained in experiments made with some publicly available real-world datasets show that, especially when coupled to tree-based classifiers, the performance of an RI metrics-based unsupervised feature extraction method can be comparable to or better than other classical supervised or unsupervised feature selection or extraction methods.

[1]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[2]  Michele Amoretti,et al.  A Relevance Index Method to Infer Global Properties of Biological Networks , 2017, WIVACE.

[3]  Michele Amoretti,et al.  An Iterative Information-Theoretic Approach to the Detection of Structures in Complex Systems , 2018, Complex..

[4]  Hiroshi Motoda,et al.  Feature Selection Extraction and Construction , 2002 .

[5]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[6]  M. K. Fleming,et al.  Categorization of faces using unsupervised feature extraction , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[7]  Thomas M. Cover,et al.  Elements of Information Theory (Wiley Series in Telecommunications and Signal Processing) , 2006 .

[8]  Michele Amoretti,et al.  Can the Relevance Index be Used to Evolve Relevant Feature Sets? , 2018, EvoApplications.

[9]  Yuan Shi,et al.  Information-Theoretical Learning of Discriminative Clusters for Unsupervised Domain Adaptation , 2012, ICML.

[10]  Hongnian Yu,et al.  Mutual information based input feature selection for classification problems , 2012, Decis. Support Syst..

[11]  Cigdem Gunduz-Demir,et al.  Unsupervised Feature Extraction via Deep Learning for Histopathological Classification of Colon Tissue Images , 2019, IEEE Transactions on Medical Imaging.

[12]  A. Owen Empirical Likelihood Ratio Confidence Regions , 1990 .

[13]  Michele Amoretti,et al.  Efficient Search of Relevant Structures in Complex Systems , 2016, AI*IA.

[14]  Michele Amoretti,et al.  GPU-Based Parallel Search of Relevant Variable Sets in Complex Systems , 2016, WIVACE.

[15]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[16]  Yang Zhang,et al.  Unsupervised Feature Extraction for Time Series Clustering Using Orthogonal Wavelet Transform , 2006, Informatica.

[17]  J.C. Principe,et al.  A methodology for information theoretic feature extraction , 1998, 1998 IEEE International Joint Conference on Neural Networks Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36227).

[18]  Stefano Cagnoni,et al.  An Integration-Based Approach to Pattern Clustering and Classification , 2018, AI*IA.

[19]  Jon Atli Benediktsson,et al.  Joint bilateral filtering and spectral similarity-based sparse representation: A generic framework for effective feature extraction and data classification in hyperspectral imaging , 2017, Pattern Recognit..

[20]  Stefano Cagnoni,et al.  Social Relevance Index for Studying Communities in a Facebook Group of Patients , 2018, EvoApplications.

[21]  Antonina Starita,et al.  Particle swarm optimization for multimodal functions: a clustering approach , 2008 .

[22]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[23]  G. Edelman,et al.  A measure for brain complexity: relating functional segregation and integration in the nervous system. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[24]  D. P. Russell,et al.  Functional Clustering: Identifying Strongly Interactive Brain Regions in Neuroimaging Data , 1998, NeuroImage.

[25]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[26]  D. Goodin The cambridge dictionary of statistics , 1999 .

[27]  Luis O. Jimenez-Rodriguez,et al.  Unsupervised Linear Feature-Extraction Methods and Their Effects in the Classification of High-Dimensional Data , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[28]  Jun Yu,et al.  Local Deep-Feature Alignment for Unsupervised Dimension Reduction , 2018, IEEE Transactions on Image Processing.

[29]  Lizhe Wang,et al.  SuperPCA: A Superpixelwise PCA Approach for Unsupervised Feature Extraction of Hyperspectral Imagery , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[30]  S. S. Wilks The Large-Sample Distribution of the Likelihood Ratio for Testing Composite Hypotheses , 1938 .

[31]  Teuvo Kohonen,et al.  Self-Organizing Maps, Third Edition , 2001, Springer Series in Information Sciences.

[32]  A. Atiya,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2005, IEEE Transactions on Neural Networks.

[33]  Riccardo Poli,et al.  A Field Guide to Genetic Programming , 2008 .

[34]  Cardona Alzate,et al.  Predicción y selección de variables con bosques aleatorios en presencia de variables correlacionadas , 2020 .

[35]  D. Rajan Probability, Random Variables, and Stochastic Processes , 2017 .

[36]  Wu Jigang,et al.  Unsupervised feature extraction by low-rank and sparsity preserving embedding , 2019, Neural Networks.

[37]  Marco Fiorucci,et al.  Exploring the organisation of complex systems through the dynamical interactions among their relevant subsets , 2015, ECAL.

[38]  Y-h. Taguchi,et al.  Tensor decomposition-based and principal-component-analysis-based unsupervised feature extraction applied to the gene expression and methylation profiles in the brains of social insects with multiple castes , 2018, BMC Bioinformatics.

[39]  Michele Amoretti,et al.  Searching Relevant Variable Subsets in Complex Systems Using K-Means PSO , 2017, WIVACE.

[40]  Shiri Gordon,et al.  Unsupervised image-set clustering using an information theoretic framework , 2006, IEEE Transactions on Image Processing.

[41]  Marco Fiorucci,et al.  The Search for Candidate Relevant Subsets of Variables in Complex Systems , 2015, Artificial Life.

[42]  Roberto Serra,et al.  The detection of intermediate-level emergent structures and patterns , 2013, ECAL.

[43]  P. Greenwood,et al.  A Guide to Chi-Squared Testing , 1996 .

[44]  Christos N. Markides,et al.  Calibration of astigmatic particle tracking velocimetry based on generalized Gaussian feature extraction , 2019 .

[45]  Deniz Erdogmus,et al.  Feature extraction using information-theoretic learning , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Y-H Taguchi,et al.  Tensor Decomposition-Based Unsupervised Feature Extraction Can Identify the Universal Nature of Sequence-Nonspecific Off-Target Regulation of mRNA Mediated by MicroRNA Transfection , 2018, Cells.