The Relevance Index (RI) is an information theory-based measure that was originally defined to detect groups of functionally similar neurons, based on their dynamic behavior. More in general, considering the dynamical analysis of a generic complex system, the larger the RI value associated with a subset of variables, the more those variables are strongly correlated with one another and independent from the other variables describing the system status. We describe some early experiments to evaluate whether such an index can be used to extract relevant feature subsets in binary pattern classification problems. In particular, we used a PSO variant to efficiently explore the RI search space, whose size equals the number of possible variable subsets (in this case \(2^{104}\)) and find the most relevant and discriminating feature subsets with respect to pattern representation. We then turned such relevant subsets into a new smaller set of richer features, whose values depend on the values of the binary features they include. The paper reports some exploratory results we obtained in a simple character recognition task, comparing the performance of RI-based feature extraction and selection with other classical feature selection/extraction approaches.
[1]
D. P. Russell,et al.
Functional Clustering: Identifying Strongly Interactive Brain Regions in Neuroimaging Data
,
1998,
NeuroImage.
[2]
Xin Yao,et al.
A Survey on Evolutionary Computation Approaches to Feature Selection
,
2016,
IEEE Transactions on Evolutionary Computation.
[3]
Roberto Serra,et al.
The detection of intermediate-level emergent structures and patterns
,
2013,
ECAL.
[4]
Antonina Starita,et al.
Particle swarm optimization for multimodal functions: a clustering approach
,
2008
.
[5]
Michele Amoretti,et al.
Searching Relevant Variable Subsets in Complex Systems Using K-Means PSO
,
2017,
WIVACE.
[6]
Michele Amoretti,et al.
GPU-Based Parallel Search of Relevant Variable Sets in Complex Systems
,
2016,
WIVACE.
[7]
J. MacQueen.
Some methods for classification and analysis of multivariate observations
,
1967
.