The Emerging "Big Dimensionality"

The world continues to generate quintillion bytes of data daily, leading to the pressing needs for new efforts in dealing with the grand challenges brought by Big Data. Today, there is a growing consensus among the computational intelligence communities that data volume presents an immediate challenge pertaining to the scalability issue. However, when addressing volume in Big Data analytics, researchers in the data analytics community have largely taken a one-sided study of volume, which is the "Big Instance Size" factor of the data. The flip side of volume which is the dimensionality factor of Big Data, on the other hand, has received much lesser attention. This article thus represents an attempt to fill in this gap and places special focus on this relatively under-explored topic of "Big Dimensionality", wherein the explosion of features (variables) brings about new challenges to computational intelligence. We begin with an analysis on the origins of Big Dimensionality. The evolution of feature dimensionality in the last two decades is then studied using popular data repositories considered in the data analytics and computational intelligence research communities. Subsequently, the state-of-the-art feature selection schemes reported in the field of computational intelligence are reviewed to reveal the inadequacies of existing approaches in keeping pace with the emerging phenomenon of Big Dimensionality. Last but not least, the "curse and blessing of Big Dimensionality" are delineated and deliberated.

[1]  Alex Alves Freitas,et al.  A New Sequential Covering Strategy for Inducing Classification Rules With Ant Colony Algorithms , 2013, IEEE Transactions on Evolutionary Computation.

[2]  Hannu Koivisto,et al.  A Dynamically Constrained Multiobjective Genetic Fuzzy System for Regression Problems , 2010, IEEE Transactions on Fuzzy Systems.

[3]  Francisco Herrera,et al.  A Fast and Scalable Multiobjective Genetic Fuzzy System for Linguistic Fuzzy Modeling in High-Dimensional Regression Problems , 2011, IEEE Transactions on Fuzzy Systems.

[4]  Zhengming Ma,et al.  Local Coordinates Alignment With Global Preservation for Dimensionality Reduction , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[5]  C. Carcassi,et al.  Psoriasis is associated with a SNP haplotype of the corneodesmosin gene (CDSN). , 2002, Tissue antigens.

[6]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[7]  Huan Liu,et al.  Chi2: feature selection and discretization of numeric attributes , 1995, Proceedings of 7th IEEE International Conference on Tools with Artificial Intelligence.

[8]  Hiroshi Mamitsuka,et al.  Boosted Network Classifiers for Local Feature Selection , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[9]  Jun Zhang,et al.  Relevance Units Latent Variable Model and Nonlinear Dimensionality Reduction , 2010, IEEE Transactions on Neural Networks.

[10]  Ivor W. Tsang,et al.  Efficient Multitemplate Learning for Structured Prediction , 2011, IEEE Transactions on Neural Networks and Learning Systems.

[11]  Ivor W. Tsang,et al.  Minimax Sparse Logistic Regression for Very High-Dimensional Feature Selection , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[12]  J. Manyika Big data: The next frontier for innovation, competition, and productivity , 2011 .

[13]  Feiping Nie,et al.  Discriminative Least Squares Regression for Multiclass Classification and Feature Selection , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[14]  Michela Antonelli,et al.  Genetic Training Instance Selection in Multiobjective Evolutionary Fuzzy Systems: A Coevolutionary Approach , 2012, IEEE Transactions on Fuzzy Systems.

[15]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[16]  Suprakash Datta,et al.  Evolved Features for DNA Sequence Classification and Their Fitness Landscapes , 2013, IEEE Transactions on Evolutionary Computation.

[17]  Mikael Collan,et al.  A Practical Approach to R&D Portfolio Selection Using the Fuzzy Pay-Off Method , 2012, IEEE Transactions on Fuzzy Systems.

[18]  André Stuhlsatz,et al.  Feature Extraction With Deep Neural Networks by a Generalized Discriminant Analysis , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[19]  Eric C. Rouchka,et al.  Reduced HyperBF Networks: Regularization by Explicit Complexity Reduction and Scaled Rprop-Based Training , 2011, IEEE Transactions on Neural Networks.

[20]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[21]  Narasimhan Sundararajan,et al.  A Metacognitive Neuro-Fuzzy Inference System (McFIS) for Sequential Classification Problems , 2013, IEEE Transactions on Fuzzy Systems.

[22]  Zhen Ji,et al.  Towards a Memetic Feature Selection Paradigm [Application Notes] , 2010, IEEE Computational Intelligence Magazine.

[23]  Xiaowei Yang,et al.  A Kernel Fuzzy c-Means Clustering-Based Fuzzy Support Vector Machine Algorithm for Classification Problems With Outliers or Noises , 2011, IEEE Transactions on Fuzzy Systems.

[24]  Cordelia Schmid,et al.  A performance evaluation of local descriptors , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[26]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[27]  James T. Kwok,et al.  Clustered Nyström Method for Large Scale Manifold Learning and Dimension Reduction , 2010, IEEE Transactions on Neural Networks.

[28]  Xiaoming Zhang,et al.  Feature Fusion Using Locally Linear Embedding for Classification , 2010, IEEE Transactions on Neural Networks.

[29]  Gaël Richard,et al.  Multiclass Feature Selection With Kernel Gram-Matrix-Based Criteria , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[30]  Witold Pedrycz,et al.  Clustering Spatiotemporal Data: An Augmented Fuzzy C-Means , 2013, IEEE Transactions on Fuzzy Systems.

[31]  Tapabrata Ray,et al.  A Pareto Corner Search Evolutionary Algorithm and Dimensionality Reduction in Many-Objective Optimization Problems , 2011, IEEE Transactions on Evolutionary Computation.

[32]  Mengjie Zhang,et al.  Parent Selection Pressure Auto-Tuning for Tournament Selection in Genetic Programming , 2013, IEEE Transactions on Evolutionary Computation.

[33]  Qingfu Zhang,et al.  Objective Reduction in Many-Objective Optimization: Linear and Nonlinear Algorithms , 2013, IEEE Transactions on Evolutionary Computation.

[34]  Nikhil R. Pal,et al.  An Integrated Mechanism for Feature Selection and Fuzzy Rule Extraction for Classification , 2012, IEEE Transactions on Fuzzy Systems.

[35]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Ivor W. Tsang,et al.  Learning Sparse SVM for Feature Selection on Very High Dimensional Datasets , 2010, ICML.

[37]  Hiroshi Someya,et al.  Striking a Mean- and Parent-Centric Balance in Real-Valued Crossover Operators , 2013, IEEE Transactions on Evolutionary Computation.

[38]  Bernhard Sendhoff,et al.  Generalizing Surrogate-Assisted Evolutionary Computation , 2010, IEEE Transactions on Evolutionary Computation.

[39]  Pat Langley,et al.  Selection of Relevant Features and Examples in Machine Learning , 1997, Artif. Intell..

[40]  Stefano Soatto,et al.  Quick Shift and Kernel Methods for Mode Seeking , 2008, ECCV.

[41]  Ivor W. Tsang,et al.  A Feature Selection Method for Multivariate Performance Measures , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  Huan Liu,et al.  Advancing feature selection research , 2010 .

[43]  Gregory Piatetsky-Shapiro,et al.  High-Dimensional Data Analysis: The Curses and Blessings of Dimensionality , 2000 .

[44]  Robert Tibshirani,et al.  1-norm Support Vector Machines , 2003, NIPS.

[45]  James L. Crowley,et al.  A Representation for Shape Based on Peaks and Ridges in the Difference of Low-Pass Transform , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  P. Bickel,et al.  Some theory for Fisher''s linear discriminant function , 2004 .

[47]  Kay Chen Tan,et al.  Multimodal Optimization Using a Biobjective Differential Evolution Algorithm Enhanced With Mean Distance-Based Selection , 2013, IEEE Transactions on Evolutionary Computation.

[48]  Cordelia Schmid,et al.  Action recognition by dense trajectories , 2011, CVPR 2011.

[49]  Z. Zenn Bien,et al.  Representation of a Fisher Criterion Function in a Kernel Feature Space , 2010, IEEE Transactions on Neural Networks.

[50]  Jieping Ye,et al.  Feature grouping and selection over an undirected graph , 2012, KDD.

[51]  Mark Johnston,et al.  Evolving Diverse Ensembles Using Genetic Programming for Classification With Unbalanced Data , 2013, IEEE Transactions on Evolutionary Computation.

[52]  Lei Wang,et al.  Feature Selection With Redundancy-Constrained Class Separability , 2010, IEEE Transactions on Neural Networks.

[53]  Leon Wenliang Zhong,et al.  Efficient Sparse Modeling With Automatic Feature Grouping , 2011, IEEE Transactions on Neural Networks and Learning Systems.

[54]  Marimuthu Palaniswami,et al.  Fuzzy c-Means Algorithms for Very Large Data , 2012, IEEE Transactions on Fuzzy Systems.

[55]  Tim Blackwell,et al.  A Study of Collapse in Bare Bones Particle Swarm Optimization , 2012, IEEE Transactions on Evolutionary Computation.

[56]  Ata Kabán,et al.  Sharp Generalization Error Bounds for Randomly-projected Classifiers , 2013, ICML.

[57]  Luciano Sbaiz,et al.  Finding meaning on YouTube: Tag recommendation and category discovery , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[58]  Guido Sanguinetti,et al.  Semisupervised Multitask Learning With Gaussian Processes , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[59]  Kay Chen Tan,et al.  A Multi-Facet Survey on Memetic Computation , 2011, IEEE Transactions on Evolutionary Computation.

[60]  Derek C. Rose,et al.  Deep Machine Learning - A New Frontier in Artificial Intelligence Research [Research Frontier] , 2010, IEEE Computational Intelligence Magazine.

[61]  Iickho Song,et al.  Complexity-Reduced Scheme for Feature Extraction With Linear Discriminant Analysis , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[62]  Ioannis Patras,et al.  Tree-Structured Feature Extraction Using Mutual Information , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[63]  Jesús Alcalá-Fdez,et al.  A Fuzzy Association Rule-Based Classification Model for High-Dimensional Problems With Genetic Rule Selection and Lateral Tuning , 2011, IEEE Transactions on Fuzzy Systems.

[64]  Glenn Fung,et al.  A Feature Selection Newton Method for Support Vector Machine Classification , 2004, Comput. Optim. Appl..

[65]  Grzegorz Dudek,et al.  An Artificial Immune System for Classification With Local Feature Selection , 2012, IEEE Transactions on Evolutionary Computation.

[66]  Azriel Rosenfeld,et al.  Computer Vision , 1988, Adv. Comput..

[67]  Xiaodong Li,et al.  Cooperatively Coevolving Particle Swarms for Large Scale Optimization , 2012, IEEE Transactions on Evolutionary Computation.

[68]  Ata Kabán Fractional Norm Regularization: Learning With Very Few Relevant Features , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[69]  Aixia Guo,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2014 .

[70]  Ramón Díaz-Uriarte,et al.  Gene selection and classification of microarray data using random forest , 2006, BMC Bioinformatics.

[71]  Charles X. Ling,et al.  Constructing New and Better Evaluation Measures for Machine Learning , 2007, IJCAI.

[72]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[73]  Paul S. Bradley,et al.  Feature Selection via Concave Minimization and Support Vector Machines , 1998, ICML.

[74]  Qinghua Hu,et al.  Feature Selection for Monotonic Classification , 2012, IEEE Transactions on Fuzzy Systems.

[75]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[76]  Dan Roth,et al.  On generalization bounds, projection profile, and margin distribution , 2002, ICML.

[77]  Cordelia Schmid,et al.  Learning realistic human actions from movies , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[78]  Terence Tao,et al.  The Dantzig selector: Statistical estimation when P is much larger than n , 2005, math/0506081.

[79]  Francisco Herrera,et al.  Integration of an Index to Preserve the Semantic Interpretability in the Multiobjective Evolutionary Rule Selection and Tuning of Linguistic Fuzzy Systems , 2010, IEEE Transactions on Fuzzy Systems.

[80]  Ivor W. Tsang,et al.  Transductive Ordinal Regression , 2011, IEEE Transactions on Neural Networks and Learning Systems.

[81]  Graham Kendall,et al.  Grammatical Evolution of Local Search Heuristics , 2012, IEEE Transactions on Evolutionary Computation.

[82]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[83]  Dimitris K. Tasoulis,et al.  Enhancing Differential Evolution Utilizing Proximity-Based Mutation Operators , 2011, IEEE Transactions on Evolutionary Computation.

[84]  Chin-Teng Lin,et al.  LDA-Based Clustering Algorithm and Its Application to an Unsupervised Feature Extraction , 2011, IEEE Transactions on Fuzzy Systems.

[85]  H. Bondell,et al.  Simultaneous Regression Shrinkage, Variable Selection, and Supervised Clustering of Predictors with OSCAR , 2008, Biometrics.

[86]  Joel Dudley,et al.  Identification of Discriminating Biomarkers for Human Disease Using Integrative Network Biology , 2008, Pacific Symposium on Biocomputing.

[87]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[88]  Peter Tiño,et al.  Scaling Up Estimation of Distribution Algorithms for Continuous Optimization , 2011, IEEE Transactions on Evolutionary Computation.

[89]  Jian-Bo Yang,et al.  Feature Selection Using Probabilistic Prediction of Support Vector Regression , 2011, IEEE Transactions on Neural Networks.

[90]  Yew-Soon Ong,et al.  A proposition on memes and meta-memes in computing for higher-order learning , 2009, Memetic Comput..

[91]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[92]  Francisco Herrera,et al.  IVTURS: A Linguistic Fuzzy Rule-Based Classification System Based On a New Interval-Valued Fuzzy Reasoning Method With Tuning and Rule Selection , 2013, IEEE Transactions on Fuzzy Systems.

[93]  Hua Huang,et al.  Super-Resolution Method for Face Recognition Using Nonlinear Mappings on Coherent Features , 2011, IEEE Transactions on Neural Networks.

[94]  Sebastian Nowozin,et al.  On feature combination for multiclass object classification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[95]  Junzo Watada,et al.  Fuzzy-Portfolio-Selection Models With Value-at-Risk , 2011, IEEE Transactions on Fuzzy Systems.

[96]  Feiping Nie,et al.  Trace Ratio Criterion for Feature Selection , 2008, AAAI.

[97]  Mengjie Zhang,et al.  A Filter Approach to Multiple Feature Construction for Symbolic Learning Classifiers Using Genetic Programming , 2012, IEEE Transactions on Evolutionary Computation.

[98]  Ivor W. Tsang,et al.  Discovering Support and Affiliated Features from Very High Dimensions , 2012, ICML.

[99]  E. Xing,et al.  Statistical Estimation of Correlated Genome Associations to a Quantitative Trait Network , 2009, PLoS genetics.

[100]  Chidchanok Lursinsap,et al.  A Discrimination Analysis for Unsupervised Feature Selection via Optic Diffraction Principle , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[101]  Bir Bhanu,et al.  Image retrieval with feature selection and relevance feedback , 2010, 2010 IEEE International Conference on Image Processing.

[102]  A. Brookes The essence of SNPs. , 1999, Gene.

[103]  Kristin A. Cook,et al.  Illuminating the Path: The Research and Development Agenda for Visual Analytics , 2005 .

[104]  George Kesidis,et al.  Margin-Maximizing Feature Elimination Methods for Linear and Nonlinear Kernel-Based Discriminant Functions , 2010, IEEE Transactions on Neural Networks.

[105]  Dong Xu,et al.  Semi-Supervised Dimension Reduction Using Trace Ratio Criterion , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[106]  Daniel E. O'Leary,et al.  Artificial Intelligence and Big Data , 2013, IEEE Intelligent Systems.

[107]  Edwin Lughofer,et al.  Reliable All-Pairs Evolving Fuzzy Classifiers , 2013, IEEE Transactions on Fuzzy Systems.

[108]  Zhaohong Deng,et al.  Scalable TSK Fuzzy Modeling for Very Large Datasets Using Minimal-Enclosing-Ball Approximation , 2011, IEEE Transactions on Fuzzy Systems.

[109]  Michael F. P. O'Boyle,et al.  Automatic Feature Generation for Machine Learning Based Optimizing Compilation , 2009, 2009 International Symposium on Code Generation and Optimization.

[110]  Nicu Sebe,et al.  Feature Weighting via Optimal Thresholding for Video Analysis , 2013, 2013 IEEE International Conference on Computer Vision.