Obtaining scalable and accurate classification in large-scale spatio-temporal domains

We present an approach for learning models that obtain accurate classification of data objects, collected in large-scale spatio-temporal domains. The model generation is structured in three phases: spatial dimension reduction, spatio-temporal features extraction, and feature selection. Novel techniques for the first two phases are presented, with two alternatives for the middle phase. We explore model generation based on the combinations of techniques from each phase. We apply the introduced methodology to data-sets from the Voltage-Sensitive Dye Imaging (VSDI) domain, where the resulting classification models successfully decode neuronal population responses in the visual cortex of behaving animals. VSDI is currently the best technique enabling simultaneous high spatial (10,000 points) and temporal (10 ms or less) resolution imaging from neuronal population in the cortex. We demonstrate that not only our approach is scalable enough to handle computationally challenging data, but it also contributes to the neuroimaging field of study with its decoding abilities. The effectiveness of our methodology is further explored on a data-set from the hurricanes domain, and a promising direction, based on the preliminary results of hurricane severity classification, is revealed.

[1]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[2]  Yongmin Li,et al.  Video classification using spatial-temporal features and PCA , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[3]  Ian Witten,et al.  Data Mining , 2000 .

[4]  Dinggang Shen,et al.  Classifying spatial patterns of brain activity with machine learning methods: Application to lie detection , 2005, NeuroImage.

[5]  F. Mörchen Time series feature extraction for data mining using DWT and DFT , 2003 .

[6]  Dimitrios Gunopulos,et al.  A Wavelet-Based Anytime Algorithm for K-Means Clustering of Time Series , 2003 .

[7]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[8]  Tom M. Mitchell,et al.  Classification in Very High Dimensional Problems with Handfuls of Examples , 2007, PKDD.

[9]  Glenn Fung,et al.  SVM Feature Selection for Classification of SPECT Images of Alzheimer's Disease Using Spatial Information , 2005, ICDM.

[10]  Stephen C. Strother,et al.  Predicting Motor Tasks in fMRI Data with Support Vector Machines , 2003 .

[11]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[12]  Sameer Singh EEG data classification with localised structural information , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[13]  F. Tong,et al.  Decoding Seen and Attended Motion Directions from Activity in the Human Visual Cortex , 2006, Current Biology.

[14]  C. Shahabi,et al.  A Supervised Feature Subset Selection Technique for Multivariate Time Series , 2005 .

[15]  Shashi Shekhar,et al.  Context inclusive function evaluation: a case study with EM-based multi-scale multi-granular image classification , 2009, Knowledge and Information Systems.

[16]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[17]  Pradeep Shenoy,et al.  Robust Classification of Electrocorticographic Signals for BCI , 2006 .

[18]  Raju S. Bapi,et al.  Detection of Cognitive States from fMRI Data Using Machine Learning Techniques , 2007, IJCAI.

[19]  Seungjin Choi,et al.  PCA+HMM+SVM for EEG pattern classification , 2003, Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings..

[20]  Tom M. Mitchell,et al.  Classifying Instantaneous Cognitive States from fMRI Data , 2003, AMIA.

[21]  Liqing Zhang,et al.  Temporal and Spatial Features of Single-Trial EEG for Brain-Computer Interface , 2007, Comput. Intell. Neurosci..

[22]  Dimitris Samaras,et al.  Exploiting Temporal Information in Functional Magnetic Resonance Imaging Brain Data , 2005, MICCAI.

[23]  Robert T. Schultz,et al.  Nonlinear Estimation and Modeling of fMRI Data Using Spatio-temporal Support Vector Regression , 2003, IPMI.

[24]  Tom M. Mitchell,et al.  Learning to Decode Cognitive States from Brain Images , 2004, Machine Learning.

[25]  Geoffrey M Boynton,et al.  Imaging orientation selectivity: decoding conscious perception in V1 , 2005, Nature Neuroscience.

[26]  G. Rees,et al.  Predicting the orientation of invisible stimuli from activity in human primary visual cortex , 2005, Nature Neuroscience.

[27]  Christos Faloutsos,et al.  PEGASUS: mining peta-scale graphs , 2011, Knowledge and Information Systems.

[28]  F. Tong,et al.  Decoding the visual and subjective contents of the human brain , 2005, Nature Neuroscience.

[29]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[30]  James Bailey,et al.  Discovering correlated spatio-temporal changes in evolving graphs , 2007, Knowledge and Information Systems.

[31]  Mark Palatucci,et al.  Temporal Feature Selection for fMRI Analysis , 2007 .

[32]  Yuval Shahar,et al.  Improving Worm Detection with Artificial Neural Networks through Feature Selection and Temporal Analysis Techniques , 2008 .

[33]  G. Rees,et al.  Predicting the Stream of Consciousness from Activity in Human Visual Cortex , 2005, Current Biology.

[34]  G. Rees,et al.  Neuroimaging: Decoding mental states from brain activity in humans , 2006, Nature Reviews Neuroscience.

[35]  Karl J. Friston,et al.  Dynamic discrimination analysis: A spatial–temporal SVM , 2007, NeuroImage.

[36]  Le Song,et al.  Supervised feature selection via dependence estimation , 2007, ICML '07.

[37]  R. Ward,et al.  Application of a hybrid wavelet feature selection method in the design of a self-paced brain interface system , 2007, Journal of NeuroEngineering and Rehabilitation.

[38]  C. Shahabi,et al.  A PCA-based Kernel for Kernel PCA on Multivariate Time Series , 2005 .

[39]  A. Grinvald,et al.  Long-term voltage-sensitive dye imaging reveals cortical dynamics in behaving monkeys. , 2002, Journal of neurophysiology.

[40]  E. Seidemann,et al.  Optimal decoding of correlated neural population responses in the primate visual cortex , 2006, Nature Neuroscience.

[41]  Gregory A. Miller,et al.  Classification of functional brain images with a spatio-temporal dissimilarity map , 2006, NeuroImage.

[42]  Kenneth R. Knapp,et al.  Scientific data stewardship of international satellite cloud climatology project B1 global geostationary observations , 2008 .

[43]  Tom M. Mitchell,et al.  Training fMRI Classifiers to Discriminate Cognitive States across Multiple Subjects , 2003, NIPS.

[44]  Marcel J. T. Reinders,et al.  Random subspace method for multivariate feature selection , 2006, Pattern Recognit. Lett..

[45]  Jon Atli Benediktsson,et al.  A joint spatial and spectral SVM’s classification of panchromatic images , 2007, 2007 IEEE International Geoscience and Remote Sensing Symposium.

[46]  Cyrus Shahabi,et al.  Feature Subset Selection on Multivariate Time Series with Extremely Large Spatial Features , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[47]  R. Simpson,et al.  The hurricane and its impact , 1981 .