A COMPUTATIONAL FRAMEWORK FOR DISCRIMINATIVE ANALYSIS OF HIGH DIMENSIONAL BIOMEDICAL IMAGE DATA

In this work we propose, implement and evaluate a discriminative computational pipeline to address the problem of feature generation, feature screening and feature subset selection from biomedical datasets when the ratio of feature dimensions (possibly in millions) to number of samples is very high . The proposed pipeline is modular and can be highly parallel. The framework is applied to a variety of real world discrimination problems including plant specie classification from 2D leaf images, Gender/age/expression/human identification from 3D facial surface mesh and Gender/Age/Disease classification from 3D neural Magnetic Resonance Imaging (MRI) images. By using a unique set of novel features for each application we either achieve better or competitive classification accuracy than existing work or set new benchmarks otherwise. We can also locate discriminative regions for faces and brains which facilitate further scientific discoveries. This work illustrates quantitatively, the effectiveness and the diversity of the proposed feature extraction and machine learning pipelines.

[1]  Chris H. Q. Ding,et al.  Evolving Feature Selection , 2005, IEEE Intell. Syst..

[2]  Hasan Demirel,et al.  Facial Expression Recognition Using 3D Facial Feature Distances , 2007, ICIAR.

[3]  Hee-Seok Oh,et al.  CVTresh: R Package for Level-Dependent Cross-Validation Thresholding , 2006 .

[4]  Huan Liu,et al.  A Probabilistic Approach to Feature Selection - A Filter Solution , 1996, ICML.

[5]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Cristina Conde,et al.  Automatic 3D Face Feature Points Extraction with Spin Images , 2006, ICIAR.

[7]  D. Perrett,et al.  What Gives a Face its Gender? , 1993, Perception.

[8]  Patrick Henry Winston,et al.  The psychology of computer vision , 1976, Pattern Recognit..

[9]  Yanxi Liu,et al.  Local facial asymmetry for expression classification , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[10]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[11]  Jonny Eriksson,et al.  Feature reduction for classification of multidimensional data , 2000, Pattern Recognit..

[12]  Xiuwen Liu,et al.  Shape of Elastic Strings in Euclidean Space , 2008, International Journal of Computer Vision.

[13]  Yanxi Liu,et al.  Discriminative MR Image Feature Analysis for Automatic Schizophrenia and Alzheimer's Disease Classification , 2004, MICCAI.

[14]  Li Wei,et al.  SAXually Explicit Images: Finding Unusual Shapes , 2006, Sixth International Conference on Data Mining (ICDM'06).

[15]  Yanxi Liu,et al.  Classification Driven Semantic Based Medical Image Indexing and Retrieval , 1999 .

[16]  Javier F. Palatnik,et al.  Control of leaf morphogenesis by microRNAs , 2003, Nature.

[17]  Yukiko Kenmochi,et al.  Facial expression analysis from 3D range images; comparison with the analysis from 2D images and their integration , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[18]  Pierre Alliez,et al.  Anisotropic polygonal remeshing , 2003, ACM Trans. Graph..

[19]  Stanley J. Reeves,et al.  A cross-validation framework for solving image restoration problems , 1992, J. Vis. Commun. Image Represent..

[20]  Bruce Draper,et al.  Feature selection from huge feature sets in the context of computer vision , 2000 .

[21]  Lalitha Rangarajan,et al.  Dimensionality reduction of multidimensional temporal data through regression , 2004, Pattern Recognit. Lett..

[22]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[23]  Leslie S. Smith,et al.  Feature subset selection in large dimensionality domains , 2010, Pattern Recognit..

[24]  David D. Pollard,et al.  How to calculate normal curvatures of sampled geological surfaces , 2003 .

[25]  Robert P. W. Duin,et al.  Feature Scaling in Support Vector Data Descriptions , 2000 .

[26]  Jordi Vitrià,et al.  On the Selection and Classification of Independent Features , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  G. Churchill,et al.  Statistical design and the analysis of gene expression microarray data. , 2001, Genetical research.

[28]  Yanxi Liu,et al.  A classification based similarity metric for 3D image retrieval , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[29]  Yanxi Liu,et al.  Robust midsagittal plane extraction from normal and pathological 3-D neuroradiology images , 2001, IEEE Transactions on Medical Imaging.

[30]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[31]  Ch Chen,et al.  Pattern recognition and artificial intelligence , 1976 .

[32]  M. Matsui,et al.  High-throughput characterization of plant gene functions by using gain-of-function technology. , 2010, Annual review of plant biology.

[33]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[34]  Roni Khardon,et al.  Kernel methods and their application to structured data , 2009 .

[35]  Hichem Sahbi,et al.  Kernel PCA for similarity invariant shape recognition , 2007, Neurocomputing.

[36]  Sung-Bae Cho,et al.  Efficient huge-scale feature selection with speciated genetic algorithm , 2005 .

[37]  Yanxi Li,et al.  Truly 3D midsagittal plane extraction for robust neuroimage registration , 2006, 3rd IEEE International Symposium on Biomedical Imaging: Nano to Macro, 2006..

[38]  Anil K. Jain,et al.  Multimodal Facial Gender and Ethnicity Identification , 2006, ICB.

[39]  James C. Gee,et al.  Morphometric analysis of brain images with reduced number of statistical tests: A study on the gender-related differentiation of the corpus callosum , 2009, Artif. Intell. Medicine.

[40]  Ioannis A. Kakadiaris,et al.  Three-Dimensional Face Recognition in the Presence of Facial Expressions: An Annotated Deformable Model Approach , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.

[42]  Hamed Habibi Aghdam,et al.  Novel Framework for Selecting the Optimal Feature Vector from Large Feature Spaces , 2009, ICIAR.

[43]  Hao Zhang,et al.  Adapting Geometric Attributes for Expression-Invariant 3D Face Recognition , 2007, IEEE International Conference on Shape Modeling and Applications 2007 (SMI '07).

[44]  Joshua D. Schwartz,et al.  Hierarchical Matching of Deformable Shapes , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[45]  R. Adolphs Recognizing emotion from facial expressions: psychological and neurological mechanisms. , 2002, Behavioral and cognitive neuroscience reviews.

[46]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[47]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[48]  James M. Rehg,et al.  Where am I: Place instance and category recognition using spatial PACT , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[49]  Maya R. Gupta,et al.  Bayesian Quadratic Discriminant Analysis , 2007, J. Mach. Learn. Res..

[50]  Li Wei,et al.  Fast time series classification using numerosity reduction , 2006, ICML.

[51]  S. Cornish From atoms to molecules (and back) , 2008 .

[52]  C. Ding,et al.  Gene selection algorithm by combining reliefF and mRMR , 2007, 2007 IEEE 7th International Symposium on BioInformatics and BioEngineering.

[53]  Daphne Koller,et al.  Toward Optimal Feature Selection , 1996, ICML.

[54]  Haibin Ling,et al.  Shape Classification Using the Inner-Distance , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[55]  Henrik I. Christensen,et al.  Pattern Recognition in Practice IV: Multiple Paradigms, Comparative Studies and Hybrid Systems , 1994 .

[56]  Michèle Sebag,et al.  Data Streaming with Affinity Propagation , 2008, ECML/PKDD.

[57]  Jeremy Kubica,et al.  Parallel Large Scale Feature Selection for Logistic Regression , 2009, SDM.

[58]  A. Cañas Advances in Computer Vision and Image Processing.Volume 1, 1984, Image Reconstruction from Incomplete Observations , 1986 .

[59]  Dario Floreano,et al.  Active vision and feature selection in evolutionary behavioral systems , 2002 .

[60]  Michael G. Strintzis,et al.  3D facial expression recognition using swarm intelligence , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[61]  Sang Wook Lee,et al.  ICP Registration Using Invariant Features , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[62]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[63]  Michael G. Strintzis,et al.  Bilinear elastically deformable models with application to 3D face and facial expression recognition , 2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition.

[64]  Josef Kittler,et al.  Floating search methods for feature selection with nonmonotonic criterion functions , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[65]  Michael Thompson,et al.  Frontiers of Pattern Recognition , 1975 .

[66]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[67]  Duy-Dinh Le,et al.  Robust Object Detection using Fast Feature Selection from Huge Feature Sets , 2006, 2006 International Conference on Image Processing.

[68]  Amparo Alonso-Betanzos,et al.  Filter Methods for Feature Selection - A Comparative Study , 2007, IDEAL.

[69]  John E. Moody,et al.  The Effective Number of Parameters: An Analysis of Generalization and Regularization in Nonlinear Learning Systems , 1991, NIPS.

[70]  Zhuowen Tu,et al.  Learning Context-Sensitive Shape Similarity by Graph Transduction , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[72]  Yanxi Liu,et al.  Cervical Cancer Detection Using SVM Based Feature Screening , 2004, MICCAI.

[73]  Youping Deng,et al.  Feature Selection and Classification of MAQC-II Breast Cancer and Multiple Myeloma Microarray Gene Expression Data , 2009, PloS one.

[74]  程东峰,et al.  Wilson病与肝移植 , 2004 .

[75]  Hiroshi Motoda,et al.  Computational Methods of Feature Selection , 2022 .

[76]  Yanxi Liu,et al.  Expression Classification using Wavelet Packet Method on Asymmetry Faces , 2006 .

[77]  Yanxi Liu,et al.  DICOVERY OF "BIOMARKERS" FOR ALZHEIMER'S DISEASE PREDICTION FROM STRUCTURAL MR IMAGES , 2007, 2007 4th IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[78]  Gaile G. Gordon,et al.  Face recognition based on depth and curvature features , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[79]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[80]  Yanxi Liu,et al.  Quantified brain asymmetry for age estimation of normal and AD/MCI subjects , 2008, 2008 5th IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[81]  Avinash C. Kak,et al.  PCA versus LDA , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[82]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[83]  Michael I. Jordan,et al.  Supervised learning from incomplete data via an EM approach , 1993, NIPS.

[84]  Andrea J. van Doorn,et al.  Surface shape and curvature scales , 1992, Image Vis. Comput..

[85]  E. M. Wright,et al.  Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.

[86]  Ioannis A. Kakadiaris,et al.  Evaluation of 3D Face Recognition in the presence of facial expressions: an Annotated Deformable Model approach , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[87]  R. C. Grimsdale Automatic Interpretation and Classification of Images, A. Grasselli (Ed.). Academic Press, New York (1969), 436 pp. $14.00. , 1972 .

[88]  Adnan A. Y. Mustafa Fuzzy shape matching with boundary signatures , 2002, Pattern Recognit. Lett..

[89]  Guorong Xuan,et al.  Feature Selection based on the Bhattacharyya Distance , 2006, ICPR.

[90]  Yanxi Liu,et al.  SVM decision boundary based discriminative subspace induction , 2005, Pattern Recognit..

[91]  A J O'Toole,et al.  More about the Difference between Men and Women: Evidence from Linear Neural Networks and the Principal-Component Approach , 1995, Perception.

[92]  Patrick J. Flynn,et al.  Multiple Nose Region Matching for 3D Face Recognition under Varying Facial Expression , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[93]  Jun Wang,et al.  A 3D facial expression database for facial behavior research , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[94]  Bruce A. Draper,et al.  Feature selection from huge feature sets , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[95]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[96]  Terrence J. Sejnowski,et al.  A Perceptron Reveals the Face of Sex , 1995, Neural Computation.

[97]  Nuno Vasconcelos,et al.  A Kullback-Leibler Divergence Based Kernel for SVM Classification in Multimedia Applications , 2003, NIPS.

[98]  Pavlos Protopapas,et al.  Kernels for Periodic Time Series Arising in Astronomy , 2009, ECML/PKDD.

[99]  Haibin Ling,et al.  Using the inner-distance for classification of articulated shapes , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[100]  Alan C. Evans,et al.  An MRI-based stereotactic atlas from 250 young normal subjects , 1992 .

[101]  A. M. Burton,et al.  Sex Discrimination: How Do We Tell the Difference between Male and Female Faces? , 1993, Perception.

[102]  Azriel Rosenfeld,et al.  Computer Methods in Image Analysis , 1977 .

[103]  Leslie A. Zebrowitz,et al.  Do facial averageness and symmetry signal health? , 2001, Evolution and human behavior : official journal of the Human Behavior and Evolution Society.

[104]  Oskar Söderkvist,et al.  Computer Vision Classification of Leaves from Swedish Trees , 2001 .

[105]  Yonghong Peng,et al.  A novel feature selection approach for biomedical data classification , 2010, J. Biomed. Informatics.

[106]  Jun Wang,et al.  3D Facial Expression Recognition Based on Primitive Surface Feature Distribution , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[107]  Takeo Kanade,et al.  Classification-Driven Pathological Neuroimage Retrieval Using Statistical Asymmetry Measures , 2001, MICCAI.

[108]  Adam C. Winstanley,et al.  Invariant optimal feature selection: A distance discriminant and feature ranking based solution , 2008, Pattern Recognit..

[109]  A. M. Burton,et al.  What's the Difference between Men and Women? Evidence from Facial Measurement , 1993, Perception.

[110]  L. Farkas,et al.  Facial Asymmetry in Healthy North American Caucasians , 2009 .

[111]  A. O'Toole,et al.  Sex Classification is Better with Three-Dimensional Head Structure Than with Image Intensity Information , 1997, Perception.

[112]  Yanxi Liu,et al.  A quantified study of facial asymmetry in 3D faces , 2003, 2003 IEEE International SOI Conference. Proceedings (Cat. No.03CH37443).

[113]  N. Tzourio-Mazoyer,et al.  Automated Anatomical Labeling of Activations in SPM Using a Macroscopic Anatomical Parcellation of the MNI MRI Single-Subject Brain , 2002, NeuroImage.

[114]  Nikolaus F. Troje,et al.  How is bilateral symmetry of human faces used for recognition of novel views? , 1998, Vision Research.

[115]  J. Kittler,et al.  Feature Set Search Alborithms , 1978 .

[116]  Rafael C. González,et al.  Local Determination of a Moving Contrast Edge , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[117]  Stuart Jefferies,et al.  Computing and telescopes at the frontiers of optical astronomy , 2003, Comput. Sci. Eng..

[118]  Gaile G. Gordon,et al.  Face recognition based on depth maps and surface curvature , 1991, Optics & Photonics.

[119]  John R. Kender,et al.  Feature selection in large dataset processing, especially in the video domain , 2005 .

[120]  Shutao Li,et al.  Gene Selection Using Wilcoxon Rank Sum Test and Support Vector Machine for Cancer Classification , 2007, CIS.

[121]  Thomas S. Huang,et al.  3D facial expression recognition based on automatically selected features , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[122]  Shenghuo Zhu,et al.  Using discriminant analysis for multi-class classification , 2003, Third IEEE International Conference on Data Mining.

[123]  C. Duan,et al.  Biosynthesis and Genetic Regulation of Proanthocyanidins in Plants , 2008, Molecules.