Machine Learning Methods for Medical and Biological Image Computing

Rongjian Li, Old Dominion University, 2016. Director: Dr. Shuiwang Ji

Medical and biological imaging technologies provide valuable visualizations of the structure and function of an organ, from the level of individual molecules to the whole organ. The brain is the most complex organ in the body, and it attracts increasingly intense research attention as medical and biological imaging technologies develop rapidly. The massive amount of high-dimensional brain imaging data now being generated creates strong demand for computational methods that analyze these images efficiently. Current computational methods based on hand-crafted features do not scale with the growing number of brain images, which slows the pace of scientific discovery in neuroscience. In this thesis, I propose computational methods that use high-level features for automated analysis of brain images at different levels. At the brain function level, I develop a deep learning based framework for completing and integrating multi-modality neuroimaging data, which increases the diagnostic accuracy for Alzheimer’s disease. At the cellular level, I propose to use three-dimensional convolutional neural networks (CNNs) for segmenting volumetric neuronal images, which improves the performance of digital reconstruction of neuron structures. I design a novel CNN architecture such that model training and test image prediction can be implemented in an end-to-end manner. At the molecular level, I build a voxel CNN classifier to capture discriminative features of the input along three spatial dimensions, which facilitates the identification of secondary structures of proteins from electron microscopy images.
To classify genes specifically expressed in different brain cell types, I propose to use invariant image feature descriptors to capture local gene expression information from cellular-resolution in situ hybridization images. I build image-level representations by applying regularized learning and vector quantization to the generated image descriptors. The computational methods developed in this dissertation are evaluated on images from medical and biological experiments and compared with baseline methods. Experimental results demonstrate that the developed representations, formulations, and algorithms are effective and efficient in learning from brain imaging data.
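
As a rough illustration of the vector quantization step mentioned above, the sketch below builds a codebook from local descriptors with plain k-means and encodes one image as a normalized histogram of codeword counts (a bag-of-words representation). It is a minimal sketch under stated assumptions, not the dissertation's implementation: the function names, the codebook size, and the random vectors standing in for real image descriptors are all hypothetical, and the regularized learning step is not shown.

```python
import numpy as np

def assign(descriptors, centers):
    """Index of the nearest codeword (squared Euclidean distance) per descriptor."""
    d2 = ((descriptors[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def build_codebook(descriptors, k, n_iter=20, seed=0):
    """Plain k-means (Lloyd's algorithm); a stand-in for any codebook learner."""
    descriptors = np.asarray(descriptors, dtype=float)
    rng = np.random.default_rng(seed)
    # Initialize the codewords with k distinct descriptors chosen at random.
    centers = descriptors[rng.choice(len(descriptors), size=k, replace=False)].copy()
    for _ in range(n_iter):
        labels = assign(descriptors, centers)
        for j in range(k):
            members = descriptors[labels == j]
            if len(members) > 0:  # keep the old center if a cluster is empty
                centers[j] = members.mean(axis=0)
    return centers

def encode_image(descriptors, centers):
    """Image-level representation: normalized histogram of codeword counts."""
    labels = assign(np.asarray(descriptors, dtype=float), centers)
    hist = np.bincount(labels, minlength=len(centers)).astype(float)
    return hist / hist.sum()

# Hypothetical data: random 128-dimensional vectors in place of real descriptors.
rng = np.random.default_rng(1)
training_descriptors = rng.normal(size=(500, 128))
image_descriptors = rng.normal(size=(60, 128))

codebook = build_codebook(training_descriptors, k=8)
hist = encode_image(image_descriptors, codebook)
print(hist.shape)
```

The resulting fixed-length histogram can be passed to any standard classifier regardless of how many local descriptors each image produces, which is the usual motivation for quantizing descriptors before learning.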

[46]  Andrew Zisserman,et al.  Image Classification using Random Forests and Ferns , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[47]  Chih-Jen Lin,et al.  A Comparison of Optimization Methods and Software for Large-scale L1-regularized Linear Classification , 2010, J. Mach. Learn. Res..

[48]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[49]  Shuiwang Ji Computational genetic neuroanatomy of the developing mouse brain: dimensionality reduction, visualization, and clustering , 2013, BMC Bioinformatics.

[50]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Robust Semantic Pixel-Wise Labelling , 2015, CVPR 2015.

[51]  W. Walz Role of astrocytes in the clearance of excess extracellular potassium , 2000, Neurochemistry International.

[52]  Chia-Hua Ho,et al.  Recent Advances of Large-Scale Linear Classification , 2012, Proceedings of the IEEE.

[53]  Zhi Zhou,et al.  TReMAP: Automatic 3D Neuron Reconstruction Based on Tracing, Reverse Mapping and Assembling of 2D Projections , 2015, Neuroinformatics.

[54]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[55]  Cordelia Schmid,et al.  A performance evaluation of local descriptors , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[56]  Iasonas Kokkinos,et al.  Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs , 2014, ICLR.

[57]  Y. LeCun,et al.  Learning methods for generic object recognition with invariance to pose and lighting , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[58]  Brian B. Avants,et al.  Neuroinformatics for Genome-Wide 3-D Gene Expression Mapping in the Mouse Brain , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[59]  S. A. Khonsary,et al.  THE BRAIN, An Introduction to Functional Neuroanatomy , 2017, Surgical Neurology International.

[60]  Shuiwang Ji,et al.  Deep convolutional neural networks for multi-modality isointense infant brain image segmentation , 2015, NeuroImage.

[61]  Hanchuan Peng,et al.  V3D enables real-time 3D visualization and quantitative analysis of large-scale biological image data sets , 2010, Nature Biotechnology.

[62]  Paul Tseng,et al.  Trace Norm Regularization: Reformulations, Algorithms, and Multi-Task Learning , 2010, SIAM J. Optim..

[63]  Ming Yang,et al.  3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[64]  Paul Pavlidis,et al.  Neuron-Enriched Gene Expression Patterns are Regionally Anti-Correlated with Oligodendrocyte-Enriched Patterns in the Adult Mouse and Human Brain , 2013, Front. Neurosci..

[65]  H. Sebastian Seung,et al.  Natural Image Denoising with Convolutional Networks , 2008, NIPS.

[66]  Andrea Vedaldi,et al.  Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.

[67]  Hao Chen,et al.  Deep Contextual Networks for Neuronal Structure Segmentation , 2016, AAAI.

[68]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[69]  Hans Burkhardt,et al.  RENNSH: A Novel \alpha-Helix Identification Approach for Intermediate Resolution Electron Density Maps , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[70]  Shuiwang Ji,et al.  Integrative analysis of the connectivity and gene expression atlases in the mouse brain , 2014, NeuroImage.

[71]  Hanchuan Peng,et al.  A distance-field based automatic neuron tracing method , 2013, BMC Bioinformatics.

[72]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[73]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[74]  N. Meinshausen,et al.  Stability selection , 2008, 0809.2932.

[75]  Hanchuan Peng,et al.  Automated image computing reshapes computational neuroscience , 2013, BMC Bioinformatics.

[76]  Partha P. Mitra,et al.  Computational neuroanatomy and gene expression: Optimal sets of marker genes for brain regions , 2012, 2012 46th Annual Conference on Information Sciences and Systems (CISS).

[77]  Vladlen Koltun,et al.  Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[78]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[79]  Jieping Ye,et al.  Multi-Task Feature Learning Via Efficient l2, 1-Norm Minimization , 2009, UAI.

[80]  Matthew de Brecht,et al.  Combining sparseness and smoothness improves classification accuracy and interpretability , 2012, NeuroImage.

[81]  Zeyun Yu,et al.  Computational Approaches for Automatic Structural Analysis of Large Biomolecular Complexes , 2008, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[82]  Allan R. Jones,et al.  An anatomic gene expression atlas of the adult mouse brain , 2009, Nature Neuroscience.

[83]  Jieping Ye,et al.  Deep convolutional neural networks for annotating gene expression patterns in the mouse brain , 2015, BMC Bioinformatics.

[84]  Fei-Fei Li,et al.  Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[85]  Dong-Hua Chen,et al.  De novo backbone trace of GroEL from single particle electron cryomicroscopy. , 2008, Structure.

[86]  Kent A. Spackman,et al.  Signal Detection Theory: Valuable Tools for Evaluating Inductive Learning , 1989, ML.

[87]  Hanchuan Peng,et al.  Automatic reconstruction of 3D neuron structures using a graph-augmented deformable model , 2010, Bioinform..

[88]  Tianming Liu,et al.  SmartTracing: self-learning-based Neuron reconstruction , 2015, Brain Informatics.

[89]  H. Sompolinsky,et al.  Compressed sensing, sparsity, and dimensionality in neuronal information processing and data analysis. , 2012, Annual review of neuroscience.

[90]  Y. Xing,et al.  A Transcriptome Database for Astrocytes, Neurons, and Oligodendrocytes: A New Resource for Understanding Brain Development and Function , 2008, The Journal of Neuroscience.

[91]  Dinggang Shen,et al.  Deep Learning Based Imaging Data Completion for Improved Brain Disease Diagnosis , 2014, MICCAI.

[92]  D. Hubel,et al.  Receptive fields and functional architecture of monkey striate cortex , 1968, The Journal of physiology.

[93]  Frédéric Jurie,et al.  Sampling Strategies for Bag-of-Features Image Classification , 2006, ECCV.

[94]  Leon French,et al.  Large-Scale Analysis of Gene Expression and Connectivity in the Rodent Brain: Insights through Data Integration , 2011, Front. Neuroinform..

[95]  Hai Su,et al.  Automatic Ki-67 Counting Using Robust Cell Detection and Online Dictionary Learning , 2014, IEEE Transactions on Biomedical Engineering.