Spatially aware clustering of ion images in mass spectrometry imaging data using deep learning

Computational analysis is crucial to capitalize on the wealth of spatio-molecular information generated by mass spectrometry imaging (MSI) experiments. Currently, the spatial information available in MSI data is often under-utilized because in-depth spatial pattern extraction is challenging. The advent of deep learning has greatly facilitated such complex spatial analysis. In this work, we use a pre-trained neural network to extract high-level features from ion images in MSI data, and test whether this improves downstream data analysis. The resulting neural network interpretations of ion images, coined neural ion images, are used to cluster ion images based on their spatial expression. We evaluate the impact of neural ion images on two ion image clustering pipelines: DBSCAN clustering combined with UMAP-based dimensionality reduction, and k-means clustering. In both pipelines, we compare regular and neural ion images from two different MSI datasets. All tested pipelines could extract underlying spatial patterns, but the neural network-based pipelines provided better assignment of ion images, with more fine-grained clusters and greater consistency in the spatial structures assigned to individual clusters. Additionally, we introduce the Relative Isotope Ratio metric to quantitatively evaluate clustering quality. The resulting scores show that isotopic m/z values are more often clustered together in the neural network-based pipeline, indicating improved clustering outcomes. The usefulness of neural ion images extends beyond clustering towards a generic framework for incorporating spatial information into any MSI-focused machine learning pipeline, both supervised and unsupervised.
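The idea behind an isotope-based quality check can be illustrated with a small sketch. The abstract does not define the Relative Isotope Ratio metric, so the code below is not that metric. It is a simplified, hypothetical stand-in that only computes the fraction of candidate isotope pairs that fall in the same cluster. The function names, the 1.00335 Da spacing (singly charged ions assumed), the tolerance, and the toy m/z values and labels are all assumptions made for illustration.

```python
from itertools import combinations

# Assumption: approximate 13C - 12C mass difference in Da, singly charged ions.
C13_SPACING = 1.00335


def isotope_pairs(mz_values, tol=0.01):
    """Return index pairs whose m/z values differ by about one isotope spacing."""
    return [
        (i, j)
        for i, j in combinations(range(len(mz_values)), 2)
        if abs(abs(mz_values[i] - mz_values[j]) - C13_SPACING) <= tol
    ]


def isotope_coclustering_fraction(mz_values, labels, tol=0.01):
    """Fraction of candidate isotope pairs assigned to the same cluster.

    Hypothetical helper, not the paper's Relative Isotope Ratio.
    A label of -1 (the DBSCAN noise convention) never counts as co-clustered.
    Returns None when no candidate pairs are found.
    """
    pairs = isotope_pairs(mz_values, tol)
    if not pairs:
        return None
    same = sum(1 for i, j in pairs if labels[i] == labels[j] and labels[i] != -1)
    return same / len(pairs)


# Toy, made-up m/z values: two monoisotopic/isotope pairs plus one unrelated ion.
mz = [760.585, 761.588, 782.567, 783.570, 810.601]
labels_a = [0, 0, 1, 1, 2]  # both isotope pairs share a cluster
labels_b = [0, 1, 1, 2, 2]  # both isotope pairs are split

score_a = isotope_coclustering_fraction(mz, labels_a)
score_b = isotope_coclustering_fraction(mz, labels_b)
```

Because an isotope peak and its monoisotopic peak originate from the same molecule, their ion images should share a spatial distribution, so a clustering that keeps more such pairs together is plausibly the better one. A usable metric would also need to correct for the chance of two ions sharing a cluster at random, which this sketch does not attempt.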
