Multiscale PHATE Exploration of SARS-CoV-2 Data Reveals Multimodal Signatures of Disease

The biomedical community is producing increasingly high dimensional datasets, integrated from hundreds of patient samples, which current computational techniques struggle to explore. To uncover biological meaning from these complex datasets, we present an approach called Multiscale PHATE, which learns abstracted biological features from data that can be directly predictive of disease. Built on a continuous coarse graining process called diffusion condensation, Multiscale PHATE creates a tree of data granularities that can be cut at coarse levels for high level summarizations of data, as well as at fine levels for detailed representations on subsets. We apply Multiscale PHATE to study the immune response to COVID-19 in 54 million cells from 168 hospitalized patients. Through our analysis of patient samples, we identify CD16hi CD66blo neutrophil and IFNγ+GranzymeB+ Th17 cell responses enriched in patients who die. Further, we show that population groupings Multiscale PHATE discovers can be directly fed into a classifier to predict disease outcome. We also use Multiscale PHATE-derived features to construct two different manifolds of patients, one from abstracted flow cytometry features and another directly on patient clinical features, both associating immune subsets and clinical markers with outcome.

[1]  Yan Zhou,et al.  Minimum Spanning Tree Based Clustering Algorithms , 2006, 2006 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'06).

[2]  Stéphane Lafon,et al.  Diffusion maps , 2006 .

[3]  M. Girardis,et al.  Expansion of plasmablasts and loss of memory B cells in peripheral blood from COVID-19 patients with pneumonia. , 2020, European journal of immunology.

[4]  R. Woods,et al.  Neutrophil extracellular traps in COVID-19. , 2020, JCI insight.

[5]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[6]  David van Dijk,et al.  Manifold learning-based methods for analyzing single-cell RNA-sequencing data , 2018 .

[7]  J. Bienvenu,et al.  The anti-inflammatory response dominates after septic shock: association of low monocyte HLA-DR expression and high interleukin-10 concentration. , 2004, Immunology letters.

[8]  Inkyung Jung,et al.  Immunophenotyping of COVID-19 and influenza highlights the role of type I interferons in development of severe COVID-19 , 2020, Science Immunology.

[9]  W. Chen,et al.  Critically ill SARS-CoV-2 patients display lupus-like hallmarks of extrafollicular B cell activation , 2020, medRxiv.

[10]  Mitchell R. Ladd,et al.  A Dynamic Variation of Pulmonary ACE2 Is Required to Modulate Neutrophilic Inflammation in Response to Pseudomonas aeruginosa Lung Infection in Mice , 2019, The Journal of Immunology.

[11]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008 .

[12]  Ronald R. Coifman,et al.  Visualizing structure and transitions in high-dimensional biological data , 2019, Nature Biotechnology.

[13]  David van Dijk,et al.  Compressed Diffusion , 2019, 2019 13th International conference on Sampling Theory and Applications (SampTA).

[14]  David van Dijk,et al.  Enhancing experimental signals in single-cell RNA-sequencing data using graph signal processing , 2019 .

[15]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[16]  A. Averbuch,et al.  Coarse-grained localized diffusion , 2012 .

[17]  Lai Guan Ng,et al.  Dimensionality reduction for visualizing single-cell data using UMAP , 2018, Nature Biotechnology.

[18]  Dirk Abel,et al.  Simulation physiologischer Regelkreise mit der objektorientierten Modellbibliothek “HumanLib” , 2011, Autom..

[19]  Mike Clarke,et al.  A minimal common outcome measure set for COVID-19 clinical research , 2020, The Lancet Infectious Diseases.

[20]  T. Kishimoto Factors affecting B-cell growth and differentiation. , 1985, Annual review of immunology.

[21]  W. Greene,et al.  SARS-CoV-2-Specific T Cells Exhibit Phenotypic Features of Helper Function, Lack of Terminal Differentiation, and High Proliferation Potential , 2020, Cell Reports Medicine.

[22]  A. Sattler,et al.  SARS-CoV-2 specific T-cell responses and correlations with COVID-19 patient predisposition. , 2020, The Journal of clinical investigation.

[23]  M. Netea,et al.  The immunopathology of sepsis and potential therapeutic targets , 2017, Nature Reviews Immunology.

[24]  Huy Q. Dinh,et al.  Interplay of Monocytes and T Lymphocytes in COVID-19 Severity , 2020, bioRxiv.

[25]  N. Friedman,et al.  Aging promotes reorganization of the CD4 T cell landscape toward extreme regulatory and effector phenotypes , 2019, Science Advances.

[26]  L. Fouser,et al.  An IL-17F/A Heterodimer Protein Is Produced by Mouse Th17 Cells and Induces Airway Neutrophil Recruitment , 2007, The Journal of Immunology.

[27]  O. Ornatsky,et al.  Mass cytometry: technique for real time single cell multitarget immunoassay based on inductively coupled plasma time-of-flight mass spectrometry. , 2009, Analytical chemistry.

[28]  Leo Koenderman,et al.  A subset of neutrophils in human systemic inflammation inhibits T cell responses through Mac-1. , 2012, The Journal of clinical investigation.

[29]  John D Lambris,et al.  Complement and tissue factor-enriched neutrophil extracellular traps are key drivers in COVID-19 immunothrombosis , 2020, medRxiv.

[30]  Jie Dong,et al.  Heightened Innate Immune Responses in the Respiratory Tract of COVID-19 Patients , 2020, Cell Host & Microbe.

[31]  Robin Sibson,et al.  SLINK: An Optimally Efficient Algorithm for the Single-Link Cluster Method , 1973, Comput. J..

[32]  G. Nolan,et al.  Mass Cytometry: Single Cells, Many Features , 2016, Cell.

[33]  Zeyu Chen,et al.  T cell responses in patients with COVID-19 , 2020, Nature Reviews Immunology.

[34]  Sean C. Bendall,et al.  Comprehensive Immune Monitoring of Clinical Trials to Advance Human Immunotherapy , 2018, bioRxiv.

[35]  A. Regev,et al.  Induction and molecular signature of pathogenic TH17 cells , 2012, Nature Immunology.

[36]  Howard Y. Chang,et al.  Single-cell chromatin accessibility reveals principles of regulatory variation , 2015, Nature.

[37]  A. Oshlack,et al.  Splatter: simulation of single-cell RNA sequencing data , 2017, Genome Biology.

[38]  S. Farhadian,et al.  Sex differences in immune responses that underlie COVID-19 disease outcomes , 2020, Nature.

[39]  P. Linton,et al.  Age-related changes in lymphocyte development and function , 2004, Nature Immunology.

[40]  David van Dijk,et al.  TrajectoryNet: A Dynamic Optimal Transport Network for Modeling Cellular Dynamics , 2020, ICML.

[41]  Katie Chan,et al.  Interleukin-17 stimulates the expression of interleukin-8, growth-related oncogene-alpha, and granulocyte-colony-stimulating factor by human airway epithelial cells. , 2002, American journal of respiratory cell and molecular biology.

[42]  Jincun Zhao,et al.  Virus-Specific Memory CD8 T Cells Provide Substantial Protection from Lethal Severe Acute Respiratory Syndrome Coronavirus Infection , 2014, Journal of Virology.

[43]  J. Soriano,et al.  COVID-19 severity associates with pulmonary redistribution of CD1c+ DC and inflammatory transitional and nonclassical monocytes. , 2020, The Journal of clinical investigation.

[44]  Evan Z. Macosko,et al.  Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets , 2015, Cell.

[45]  Gil David,et al.  Hierarchical data organization , clustering and denoising via localized diffusion folders , 2011 .

[46]  Allon M. Klein,et al.  Droplet Barcoding for Single-Cell Transcriptomics Applied to Embryonic Stem Cells , 2015, Cell.

[47]  Eric Song,et al.  Longitudinal analyses reveal immunological misfiring in severe COVID-19 , 2020, Nature.

[48]  J. Mason,et al.  A dynamic COVID-19 immune signature includes associations with poor prognosis , 2020, Nature Medicine.

[49]  E. Wherry,et al.  Cutting Edge: Rapid In Vivo Killing by Memory CD8 T Cells1 , 2003, The Journal of Immunology.

[50]  Kevin R. Moon,et al.  Recovering Gene Interactions from Single-Cell Data Using Data Diffusion , 2018, Cell.

[51]  R. Newton,et al.  ADAM17 cleaves CD16b (FcγRIIIb) in human neutrophils. , 2013, Biochimica et biophysica acta.

[52]  E. Jabłońska,et al.  Heterogeneity Among Neutrophils , 2017, Archivum Immunologiae et Therapiae Experimentalis.

[53]  Howard M. Shapiro Learning Flow Cytometry , 2005 .

[54]  A. Ganser,et al.  Reappearance of effector T cells is associated with recovery from COVID-19 , 2020, EBioMedicine.

[55]  Sasikanth Manne,et al.  Deep immune profiling of COVID-19 patients reveals distinct immunotypes with therapeutic implications , 2020, Science.

[56]  E. Schwedhelm,et al.  Human leucocyte antigen (HLA-DR) gene expression is reduced in sepsis and correlates with impaired TNFα response: A diagnostic tool for immunosuppression? , 2017, PloS one.

[57]  Rebecca Liu,et al.  IL-17 Promotes Neutrophil-Mediated Immunity by Activating Microvascular Pericytes and Not Endothelium , 2016, The Journal of Immunology.

[58]  Sergio M. Savaresi,et al.  On the performance of bisecting K-means and PDDP , 2001, SDM.

[59]  Amir Averbuch,et al.  Diffusion-based kernel methods on Euclidean metric measure spaces , 2016 .

[60]  David Duvenaud,et al.  Neural Ordinary Differential Equations , 2018, NeurIPS.

[61]  Kenji Fukumizu,et al.  Tree-Sliced Variants of Wasserstein Distances , 2019, NeurIPS.

[62]  S. Cowley,et al.  MAIT cells promote inflammatory monocyte differentiation into dendritic cells during pulmonary intracellular infection , 2016, The Journal of experimental medicine.

[63]  J. Knight,et al.  Longitudinal COVID-19 profiling associates IL-1Ra and IL-10 with disease severity and RANTES with mild disease. , 2020, JCI insight.

[64]  William S. DeWitt,et al.  A Single-Cell Atlas of In Vivo Mammalian Chromatin Accessibility , 2018, Cell.

[65]  Burkhard Becher,et al.  Development, application and computational analysis of high-dimensional fluorescent antibody panels for single-cell flow cytometry , 2019, Nature Protocols.

[66]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[67]  V. Pettilä,et al.  PREDICTIVE VALUE OF MONOCYTE HISTOCOMPATIBILITY LEUKOCYTE ANTIGEN‐DR EXPRESSION AND PLASMA INTERLEUKIN‐4 AND ‐10 LEVELS IN CRITICALLY ILL PATIENTS WITH SEPSIS , 2003, Shock.

[68]  Jennifer N. Dines,et al.  A large-scale database of T-cell receptor beta (TCRβ) sequences and binding associations from natural and synthetic exposure to SARS-CoV-2 , 2020, Research square.

[69]  S. Farhadian,et al.  Clinical characteristics and outcomes for 7,995 patients with SARS-CoV-2 infection , 2020, medRxiv.

[70]  Lin Cheng,et al.  Single-cell landscape of bronchoalveolar immune cells in patients with COVID-19 , 2020, Nature Medicine.

[71]  B A Askonas,et al.  Cytotoxic T cells clear virus but augment lung pathology in mice infected with respiratory syncytial virus , 1988, The Journal of experimental medicine.

[72]  D. Kioussis,et al.  Contribution of  Virus-specific CD8+ Cytotoxic T Cells to Virus Clearance or Pathologic Manifestations of Influenza Virus Infection in a T Cell Receptor Transgenic Mouse Model , 1998, The Journal of experimental medicine.

[73]  S. Klein,et al.  Sex differences in immune responses , 2016, Nature Reviews Immunology.

[74]  Matthew J. Hirn,et al.  Time Coupled Diffusion Maps , 2016, Applied and Computational Harmonic Analysis.

[75]  J. Alcorn,et al.  Influenza A Inhibits Th17-Mediated Host Defense against Bacterial Pneumonia in Mice , 2011, The Journal of Immunology.

[76]  Vincent A. Traag,et al.  From Louvain to Leiden: guaranteeing well-connected communities , 2018, Scientific Reports.

[77]  P. G. Choe,et al.  Aberrant hyperactivation of cytotoxic T-cell as a potential determinant of COVID-19 severity , 2020, International Journal of Infectious Diseases.

[78]  C. Rice,et al.  Convergent Antibody Responses to SARS-CoV-2 in Convalescent Individuals , 2020, Nature.

[79]  I. Amit,et al.  Host-Viral Infection Maps Reveal Signatures of Severe COVID-19 Patients , 2020, Cell.

[80]  R. Lutter,et al.  Activation of the Granzyme Pathway in Children With Severe Respiratory Syncytial Virus Infection , 2008, Pediatric Research.

[81]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[82]  Sean C. Bendall,et al.  Conditional density-based analysis of T cell signaling in single-cell data , 2014, Science.

[83]  Jamie K. Scott,et al.  iReceptor: A platform for querying and analyzing antibody/B‐cell and T‐cell receptor repertoire data across federated repositories , 2018, Immunological reviews.

[84]  B. Richardson,et al.  Stronger inflammatory/cytotoxic T cell response in women identified by microarray analysis , 2008, Genes and Immunity.

[85]  David van Dijk,et al.  Coarse Graining of Data via Inhomogeneous Diffusion Condensation , 2019, 2019 IEEE International Conference on Big Data (Big Data).

[86]  D. Wagner,et al.  Neutrophil extracellular traps , 2013, Oncoimmunology.

[87]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[88]  David van Dijk,et al.  Uncovering axes of variation among single-cell cancer specimens , 2020, Nature Methods.

[89]  Amir Averbuch,et al.  Cover-based bounds on the numerical rank of Gaussian kernels , 2014 .

[90]  K. Bhaskaran,et al.  Factors associated with COVID-19-related death using OpenSAFELY , 2020, Nature.

[91]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[92]  Adam S. Charles,et al.  Visualizing the PHATE of Neural Networks , 2019, NeurIPS.

[93]  Robert A. Campbell,et al.  Neutrophil extracellular traps contribute to immunothrombosis in COVID-19 acute respiratory distress syndrome , 2020, Blood.

[94]  R. Coifman,et al.  Hölder–Lipschitz Norms and Their Duals on Spaces with Semigroups, with Applications to Earth Mover’s Distance , 2016 .

[95]  Morten Nielsen,et al.  Robust T Cell Immunity in Convalescent Individuals with Asymptomatic or Mild COVID-19 , 2020, Cell.

[96]  L. Koenderman,et al.  Human neutrophils switch to an activated phenotype after homing to the lung irrespective of inflammatory disease , 2009, Clinical and experimental immunology.

[97]  C. Macaubas,et al.  The MHC class II antigen presentation pathway in human monocytes differs by subset and is regulated by cytokines , 2017, PloS one.