EndoDB: a database of endothelial cell transcriptomics data

Abstract Endothelial cells (ECs) line blood vessels, regulate homeostatic processes (blood flow, immune cell trafficking), but are also involved in many prevalent diseases. The increasing use of high-throughput technologies such as gene expression microarrays and (single cell) RNA sequencing generated a wealth of data on the molecular basis of EC (dys-)function. Extracting biological insight from these datasets is challenging for scientists who are not proficient in bioinformatics. To facilitate the re-use of publicly available EC transcriptomics data, we developed the endothelial database EndoDB, a web-accessible collection of expert curated, quality assured and pre-analyzed data collected from 360 datasets comprising a total of 4741 bulk and 5847 single cell endothelial transcriptomes from six different organisms. Unlike other added-value databases, EndoDB allows to easily retrieve and explore data of specific studies, determine under which conditions genes and pathways of interest are deregulated and assess reprogramming of metabolism via principal component analysis, differential gene expression analysis, gene set enrichment analysis, heatmaps and metabolic and transcription factor analysis, while single cell data are visualized as gene expression color-coded t-SNE plots. Plots and tables in EndoDB are customizable, downloadable and interactive. EndoDB is freely available at https://vibcancer.be/software-tools/endodb, and will be updated to include new studies.

[1]  P. Carmeliet,et al.  Endothelial Cell Metabolism in Health and Disease. , 2017, Trends in cell biology.

[2]  Sergio Contrino,et al.  ArrayExpress—a public repository for microarray gene expression data at the EBI , 2004, Nucleic Acids Res..

[3]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[4]  W. Gerald,et al.  Genes that mediate breast cancer metastasis to the brain , 2009, Nature.

[5]  W. Aird,et al.  Endothelial cell heterogeneity. , 2012, Cold Spring Harbor perspectives in medicine.

[6]  P. Carmeliet,et al.  Serine Synthesis via PHGDH Is Essential for Heme Production in Endothelial Cells. , 2018, Cell metabolism.

[7]  M. Ni,et al.  Single-Cell Transcriptome Analyses Reveal Endothelial Cell Heterogeneity in Tumors and Changes following Antiangiogenic Treatment. , 2018, Cancer research.

[8]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[9]  P. Carmeliet,et al.  Impairment of Angiogenesis by Fatty Acid Synthase Inhibition Involves mTOR Malonylation. , 2018, Cell metabolism.

[10]  P. Carmeliet,et al.  Molecular mechanisms and clinical applications of angiogenesis , 2011, Nature.

[11]  A. van Oudenaarden,et al.  Single-cell sequencing reveals dissociation-induced gene expression in tissue subpopulations , 2017, Nature Methods.

[12]  John T. Wei,et al.  Integrative molecular concept modeling of prostate cancer progression , 2007, Nature Genetics.

[13]  Davis J. McCarthy,et al.  A step-by-step workflow for low-level analysis of single-cell RNA-seq data with Bioconductor , 2016, F1000Research.

[14]  I. Hellmann,et al.  Comparative Analysis of Single-Cell RNA Sequencing Methods , 2016, bioRxiv.

[15]  Matthew E. Ritchie,et al.  limma powers differential expression analyses for RNA-sequencing and microarray studies , 2015, Nucleic acids research.

[16]  Dennis B. Troup,et al.  NCBI GEO: mining tens of millions of expression profiles—database and tools update , 2006, Nucleic Acids Res..

[17]  T. Barrette,et al.  ONCOMINE: a cancer microarray database and integrated data-mining platform. , 2004, Neoplasia.

[18]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[19]  P. Carmeliet,et al.  Inhibition of the Glycolytic Activator PFKFB3 in Endothelium Induces Tumor Vessel Normalization, Impairs Metastasis, and Improves Chemotherapy. , 2016, Cancer cell.

[20]  H. Sedlacek Pharmacological aspects of targeting cancer gene therapy to endothelial cells. , 2001, Critical reviews in oncology/hematology.

[21]  D. Maric,et al.  EphrinB2 controls vessel pruning through STAT1-JNK3 signaling , 2015, Nature Communications.

[22]  M. Roussel,et al.  Medulloblastoma Genotype Dictates Blood Brain Barrier Phenotype. , 2016, Cancer cell.

[23]  J. Tchinda,et al.  Recurrent fusion of TMPRSS2 and ETS transcription factor genes in prostate cancer. , 2006, Science.

[24]  Koji Ando,et al.  A molecular atlas of cell types and zonation in the brain vasculature , 2018, Nature.

[25]  Peter Olson,et al.  Cancer-Associated Fibroblasts Are Activated in Incipient Neoplasia to Orchestrate Tumor-Promoting Inflammation in an NF-kappaB-Dependent Manner. , 2010, Cancer cell.

[26]  S. Linnarsson,et al.  Single-cell genomics: coming of age , 2016, Genome Biology.

[27]  K Ham,et al.  OpenRefine (version 2.5). . Free, open-source tool for cleaning and transforming data. , 2013 .

[28]  P. Carmeliet,et al.  Fatty acid carbon is essential for dNTP synthesis in endothelial cells , 2015, Nature.

[29]  B. Singer,et al.  Flow-cytometric method for simultaneous analysis of mouse lung epithelial, endothelial, and hematopoietic lineage cells. , 2016, American journal of physiology. Lung cellular and molecular physiology.

[30]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[31]  A. Regev,et al.  Spatial reconstruction of single-cell gene expression data , 2015 .

[32]  Antoni Ribas,et al.  Single-cell analysis tools for drug discovery and development , 2015, Nature Reviews Drug Discovery.

[33]  V. Mootha,et al.  Metabolic enzyme expression highlights a key role for MTHFD2 and the mitochondrial folate pathway in cancer , 2014, Nature Communications.

[34]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[35]  W. Aird Endothelial cell heterogeneity , 2003, Critical care medicine.

[36]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[37]  Audrey Kauffmann,et al.  Bioinformatics Applications Note Arrayqualitymetrics—a Bioconductor Package for Quality Assessment of Microarray Data , 2022 .

[38]  A. Brazma,et al.  Reuse of public genome-wide gene expression data , 2012, Nature Reviews Genetics.

[39]  P. Carmeliet,et al.  Principles of targeting endothelial cell metabolism to treat angiogenesis and endothelial cell dysfunction in disease , 2014, EMBO molecular medicine.

[40]  P. Carmeliet,et al.  Phenotype molding of stromal cells in the lung tumor microenvironment , 2018, Nature Medicine.

[41]  Guangchuang Yu,et al.  clusterProfiler: an R package for comparing biological themes among gene clusters. , 2012, Omics : a journal of integrative biology.

[42]  P. Carmeliet,et al.  Incomplete and transitory decrease of glycolysis , 2014, Cell cycle.

[43]  Lior Pachter,et al.  Near-optimal probabilistic RNA-seq quantification , 2016, Nature Biotechnology.

[44]  D. Vitkup,et al.  Heterogeneity of tumor-induced gene expression changes in the human metabolic network , 2013, Nature Biotechnology.

[45]  Rhonda Bacher,et al.  Design and computational analysis of single-cell RNA-sequencing experiments , 2016, Genome Biology.

[46]  Jinghang Zhang,et al.  CCL2 recruits inflammatory monocytes to facilitate breast tumor metastasis , 2011, Nature.