Modeling multifunctionality of genes with secondary gene co-expression networks in human brain provides novel disease insights

Motivation Co-expression networks are a powerful gene expression analysis method to study how genes co-express together in clusters with functional coherence that usually resemble specific cell type behaviour for the genes involved. They can be applied to bulk-tissue gene expression profiling and assign function, and usually cell type specificity, to a high percentage of the gene pool used to construct the network. One of the limitations of this method is that each gene is predicted to play a role in a specific set of coherent functions in a single cell type (i.e. at most we get a single for each gene). We present here GMSCA (Gene Multifunctionality Secondary Co-expression Analysis), a software tool that exploits the co-expression paradigm to increase the number of functions and cell types ascribed to a gene in bulk-tissue co-expression networks. Results We applied GMSCA to 27 co-expression networks derived from bulk-tissue gene expression profiling of a variety of brain tissues. Neurons and glial cells (microglia, astrocytes and oligodendrocytes) were considered the main cell types. Applying this approach, we increase the overall number of predicted triplets by 46.73%. Moreover, GMSCA predicts that the SNCA gene, traditionally associated to work mainly in neurons, also plays a relevant function in oligodendrocytes. Availability The tool is available at GitHub,https://github.com/drlaguna/GMSCA as open source software. Implementation GSMCA is implemented in R.

[1]  Juan A. Botía,et al.  CoExp Web, a web tool for the exploitation of co-expression networks , 2020, bioRxiv.

[2]  Esti Yeger-Lotem,et al.  Mechanisms of tissue and cell-type specificity in heritable traits and diseases , 2020, Nature Reviews Genetics.

[3]  Aman Aggarwal,et al.  A locomotor assay reveals deficits in heterozygous Parkinson’s disease model and proprioceptive mutants in adult Drosophila , 2019, Proceedings of the National Academy of Sciences.

[4]  Fei Zou,et al.  SCDC: bulk gene expression deconvolution by multiple single-cell RNA sequencing references , 2019, bioRxiv.

[5]  Guo-Cheng Yuan,et al.  Accurate estimation of cell-type composition from gene expression data , 2019, Nature Communications.

[6]  Manolis Kellis,et al.  Single-cell transcriptomic analysis of Alzheimer’s disease , 2019, Nature.

[7]  Ash A. Alizadeh,et al.  Determining cell-type abundance and expression from bulk tissues with digital cytometry , 2019, Nature Biotechnology.

[8]  Hunna J. Watson,et al.  Genetic Identification of Cell Types Underlying Brain Complex Traits Yields Novel Insights Into the Etiology of Parkinson’s Disease , 2019, bioRxiv.

[9]  R. Cappai,et al.  Amyloid precursor protein and amyloid precursor‐like protein 2 have distinct roles in modulating myelination, demyelination, and remyelination of axons , 2018, Glia.

[10]  Tudor Groza,et al.  Expansion of the Human Phenotype Ontology (HPO) knowledge base and resources , 2018, Nucleic Acids Res..

[11]  Sonja W. Scholz,et al.  Moving beyond neurons: the role of cell type-specific gene regulation in Parkinson’s disease heritability , 2019, npj Parkinson's Disease.

[12]  R. Irizarry,et al.  Missing data and technical variability in single‐cell RNA‐sequencing experiments , 2018, Biostatistics.

[13]  Nancy R. Zhang,et al.  Bulk tissue cell type deconvolution with multi-subject single-cell expression reference , 2018, Nature Communications.

[14]  Lars E. Borm,et al.  Molecular Architecture of the Mouse Nervous System , 2018, Cell.

[15]  João Pedro de Magalhães,et al.  Gene co-expression analysis for functional classification and gene–disease predictions , 2017, Briefings Bioinform..

[16]  K. Roeder,et al.  A UNIFIED STATISTICAL FRAMEWORK FOR SINGLE CELL AND BULK RNA SEQUENCING DATA. , 2016, The annals of applied statistics.

[17]  S. Teichmann,et al.  A practical guide to single-cell RNA-sequencing for biomedical research and clinical applications , 2017, Genome Medicine.

[18]  J. C. Love,et al.  Cell-type Dependent Alzheimer's Disease Phenotypes: Probing the Biology of Selective Neuronal Vulnerability , 2017, Alzheimer's & Dementia.

[19]  John Hardy,et al.  An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks , 2017, BMC Systems Biology.

[20]  Samuel L. Wolock,et al.  A Single-Cell Transcriptomic Map of the Human and Mouse Pancreas Reveals Inter- and Intra-cell Population Structure. , 2016, Cell systems.

[21]  Seth G. N. Grant,et al.  Identification of Vulnerable Cell Types in Major Brain Disorders Using Single Cell Transcriptomes and Expression Weighted Cell Type Enrichment , 2016, Front. Neurosci..

[22]  E. Chang,et al.  Purification and Characterization of Progenitor and Mature Human Astrocytes Reveals Transcriptional and Functional Differences with Mouse , 2016, Neuron.

[23]  R. Reynolds,et al.  Neuronal activity regulates remyelination via glutamate signalling to oligodendrocyte progenitors , 2015, Nature Communications.

[24]  C. Margulies,et al.  Designing Cell-Type-Specific Genome-wide Experiments. , 2015, Molecular cell.

[25]  Aleksandra A. Kolodziejczyk,et al.  The technology and biology of single-cell RNA sequencing. , 2015, Molecular cell.

[26]  Jun S. Liu,et al.  The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans , 2015, Science.

[27]  Ash A. Alizadeh,et al.  Robust enumeration of cell subsets from tissue expression profiles , 2015, Nature Methods.

[28]  S. Linnarsson,et al.  Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq , 2015, Science.

[29]  Fabian J Theis,et al.  Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells , 2015, Nature Biotechnology.

[30]  A. Singleton,et al.  Genetic variability in the regulation of gene expression in ten regions of the human brain , 2014, Nature Neuroscience.

[31]  Jennifer M. Pocock,et al.  Insights into TREM2 biology by network analysis of human brain gene expression data , 2013, Neurobiology of Aging.

[32]  M. Farrer,et al.  Alpha‐synuclein p.H50Q, a novel pathogenic mutation for Parkinson's disease , 2013, Movement disorders : official journal of the Movement Disorder Society.

[33]  Ronald Melki,et al.  G51D α‐synuclein mutation causes a novel Parkinsonian–pyramidal syndrome , 2013, Annals of neurology.

[34]  H. Sawada,et al.  Catecholamines and Neurodegeneration in Parkinson’s Disease—From Diagnostic Marker to Aggregations of α-Synuclein , 2013, Diagnostics.

[35]  J. Schneider,et al.  Overview and findings from the religious orders study. , 2012, Current Alzheimer research.

[36]  J. Schneider,et al.  Overview and findings from the rush Memory and Aging Project. , 2012, Current Alzheimer research.

[37]  L. Stefanis α-Synuclein in Parkinson's disease. , 2012, Cold Spring Harbor perspectives in medicine.

[38]  G. Vivacqua,et al.  The role of alpha-synuclein in neurotransmission and synaptic plasticity , 2011, Journal of Chemical Neuroanatomy.

[39]  Jesse Gillis,et al.  The Impact of Multifunctional Genes on "Guilt by Association" Analysis , 2011, PloS one.

[40]  Rui Luo,et al.  Is My Network Module Preserved and Reproducible? , 2011, PLoS Comput. Biol..

[41]  Ben A. Barres,et al.  Regulation of synaptic connectivity by glia , 2010, Nature.

[42]  S. Horvath,et al.  Divergence of human and mouse brain transcriptome highlights Alzheimer disease pathways , 2010, Proceedings of the National Academy of Sciences.

[43]  Steve Horvath,et al.  WGCNA: an R package for weighted correlation network analysis , 2008, BMC Bioinformatics.

[44]  S. Horvath,et al.  Functional organization of the transcriptome in human brain , 2008, Nature Neuroscience.

[45]  Hedi Peterson,et al.  g:Profiler—a web-based toolset for functional profiling of gene lists from large-scale experiments , 2007, Nucleic Acids Res..

[46]  Atul J. Butte,et al.  Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks , 2005, BMC Bioinformatics.

[47]  J. Nutt,et al.  The dopamine transporter: Importance in Parkinson's disease , 2004, Annals of neurology.

[48]  J. Hoenicka,et al.  The new mutation, E46K, of α‐synuclein causes parkinson and Lewy body dementia , 2004, Annals of neurology.

[49]  Janel O. Johnson,et al.  α-Synuclein Locus Triplication Causes Parkinson's Disease , 2003, Science.

[50]  S. Hayashi,et al.  NACP/α-synuclein-positive filamentous inclusions in astrocytes and oligodendrocytes of Parkinson’s disease brains , 2000, Acta Neuropathologica.

[51]  E. Masliah,et al.  Alpha‐synuclein in Lewy Body Disease and Alzheimer's Disease , 1999, Brain pathology.

[52]  Olaf Riess,et al.  AlaSOPro mutation in the gene encoding α-synuclein in Parkinson's disease , 1998, Nature Genetics.

[53]  Robert L. Nussbaum,et al.  Mutation in the α-Synuclein Gene Identified in Families with Parkinson's Disease , 1997 .