Not by systems alone: replicability assessment of disease expression signals

In characterizing a disease, it is common to search for dysfunctional genes by assaying the transcriptome. The resulting differentially expressed genes are typically assessed for shared features, such as functional annotation or co-expression. While useful, the reliability of these systems methods is hard to evaluate. To better understand shared disease signals, we assess their replicability by first looking at gene-level recurrence and then pathway-level recurrence along with co-expression signals across six pedigrees of a rare homogeneous X-linked disorder, TAF1 syndrome. We find most differentially expressed genes are not recurrent between pedigrees, making functional enrichment largely distinct in each pedigree. However, we find two highly recurrent “functional outliers” (CACNA1I and IGFBP3), genes acting atypically with respect to co-expression and therefore absent from a systems-level assessment. We show this occurs in re-analysis of Huntington’s disease, Parkinson’s disease and schizophrenia. Our results suggest a significant role for genes easily missed in systems approaches.

[1]  Benjamin J. Strober,et al.  Identification of rare-disease genes in diverse undiagnosed cases using whole blood transcriptome sequencing and large control cohorts , 2018, bioRxiv.

[2]  L. Tora,et al.  Homozygous TAF8 mutation in a patient with intellectual disability results in undetectable TAF8 protein, but preserved RNA polymerase II transcription , 2018, Human molecular genetics.

[3]  Esti Yeger-Lotem,et al.  RSRC1 mutation affects intellect and behaviour through aberrant splicing and transcription, downregulating IGFBP3 , 2018, Brain : a journal of neurology.

[4]  Helena Kuivaniemi,et al.  A review of genome‐wide transcriptomics studies in Parkinson's disease , 2018, The European journal of neuroscience.

[5]  Jaques Reifman,et al.  A strategy for evaluating pathway analysis methods , 2017, BMC Bioinformatics.

[6]  Funded Statistical Methods groups-AWG,et al.  Improving genetic diagnosis in Mendelian disease with transcriptome sequencing , 2017 .

[7]  Eugene I. Drigalenko,et al.  Transcriptome sequencing study implicates immune-related genes differentially expressed in schizophrenia: new data and a meta-analysis , 2017, Translational Psychiatry.

[8]  Thomas Meitinger,et al.  Genetic diagnosis of Mendelian disorders via RNA sequencing , 2017, Nature Communications.

[9]  Paul D Thomas,et al.  The Gene Ontology and the Meaning of Biological Function. , 2017, Methods in molecular biology.

[10]  Tin Chi Nguyen,et al.  TOMAS: A novel TOpology-aware Meta-Analysis approach applied to System biology , 2016, BCB.

[11]  Benjamin Haibe-Kains,et al.  Study of Meta-analysis strategies for network inference using information-theoretic approaches , 2016, 2016 27th International Workshop on Database and Expert Systems Applications (DEXA).

[12]  K. Ihara,et al.  De novo missense mutations in NALCN cause developmental and intellectual impairment with hypotonia , 2016, Journal of Human Genetics.

[13]  N. Pervaiz,et al.  The Parkinson Disease gene SNCA: Evolutionary and structural insights with pathological implication , 2016, Scientific Reports.

[14]  E. Nogales,et al.  Structure of promoter-bound TFIID and model of human pre-initiation complex assembly , 2016, Nature.

[15]  M. Sobrido,et al.  Synaptotagmin XI in Parkinson's disease: New evidence from an association study in Spain and Mexico , 2016, Journal of the Neurological Sciences.

[16]  Jing Zhao,et al.  A Burden of Rare Variants Associated with Extremes of Gene Expression in Human Peripheral Blood. , 2016, American journal of human genetics.

[17]  Giulio Genovese,et al.  Schizophrenia risk from complex variation of complement component 4 , 2016, Nature.

[18]  James Y. Zou Analysis of protein-coding genetic variation in 60,706 humans , 2015, Nature.

[19]  Edward Yang,et al.  TAF1 Variants Are Associated with Dysmorphic Features, Intellectual Disability, and Neurological Manifestations , 2015, American journal of human genetics.

[20]  Sara Ballouz,et al.  Guidance for RNA-seq co-expression network construction and analysis: safety in numbers , 2015, Bioinform..

[21]  Daniel S. Himmelstein,et al.  Understanding multicellular function and disease with human tissue-specific networks , 2015, Nature Genetics.

[22]  Ash A. Alizadeh,et al.  Robust enumeration of cell subsets from tissue expression profiles , 2015, Nature Methods.

[23]  Mohamed Abouelhoda,et al.  Accelerating novel candidate gene discovery in neurogenetic disorders via whole-exome sequencing of prescreened multiplex consanguineous families. , 2015, Cell reports.

[24]  Guoli Ji,et al.  Aberrant Gene Expression in Humans , 2015, PLoS genetics.

[25]  S. Teng,et al.  RNA Sequencing in Schizophrenia , 2015, Bioinformatics and biology insights.

[26]  M. Owen,et al.  Genetics of schizophrenia , 2005, Current Opinion in Behavioral Sciences.

[27]  G. Miller,et al.  VMAT2 and Parkinson’s disease: harnessing the dopamine vesicle , 2014, Expert review of neurotherapeutics.

[28]  Eric S. Lander,et al.  A polygenic burden of rare disruptive mutations in schizophrenia , 2014, Nature.

[29]  J. Shendure,et al.  A general framework for estimating the relative pathogenicity of human genetic variants , 2014, Nature Genetics.

[30]  G. Tseng,et al.  Beyond modules and hubs: the potential of gene coexpression networks for investigating molecular mechanisms of complex brain disorders , 2014, Genes, brain, and behavior.

[31]  Dvir Dahary,et al.  Microcephaly thin corpus callosum intellectual disability syndrome caused by mutated TAF2. , 2013, Pediatric neurology.

[32]  L. Siever,et al.  Spatial and Temporal Mapping of De Novo Mutations in Schizophrenia to a Fetal Prefrontal Cortical Network , 2013, Cell.

[33]  Dong Wook Han,et al.  A central role for TFIID in the pluripotent transcription circuitry , 2013, Nature.

[34]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[35]  Xiang Wan,et al.  Gemma: a resource for the reuse, sharing and meta-analysis of expression profiling data , 2012, Bioinform..

[36]  William Stafford Noble,et al.  Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors , 2012, Genome research.

[37]  Bronwen L. Aken,et al.  GENCODE: The reference human genome annotation for The ENCODE Project , 2012, Genome research.

[38]  K. Mayilyan Complement genetics, deficiencies, and disease associations , 2012, Protein & Cell.

[39]  Patrick R Hof,et al.  Optimizing the phenotyping of rodent ASD models: enrichment analysis of mouse and human neurobiological phenotypes associated with high-risk autism genes identifies morphological, electrophysiological, neurological, and behavioral features , 2012, Molecular Autism.

[40]  K. Hansen,et al.  Sequencing technology does not eliminate biological variability , 2011, Nature Biotechnology.

[41]  F. Müller,et al.  Developmental regulation of transcription initiation: more than just changing the actors. , 2010, Current opinion in genetics & development.

[42]  Pablo V Gejman,et al.  The role of genetics in the etiology of schizophrenia. , 2010, The Psychiatric clinics of North America.

[43]  I. Hanbauer,et al.  Lower number of cerebellar Purkinje neurons in psychosis is associated with reduced reelin expression , 2010, Proceedings of the National Academy of Sciences.

[44]  A. Barabasi,et al.  Network medicine : a network-based approach to human disease , 2010 .

[45]  R. Schliebs,et al.  Synaptotagmins in Neurodegeneration , 2009, Anatomical record.

[46]  P. Garred,et al.  MBL2, FCN1, FCN2 and FCN3-The genes behind the initiation of the lectin pathway of complement. , 2009, Molecular immunology.

[47]  L. Liang,et al.  Mapping complex disease traits with global gene expression , 2009, Nature Reviews Genetics.

[48]  E. Dermitzakis From gene expression to disease risk , 2008, Nature Genetics.

[49]  Bin Zhang,et al.  Defining clusters from a hierarchical cluster tree: the Dynamic Tree Cut package for R , 2008, Bioinform..

[50]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[51]  B. Migeon Why females are mosaics, X-chromosome inactivation, and sex differences in disease. , 2007, Gender medicine.

[52]  R. Jirtle,et al.  Environmental epigenomics and disease susceptibility , 2007, Nature Reviews Genetics.

[53]  Pall I. Olason,et al.  A human phenome-interactome network of protein complexes implicated in genetic disorders , 2007, Nature Biotechnology.

[54]  Paul A Clemons,et al.  The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease , 2006, Science.

[55]  D. Goldstein,et al.  UCHL‐1 is not a Parkinson's disease susceptibility gene , 2006, Annals of neurology.

[56]  B. Harper Huntington Disease , 2005, Journal of the Royal Society of Medicine.

[57]  G. Bates History of genetic disease: The molecular genetics of Huntington disease — a history , 2005, Nature Reviews Genetics.

[58]  E. Kipreos,et al.  Developmental cell biology: C. elegans cell cycles: invariance and stem cell divisions , 2005, Nature Reviews Molecular Cell Biology.

[59]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[60]  J. Raser,et al.  Noise in Gene Expression: Origins, Consequences, and Control , 2005, Science.

[61]  D. Goldstein,et al.  UCHL1 is a Parkinson's disease susceptibility gene. , 2004 .

[62]  D. Maraganore,et al.  UCHL1 is a Parkinson's disease susceptibility gene , 2004, Annals of neurology.

[63]  P. Bauer,et al.  Trinucleotide repeat expansion in SCA17/TBP in white patients with Huntington’s disease-like phenotype , 2004, Journal of Medical Genetics.

[64]  Douglas A. Hosack,et al.  Identifying biological themes within lists of genes with EASE , 2003, Genome Biology.

[65]  H. Kitano Systems Biology: A Brief Overview , 2002, Science.