Not by systems alone: identifying functional outliers in rare disease pedigrees

In disease expression analysis, looking for shared functional signals from a set of genes which exhibit differential expression is commonplace. We examine the complement as a possibility, that disease genes display "outlier" or unexpected expression relative to broader patterns of functional expression variation. Using six families from the rare TAF1 syndrome disease cohort, we performed family-specific differential expression analyses and find that functional characterization of top candidates enriches for common pathways unlikely to be specifically linked to disease. However, by filtering away common expression changes using known co-expression, we lose all functional enrichment and are left with a small number of outliers characteristic of each proband. Two of these outlier genes are highly recurrent across pedigrees (FDR

[1]  Benjamin J. Strober,et al.  Identification of rare-disease genes in diverse undiagnosed cases using whole blood transcriptome sequencing and large control cohorts , 2018, bioRxiv.

[2]  L. Tora,et al.  Homozygous TAF8 mutation in a patient with intellectual disability results in undetectable TAF8 protein, but preserved RNA polymerase II transcription , 2018, Human molecular genetics.

[3]  Esti Yeger-Lotem,et al.  RSRC1 mutation affects intellect and behaviour through aberrant splicing and transcription, downregulating IGFBP3 , 2018, Brain : a journal of neurology.

[4]  Helena Kuivaniemi,et al.  A review of genome‐wide transcriptomics studies in Parkinson's disease , 2018, The European journal of neuroscience.

[5]  Jaques Reifman,et al.  A strategy for evaluating pathway analysis methods , 2017, BMC Bioinformatics.

[6]  Funded Statistical Methods groups-AWG,et al.  Improving genetic diagnosis in Mendelian disease with transcriptome sequencing , 2017 .

[7]  Eugene I. Drigalenko,et al.  Transcriptome sequencing study implicates immune-related genes differentially expressed in schizophrenia: new data and a meta-analysis , 2017, Translational Psychiatry.

[8]  Thomas Meitinger,et al.  Genetic diagnosis of Mendelian disorders via RNA sequencing , 2017, Nature Communications.

[9]  Kara Dolinski,et al.  The BioGRID interaction database: 2017 update , 2016, Nucleic Acids Res..

[10]  Paul D Thomas,et al.  The Gene Ontology and the Meaning of Biological Function. , 2017, Methods in molecular biology.

[11]  Benjamin Haibe-Kains,et al.  Study of Meta-analysis strategies for network inference using information-theoretic approaches , 2016, 2016 27th International Workshop on Database and Expert Systems Applications (DEXA).

[12]  K. Ihara,et al.  De novo missense mutations in NALCN cause developmental and intellectual impairment with hypotonia , 2016, Journal of Human Genetics.

[13]  N. Pervaiz,et al.  The Parkinson Disease gene SNCA: Evolutionary and structural insights with pathological implication , 2016, Scientific Reports.

[14]  E. Nogales,et al.  Structure of promoter-bound TFIID and model of human pre-initiation complex assembly , 2016, Nature.

[15]  M. Sobrido,et al.  Synaptotagmin XI in Parkinson's disease: New evidence from an association study in Spain and Mexico , 2016, Journal of the Neurological Sciences.

[16]  Jing Zhao,et al.  A Burden of Rare Variants Associated with Extremes of Gene Expression in Human Peripheral Blood. , 2016, American journal of human genetics.

[17]  Giulio Genovese,et al.  Schizophrenia risk from complex variation of complement component 4 , 2016, Nature.

[18]  James Y. Zou Analysis of protein-coding genetic variation in 60,706 humans , 2015, Nature.

[19]  Edward Yang,et al.  TAF1 Variants Are Associated with Dysmorphic Features, Intellectual Disability, and Neurological Manifestations , 2015, American journal of human genetics.

[20]  Kenny Q. Ye,et al.  Low load for disruptive mutations in autism genes and their biased transmission , 2015, Proceedings of the National Academy of Sciences.

[21]  Sara Ballouz,et al.  Guidance for RNA-seq co-expression network construction and analysis: safety in numbers , 2015, Bioinform..

[22]  William Stafford Noble,et al.  Machine learning applications in genetics and genomics , 2015, Nature Reviews Genetics.

[23]  Daniel S. Himmelstein,et al.  Understanding multicellular function and disease with human tissue-specific networks , 2015, Nature Genetics.

[24]  Ash A. Alizadeh,et al.  Robust enumeration of cell subsets from tissue expression profiles , 2015, Nature Methods.

[25]  Mohamed Abouelhoda,et al.  Accelerating novel candidate gene discovery in neurogenetic disorders via whole-exome sequencing of prescreened multiplex consanguineous families. , 2015, Cell reports.

[26]  Guoli Ji,et al.  Aberrant Gene Expression in Humans , 2015, PLoS genetics.

[27]  M. Owen,et al.  Genetics of schizophrenia , 2005, Current Opinion in Behavioral Sciences.

[28]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[29]  G. Miller,et al.  VMAT2 and Parkinson’s disease: harnessing the dopamine vesicle , 2014, Expert review of neurotherapeutics.

[30]  David P. Kreil,et al.  A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control consortium , 2014, Nature Biotechnology.

[31]  Jay Shendure,et al.  Disruptive CHD8 Mutations Define a Subtype of Autism Early in Development , 2014, Cell.

[32]  A. Barabasi,et al.  Human symptoms–disease network , 2014, Nature Communications.

[33]  B. Kuster,et al.  Mass-spectrometry-based draft of the human proteome , 2014, Nature.

[34]  Bin Yan,et al.  CMGRN: a web server for constructing multilevel gene regulatory networks using ChIP-seq and gene expression data , 2014, Bioinform..

[35]  Eric S. Lander,et al.  A polygenic burden of rare disruptive mutations in schizophrenia , 2014, Nature.

[36]  J. Shendure,et al.  A general framework for estimating the relative pathogenicity of human genetic variants , 2014, Nature Genetics.

[37]  G. Tseng,et al.  Beyond modules and hubs: the potential of gene coexpression networks for investigating molecular mechanisms of complex brain disorders , 2014, Genes, brain, and behavior.

[38]  Gautier Koscielny,et al.  The International Mouse Phenotyping Consortium Web Portal, a unified point of access for knockout mice and related phenotyping data , 2013, Nucleic Acids Res..

[39]  Dvir Dahary,et al.  Microcephaly thin corpus callosum intellectual disability syndrome caused by mutated TAF2. , 2013, Pediatric neurology.

[40]  K. Boycott,et al.  Rare-disease genetics in the era of next-generation sequencing: discovery to translation , 2013, Nature Reviews Genetics.

[41]  L. Siever,et al.  Spatial and Temporal Mapping of De Novo Mutations in Schizophrenia to a Fetal Prefrontal Cortical Network , 2013, Cell.

[42]  Ellen T. Gelfand,et al.  The Genotype-Tissue Expression (GTEx) project , 2013, Nature Genetics.

[43]  Dong Wook Han,et al.  A central role for TFIID in the pluripotent transcription circuitry , 2013, Nature.

[44]  R. Young,et al.  Transcriptional Regulation and Its Misregulation in Disease , 2013, Cell.

[45]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[46]  A. Lu,et al.  Support for calcium channel gene defects in autism spectrum disorders , 2012, Molecular Autism.

[47]  Susan R. Wilson,et al.  Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing , 2012, BMC Genomics.

[48]  Xiang Wan,et al.  Gemma: a resource for the reuse, sharing and meta-analysis of expression profiling data , 2012, Bioinform..

[49]  William Stafford Noble,et al.  Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors , 2012, Genome research.

[50]  Bronwen L. Aken,et al.  GENCODE: The reference human genome annotation for The ENCODE Project , 2012, Genome research.

[51]  K. Mayilyan Complement genetics, deficiencies, and disease associations , 2012, Protein & Cell.

[52]  Paul Pavlidis,et al.  “Guilt by Association” Is the Exception Rather Than the Rule in Gene Networks , 2012, PLoS Comput. Biol..

[53]  K. Hansen,et al.  Sequencing technology does not eliminate biological variability , 2011, Nature Biotechnology.

[54]  Christopher B. Forrest,et al.  The case for a global rare-diseases registry , 2011, The Lancet.

[55]  F. Müller,et al.  Developmental regulation of transcription initiation: more than just changing the actors. , 2010, Current opinion in genetics & development.

[56]  Pablo V Gejman,et al.  The role of genetics in the etiology of schizophrenia. , 2010, The Psychiatric clinics of North America.

[57]  I. Hanbauer,et al.  Lower number of cerebellar Purkinje neurons in psychosis is associated with reduced reelin expression , 2010, Proceedings of the National Academy of Sciences.

[58]  R. Schliebs,et al.  Synaptotagmins in Neurodegeneration , 2009, Anatomical record.

[59]  Rafael A Irizarry,et al.  Gene set enrichment analysis made simple , 2009, Statistical methods in medical research.

[60]  Michael J. Keiser,et al.  Predicting new molecular targets for known drugs , 2009, Nature.

[61]  P. Garred,et al.  MBL2, FCN1, FCN2 and FCN3-The genes behind the initiation of the lectin pathway of complement. , 2009, Molecular immunology.

[62]  L. Liang,et al.  Mapping complex disease traits with global gene expression , 2009, Nature Reviews Genetics.

[63]  E. Dermitzakis From gene expression to disease risk , 2008, Nature Genetics.

[64]  Bin Zhang,et al.  Defining clusters from a hierarchical cluster tree: the Dynamic Tree Cut package for R , 2008, Bioinform..

[65]  Yoshihiro Yamanishi,et al.  KEGG for linking genomes to life and the environment , 2007, Nucleic Acids Res..

[66]  Calcium channels - An overview , 2008 .

[67]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[68]  B. Migeon Why females are mosaics, X-chromosome inactivation, and sex differences in disease. , 2007, Gender medicine.

[69]  R. Jirtle,et al.  Environmental epigenomics and disease susceptibility , 2007, Nature Reviews Genetics.

[70]  Pall I. Olason,et al.  A human phenome-interactome network of protein complexes implicated in genetic disorders , 2007, Nature Biotechnology.

[71]  Paul A Clemons,et al.  The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease , 2006, Science.

[72]  E. Kipreos,et al.  Developmental cell biology: C. elegans cell cycles: invariance and stem cell divisions , 2005, Nature Reviews Molecular Cell Biology.

[73]  J. Raser,et al.  Noise in Gene Expression: Origins, Consequences, and Control , 2005, Science.

[74]  Homin K. Lee,et al.  Coexpression analysis of human genes across many microarray data sets. , 2004, Genome research.

[75]  D. Goldstein,et al.  UCHL1 is a Parkinson's disease susceptibility gene. , 2004 .

[76]  P. Bauer,et al.  Trinucleotide repeat expansion in SCA17/TBP in white patients with Huntington’s disease-like phenotype , 2004, Journal of Medical Genetics.

[77]  Douglas A. Hosack,et al.  Identifying biological themes within lists of genes with EASE , 2003, Genome Biology.

[78]  D. Pe’er,et al.  Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data , 2003, Nature Genetics.

[79]  B. Snel,et al.  Function prediction and protein networks. , 2003, Current opinion in cell biology.

[80]  H. Kitano Systems Biology: A Brief Overview , 2002, Science.

[81]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[82]  D. Eisenberg,et al.  Detecting protein function and protein-protein interactions from genome sequences. , 1999, Science.

[83]  Edmund M. Talley,et al.  Differential Distribution of Three Members of a Gene Family Encoding Low Voltage-Activated (T-Type) Calcium Channels , 1999, The Journal of Neuroscience.