The human gene connectome as a map of short cuts for morbid allele discovery

High-throughput genomic data reveal thousands of gene variants per patient, and it is often difficult to determine which of these variants underlies disease in a given individual. However, at the population level, there may be some degree of phenotypic homogeneity, with alterations of specific physiological pathways underlying the pathogenesis of a particular disease. We describe here the human gene connectome (HGC) as a unique approach for human Mendelian genetic research, facilitating the interpretation of abundant genetic data from patients with the same disease, and guiding subsequent experimental investigations. We first defined the set of the shortest plausible biological distances, routes, and degrees of separation between all pairs of human genes by applying a shortest distance algorithm to the full human gene network. We then designed a hypothesis-driven application of the HGC, in which we generated a Toll-like receptor 3-specific connectome useful for the genetic dissection of inborn errors of Toll-like receptor 3 immunity. In addition, we developed a functional genomic alignment approach from the HGC. In functional genomic alignment, the genes are clustered according to biological distance (rather than the traditional molecular evolutionary genetic distance), as estimated from the HGC. Finally, we compared the HGC with three state-of-the-art methods: String, FunCoup, and HumanNet. We demonstrated that the existing methods are more suitable for polygenic studies, whereas HGC approaches are more suitable for monogenic studies. The HGC and functional genomic alignment data and computer programs are freely available to noncommercial users from http://lab.rockefeller.edu/casanova/HGC and should facilitate the genome-wide selection of disease-causing candidate alleles for experimental validation.

[1]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008, Proceedings of the Python in Science Conference.

[2]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[3]  D. Chaussabel,et al.  Herpes simplex virus encephalitis in a patient with complete TLR3 deficiency: TLR3 is otherwise redundant in protective immunity , 2011, The Journal of experimental medicine.

[4]  L. Lagae,et al.  Classical Ehlers-Danlos syndrome caused by a mutation in type I collagen. , 2000, American journal of human genetics.

[5]  G. Daley,et al.  Impaired intrinsic immunity to HSV-1 in human iPSC-derived TLR3-deficient CNS cells , 2012, Nature.

[6]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[7]  D. Chaussabel,et al.  Heterozygous TBK1 mutations impair TLR3 immunity and underlie herpes simplex encephalitis of childhood , 2012, The Journal of experimental medicine.

[8]  E. Marcotte,et al.  Prioritizing candidate disease genes by network-based boosting of genome-wide association data. , 2011, Genome research.

[9]  P. Narcisi,et al.  A family with Ehlers-Danlos syndrome type III/articular hypermobility syndrome has a glycine 637 to serine substitution in type III collagen. , 1994, Human molecular genetics.

[10]  J. Yates,et al.  The gene encoding collagen alpha1(V)(COL5A1) is linked to mixed Ehlers-Danlos syndrome type I/II. , 1996, The Journal of investigative dermatology.

[11]  Damian Szklarczyk,et al.  The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored , 2010, Nucleic Acids Res..

[12]  J. Casanova,et al.  Human TRAF3 adaptor molecule deficiency leads to impaired Toll-like receptor 3 response and susceptibility to herpes simplex encephalitis. , 2010, Immunity.

[13]  L. Peltonen,et al.  Evidence for a structural mutation of procollagen type I in a patient with the Ehlers-Danlos syndrome type VII. , 1980, The Journal of biological chemistry.

[14]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[15]  B. Snel,et al.  STRING: a web-server to retrieve and display the repeatedly occurring neighbourhood of a gene. , 2000, Nucleic acids research.

[16]  J. Casanova,et al.  Primary Immunodeficiencies: A Field in Its Infancy , 2007, Science.

[17]  M. Metzker Sequencing technologies — the next generation , 2010, Nature Reviews Genetics.

[18]  J. Casanova,et al.  Life‐threatening infectious diseases of childhood: single‐gene inborn errors of immunity? , 2010, Annals of the New York Academy of Sciences.

[19]  Kevin Bryson,et al.  Detecting Gene Duplications in the Human Lineage , 2010, Annals of human genetics.

[20]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[21]  Susumu Goto,et al.  KEGG for integration and interpretation of large-scale molecular data sets , 2011, Nucleic Acids Res..

[22]  D. Chaussabel,et al.  Herpes simplex encephalitis in children with autosomal recessive and dominant TRIF deficiency. , 2011, The Journal of clinical investigation.

[23]  Livia Perfetto,et al.  MINT, the molecular interaction database: 2012 update , 2011, Nucleic Acids Res..

[24]  B. Beutler,et al.  How host defense is encoded in the mammalian genome , 2011, Mammalian Genome.

[25]  Edward M. Reingold,et al.  Graph drawing by force‐directed placement , 1991, Softw. Pract. Exp..

[26]  T. Matsunaga Value of genetic testing in the otological approach for sensorineural hearing loss. , 2009, The Keio journal of medicine.

[27]  J. Casanova,et al.  TLR3 immunity to infection in mice and humans. , 2013, Current opinion in immunology.

[28]  Christian von Mering,et al.  STRING: a database of predicted functional associations between proteins , 2003, Nucleic Acids Res..

[29]  A. Leonardi,et al.  Association of the adaptor TANK with the I kappa B kinase (IKK) regulator NEMO connects IKK complexes with IKK epsilon and TBK1 kinases. , 2002, The Journal of biological chemistry.

[30]  J. Casanova,et al.  Inborn errors of anti-viral interferon immunity in humans. , 2011, Current opinion in virology.

[31]  Christian Gilissen,et al.  Disease gene identification strategies for exome sequencing , 2012, European Journal of Human Genetics.

[32]  A. Smahi,et al.  TLR3 Deficiency in Patients with Herpes Simplex Encephalitis , 2007, Science.

[33]  J. Casanova,et al.  Herpes Simplex Virus Encephalitis in Human UNC-93B Deficiency , 2006, Science.

[34]  Korbinian Strimmer,et al.  APE: Analyses of Phylogenetics and Evolution in R language , 2004, Bioinform..

[35]  Andrey Alexeyenko,et al.  Comparative interactomics with Funcoup 2.0 , 2011, Nucleic Acids Res..

[36]  G. Sen,et al.  Epidermal Growth Factor Receptor Is Essential for Toll-Like Receptor 3 Signaling , 2012, Science Signaling.

[37]  A. Leonardi,et al.  Association of the Adaptor TANK with the IκB Kinase (IKK) Regulator NEMO Connects IKK Complexes with IKKε and TBK1 Kinases* , 2002, The Journal of Biological Chemistry.

[38]  Lincoln Stein,et al.  Reactome knowledgebase of human biological pathways and processes , 2008, Nucleic Acids Res..

[39]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[40]  Matthew D. Young,et al.  Gene ontology analysis for RNA-seq: accounting for selection bias , 2010, Genome Biology.

[41]  B. Beutler,et al.  Resisting viral infection: the gene by gene approach. , 2011, Current opinion in virology.

[42]  B. Hamel,et al.  Haploinsufficiency of TNXB is associated with hypermobility type of Ehlers-Danlos syndrome. , 2003, American journal of human genetics.