Align human interactome with phenome to identify causative genes and networks underlying disease families

MOTIVATION: Understanding the complexity of gene-phenotype relationships is vital for revealing the genetic basis of common diseases. Recent studies based on the human interactome and phenome not only uncover prevalent phenotypic and genetic overlap between diseases, but also reveal a modular organization of the genetic landscape of human diseases, providing new opportunities to reduce the complexity of dissecting gene-phenotype associations.

RESULTS: We provide systematic and quantitative evidence that phenotypic overlap implies genetic overlap. Building on these results, we perform the first heterogeneous alignment of the human interactome and phenome via a network alignment technique and identify 39 disease families with corresponding causative gene networks. Finally, we propose AlignPI, an alignment-based framework for predicting disease genes, and identify plausible candidates for 70 diseases. Our method scales well to the whole genome, as demonstrated by prioritizing 6154 genes across 37 chromosome regions for Crohn's disease (CD). The results are consistent with a recent meta-analysis of genome-wide association studies for CD.

AVAILABILITY: Bi-modules and disease gene predictions are freely available at http://bioinfo.au.tsinghua.edu.cn/alignpi/
