Network Properties of Complex Human Disease Genes Identified through Genome-Wide Association Studies

Background Previous studies of network properties of human disease genes have mainly focused on monogenic diseases or cancers and have suffered from discovery bias. Here we investigated the network properties of complex disease genes identified by genome-wide association studies (GWAs), thereby eliminating discovery bias. Principal findings We derived a network of complex diseases (n = 54) and complex disease genes (n = 349) to explore the shared genetic architecture of complex diseases. We evaluated the centrality measures of complex disease genes in comparison with essential and monogenic disease genes in the human interactome. The complex disease network showed that diseases belonging to the same disease class do not always share common disease genes. A possible explanation could be that the variants with higher minor allele frequency and larger effect size identified using GWAs constitute disjoint parts of the allelic spectra of similar complex diseases. The complex disease gene network showed high modularity with the size of the largest component being smaller than expected from a randomized null-model. This is consistent with limited sharing of genes between diseases. Complex disease genes are less central than the essential and monogenic disease genes in the human interactome. Genes associated with the same disease, compared to genes associated with different diseases, more often tend to share a protein-protein interaction and a Gene Ontology Biological Process. Conclusions This indicates that network neighbors of known disease genes form an important class of candidates for identifying novel genes for the same disease.

[1]  P. Holme,et al.  Exploring the assortativity-clustering space of a network's degree sequence. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  S. Shen-Orr,et al.  Network motifs in the transcriptional regulation network of Escherichia coli , 2002, Nature Genetics.

[3]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[4]  R. Guimerà,et al.  Modularity from fluctuations in random graphs and complex networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[5]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[6]  M. DePamphilis,et al.  HUMAN DISEASE , 1957, The Ulster Medical Journal.

[7]  Paul A. Bates,et al.  Global topological features of cancer proteins in the human interactome , 2006, Bioinform..

[8]  F. Barrenäs,et al.  A haplotype in the inducible T‐cell tyrosine kinase is a risk factor for seasonal allergic rhinitis , 2009, Allergy.

[9]  Judith A. Blake,et al.  The Mouse Genome Database genotypes::phenotypes , 2008, Nucleic Acids Res..

[10]  Thomas Lengauer,et al.  Improved scoring of functional groups from gene expression data by decorrelating GO graph structure , 2006, Bioinform..

[11]  Alan M. Frieze,et al.  Random graphs , 2006, SODA '06.

[12]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[13]  B. Snel,et al.  Predicting disease genes using protein–protein interactions , 2006, Journal of Medical Genetics.

[14]  Bassem A. Hassan,et al.  Gene prioritization through genomic data fusion , 2006, Nature Biotechnology.

[15]  A. Barabasi,et al.  Lethality and centrality in protein networks , 2001, Nature.

[16]  M E J Newman Assortative mixing in networks. , 2002, Physical review letters.

[17]  Mykhaylo Tyomkyn,et al.  Distances in Graphs , 2010 .

[18]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[19]  D. Gale A theorem on flows in networks , 1957 .

[20]  M. Rudemo,et al.  Connectivity can be used to identify key genes in DNA microarray data: a study based on gene expression in nasal polyps before and after treatment with glucocorticoids , 2007, Acta oto-laryngologica.

[21]  Rainer Breitling,et al.  Network theory to understand microarray studies of complex diseases. , 2006, Current molecular medicine.

[22]  Michael A. Langston,et al.  A module-based analytical strategy to identify novel disease-associated genes shows an inhibitory role for interleukin 7 Receptor in allergic inflammation , 2009, BMC Systems Biology.

[23]  K. Sneppen,et al.  Specificity and Stability in Topology of Protein Networks , 2002, Science.

[24]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[25]  K. Frazer,et al.  Human genetic variation and its contribution to complex traits , 2009, Nature Reviews Genetics.

[26]  A. Barabasi,et al.  The human disease network , 2007, Proceedings of the National Academy of Sciences.

[27]  Pall I. Olason,et al.  A human phenome-interactome network of protein complexes implicated in genetic disorders , 2007, Nature Biotechnology.

[28]  Yongjin Li,et al.  Discovering disease-genes by topological features in human protein-protein interaction network , 2006, Bioinform..

[29]  M. Oti,et al.  The modular nature of genetic diseases , 2006, Clinical genetics.

[30]  Kumaran Kandasamy,et al.  An evaluation of human protein-protein interaction data in the public domain , 2006, BMC Bioinformatics.

[31]  David Valle,et al.  Human disease genes , 2001, Nature.

[32]  D. Vitkup,et al.  Network properties of genes harboring inherited disease mutations , 2008, Proceedings of the National Academy of Sciences.

[33]  Michael Q. Zhang,et al.  Network-based global inference of human disease genes , 2008, Molecular systems biology.

[34]  Shinichiro Wachi,et al.  Interactome-transcriptome analysis reveals the high centrality of genes differentially expressed in lung cancer tissues , 2005, Bioinform..

[35]  Suzanne Craft,et al.  The role of metabolic disorders in Alzheimer disease and vascular dementia: two roads converged. , 2009, Archives of neurology.

[36]  C. Wijmenga,et al.  Reconstruction of a functional human gene network, with an application for prioritizing positional candidate genes. , 2006, American journal of human genetics.

[37]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.