Disease-driven detection of differential inherited SNP modules from SNP network.

Detection of the synergetic effects between variants, such as single-nucleotide polymorphisms (SNPs), is crucial for understanding the genetic characters of complex diseases. Here, we proposed a two-step approach to detect differentially inherited SNP modules (synergetic SNP units) from a SNP network. First, SNP-SNP interactions are identified based on prior biological knowledge, such as their adjacency on the chromosome or degree of relatedness between the functional relationships of their genes. These interactions form SNP networks. Second, disease-risk SNP modules (or sub-networks) are prioritised by their differentially inherited properties in IBD (Identity by Descent) profiles of affected and unaffected sibpairs. The search process is driven by the disease information and follows the structure of a SNP network. Simulation studies have indicated that this approach achieves high accuracy and a low false-positive rate in the identification of known disease-susceptible SNPs. Applying this method to an alcoholism dataset, we found that flexible patterns of susceptible SNP combinations do play a role in complex diseases, and some known genes were detected through these risk SNP modules. One example is GRM7, a known alcoholism gene successfully detected by a SNP module comprised of two SNPs, but neither of the two SNPs was significantly associated with the disease in single-locus analysis. These identified genes are also enriched in some pathways associated with alcoholism, including the calcium signalling pathway, axon guidance and neuroactive ligand-receptor interaction. The integration of network biology and genetic analysis provides putative functional bridges between genetic variants and candidate genes or pathways, thereby providing new insight into the aetiology of complex diseases.

[1]  M. Stein,et al.  Mutation screen of the GAD2 gene and association study of alcoholism in three populations , 2007, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[2]  P. Stankiewicz,et al.  Intragenic rearrangements in NRXN1 in three families with autism spectrum disorder, developmental delay, and speech delay , 2010, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[3]  J. Schwaber,et al.  Chronic alcohol exposure alters transcription broadly in a key integrative brain nucleus for homeostasis: the nucleus tractus solitarius. , 2005, Physiological genomics.

[4]  G. Abecasis,et al.  Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies , 2006, Nature Genetics.

[5]  Chiara Sabatti,et al.  Human genetics: Variants in common diseases , 2007, Nature.

[6]  Michael Q. Zhang,et al.  Network-based global inference of human disease genes , 2008, Molecular systems biology.

[7]  S. Yamawaki,et al.  Decreased inositol 1,4,5-trisphosphate-specific binding in platelets from alcoholic subjects , 1996, Biological Psychiatry.

[8]  Shinichiro Wachi,et al.  Interactome-transcriptome analysis reveals the high centrality of genes differentially expressed in lung cancer tissues , 2005, Bioinform..

[9]  G. Abecasis,et al.  Optimal designs for two‐stage genome‐wide association studies , 2007, Genetic epidemiology.

[10]  C. Nemeroff,et al.  Neurotensin studies in alcohol naive, preferring and non-preferring rats ∗ Presented in part at the 1998 ACNP Meeting. ∗ , 1999, Neuroscience.

[11]  John P A Ioannidis,et al.  Meta-analysis in genome-wide association studies. , 2009, Pharmacogenomics.

[12]  Deborah C. Mash,et al.  Gene Expression in Human Hippocampus from Cocaine Abusers Identifies Genes which Regulate Extracellular Matrix Remodeling , 2007, PloS one.

[13]  S. Reymann,et al.  Transcriptome profiling of human hepatocytes treated with Aroclor 1254 reveals transcription factor regulatory networks and clusters of regulated genes , 2006, BMC Genomics.

[14]  Mark A. Levenstien,et al.  Precision and type I error rate in the presence of genotype errors and missing parental data: a comparison between the original transmission disequilibrium test (TDT) and TDTae statistics , 2005, BMC Genetics.

[15]  Ju Han Kim,et al.  Genomic characterization of perturbation sensitivity , 2007, ISMB/ECCB.

[16]  D. Maraganore,et al.  A Genomic Pathway Approach to a Complex Disease: Axon Guidance and Parkinson Disease , 2007, PLoS genetics.

[17]  Brian L. Browning,et al.  High-resolution detection of identity by descent in unrelated individuals. , 2010, American journal of human genetics.

[18]  John P. A. Ioannidis,et al.  Validating, augmenting and refining genome-wide association signals , 2009, Nature Reviews Genetics.

[19]  J. Hein,et al.  Using biological networks to search for interacting loci in genome-wide association studies , 2009, European Journal of Human Genetics.

[20]  B. Garvik,et al.  Principles for the Buffering of Genetic Variation , 2001, Science.

[21]  S. Liu-Cordero Patterns of linkage disequilibrium in the human genome , 2002 .

[22]  Jason H. Moore,et al.  Exploiting the proteome to improve the genome-wide genetic analysis of epistasis in common human diseases , 2008, Human Genetics.

[23]  Pak Chung Sham,et al.  WHAP: haplotype-based association analysis , 2007, Bioinform..

[24]  L. Wasserman,et al.  On the identification of disease mutations by the analysis of haplotype similarity and goodness of fit. , 2003, American journal of human genetics.

[25]  G. T. te Meerman,et al.  Haplotype sharing analysis in affected individuals from nuclear families with at least one affected offspring , 1997, Genetic epidemiology.

[26]  P. Donnelly,et al.  Genome-wide strategies for detecting multiple loci that influence complex diseases , 2005, Nature Genetics.

[27]  P. Park,et al.  Discovering statistically significant pathways in expression profiling studies. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[28]  N. Mendell,et al.  Genetic Analysis Workshop 14: microsatellite and single-nucleotide polymorphism marker loci for genome-wide scans , 2005, BMC Genetics.

[29]  Gary O Zerbe,et al.  Permutation‐based adjustments for the significance of partial regression coefficients in microarray data analysis , 2008, Genetic epidemiology.

[30]  Evyn J. Loucks,et al.  Deciphering the role of Shh signaling in axial defects produced by ethanol exposure. , 2009, Birth defects research. Part A, Clinical and molecular teratology.

[31]  B. Tabakoff,et al.  Genomic Insights into Acute Alcohol Tolerance , 2008, Journal of Pharmacology and Experimental Therapeutics.

[32]  B K Suarez,et al.  The affected sib pair IBD distribution for HLA-linked disease susceptibility genes. , 1978, Tissue antigens.

[33]  Brien Riley,et al.  Molecular genetic studies of schizophrenia , 2006, European Journal of Human Genetics.

[34]  Alcohol affects the skeletal muscle proteins, titin and nebulin in male and female rats. , 2003, The Journal of nutrition.

[35]  D. Goldman,et al.  Alcoholism is associated with GALR3 but not two other galanin receptor genes , 2007, Genes, brain, and behavior.

[36]  Rod K. Nibbe,et al.  Discovery and Scoring of Protein Interaction Subnetworks Discriminative of Late Stage Human Colon Cancer*S , 2009, Molecular & Cellular Proteomics.

[37]  C. Vadász,et al.  Glutamate receptor metabotropic 7 is cis-regulated in the mouse brain and modulates alcohol drinking. , 2007, Genomics.

[38]  E. Petretto,et al.  Integrated transcriptional profiling and linkage analysis for identification of genes underlying disease , 2005, Nature Genetics.

[39]  Hsin-Chou Yang,et al.  A genome-wide scanning and fine mapping study of COGA data , 2005, BMC Genetics.

[40]  T. Ideker,et al.  Network-based classification of breast cancer metastasis , 2007, Molecular systems biology.

[41]  Stefan Wuchty,et al.  Peeling the yeast protein network , 2005, Proteomics.

[42]  Roded Sharan,et al.  Analysis of SNP-expression association matrices , 2006, 2005 IEEE Computational Systems Bioinformatics Conference (CSB'05).

[43]  Yun Xiao,et al.  A systematic method for mapping multiple loci: an application to construct a genetic network for rheumatoid arthritis. , 2008, Gene.

[44]  I. Pe’er,et al.  Optimal two‐stage genotyping designs for genome‐wide association scans , 2006, Genetic epidemiology.

[45]  Nazneen Rahman,et al.  The emerging landscape of breast cancer susceptibility , 2007, Nature Genetics.

[46]  T. Reich,et al.  A perspective on epistasis: limits of models displaying no main effect. , 2002, American journal of human genetics.

[47]  Zohar Yakhini,et al.  Methods for Analysis and Visualization of SNP Genotype Data for Complex Diseases , 2002, Pacific Symposium on Biocomputing.

[48]  L. Almasy,et al.  Genome-wide linkage analysis for identifying quantitative trait loci involved in the regulation of lipoprotein a (Lpa) levels , 2008, European Journal of Human Genetics.

[49]  Jason H. Moore,et al.  The Ubiquitous Nature of Epistasis in Determining Susceptibility to Common Human Diseases , 2003, Human Heredity.

[50]  H. Cordell Detecting gene–gene interactions that underlie human diseases , 2009, Nature Reviews Genetics.

[51]  Bing Zhang,et al.  WebGestalt: an integrated system for exploring gene sets in various biological contexts , 2005, Nucleic Acids Res..

[52]  Simon Cawley,et al.  Description of the data from the Collaborative Study on the Genetics of Alcoholism (COGA) and single-nucleotide polymorphism genotyping for Genetic Analysis Workshop 14 , 2005, BMC Genetics.

[53]  J. Chang-Claude,et al.  Impact of genotyping errors on the type I error rate and the power of haplotype-based association methods , 2009, BMC Genetics.

[54]  Russ B. Altman,et al.  Missing value estimation methods for DNA microarrays , 2001, Bioinform..

[55]  N Risch,et al.  The Future of Genetic Studies of Complex Human Diseases , 1996, Science.

[56]  Wei Zhang,et al.  Decision forest analysis of large-scale sib-pair identical-by-decent profiles for locating the underlying disease genes for alcoholism in human. , 2006, Beijing da xue xue bao. Yi xue ban = Journal of Peking University. Health sciences.

[57]  Stefan Wuchty,et al.  Stable evolutionary signal in a Yeast protein interaction network , 2006, BMC Evolutionary Biology.

[58]  Blaz Zupan,et al.  SNPsyn: Detection and exploration of SNP–SNP interactions , 2011 .

[59]  Celia MT Greenwood,et al.  A genome scan for parent-of-origin linkage effects in alcoholism , 2005, BMC Genetics.

[60]  E. Ciani,et al.  Haplotype association analysis of meat quality traits at the bovine PRKAG3 locus , 2007 .

[61]  Dai Zhang,et al.  Two-stage designs to identify the effects of SNP combinations on complex diseases , 2008, Journal of Human Genetics.

[62]  J. Locker,et al.  Rare NRXN1 promoter variants in patients with schizophrenia , 2010, Neuroscience Letters.

[63]  R. Radcliffe,et al.  Neurotensin levels in specific brain regions and hypnotic sensitivity to ethanol and pentobarbital as a function of time after haloperidol administration in selectively bred rat lines. , 2001, The Journal of pharmacology and experimental therapeutics.

[64]  Michael W. Miller,et al.  Ethanol enhances erbB-mediated migration of human breast cancer cells in culture , 2000, Breast Cancer Research and Treatment.

[65]  Qifang Liu,et al.  Align human interactome with phenome to identify causative genes and networks underlying disease families , 2009, Bioinform..

[66]  Lester L. Peters,et al.  Genome-wide association study identifies novel breast cancer susceptibility loci , 2007, Nature.