Detection of disease-associated deletions in case–control studies using SNP genotypes with application to rheumatoid arthritis

Genomic deletions have long been known to play a causative role in microdeletion syndromes. Recent whole-genome genetic studies have shown that deletions can increase the risk for several psychiatric disorders, suggesting that genomic deletions play an important role in the genetic basis of complex traits. However, the association between genomic deletions and common, complex diseases has not yet been systematically investigated in gene mapping studies. Likelihood-based statistical methods for identifying disease-associated deletions have recently been developed for familial studies of parent-offspring trios. The purpose of this study is to develop statistical approaches for detecting genomic deletions associated with complex disease in case–control studies. Our methods are designed to be used with dense single nucleotide polymorphism (SNP) genotypes to detect deletions in large-scale or whole-genome genetic studies. As more and more SNP genotype data for genome-wide association studies become available, development of sophisticated statistical approaches will be needed that use these data. Our proposed statistical methods are designed to be used in SNP-by-SNP analyses and in cluster analyses based on combined evidence from multiple SNPs. We found that these methods are useful for detecting disease-associated deletions and are robust in the presence of linkage disequilibrium using simulated SNP data sets. Furthermore, we applied the proposed statistical methods to SNP genotype data of chromosome 6p for 868 rheumatoid arthritis patients and 1,197 controls from the North American Rheumatoid Arthritis Consortium. We detected disease-associated deletions within the region of human leukocyte antigen in which genomic deletions were previously discovered in rheumatoid arthritis patients.

[1]  R. Redon,et al.  Relative Impact of Nucleotide and Copy Number Variation on Gene Expression Phenotypes , 2007, Science.

[2]  William Wheeler,et al.  Probability of detecting disease-associated single nucleotide polymorphisms in case-control genome-wide association studies. , 2008, Biostatistics.

[3]  C. Hoggart,et al.  Sequence-Level Population Simulations Over Large Genomic Regions , 2007, Genetics.

[4]  Michael R. Speicher,et al.  Digital karyotyping , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[5]  S. Beck,et al.  Sequence organisation of the class II region of the human MHC , 1999, Immunological reviews.

[6]  Marek Kimmel,et al.  Forward-Time Simulations of Human Populations with Complex Diseases , 2007, PLoS genetics.

[7]  P. Donnelly,et al.  The Fine-Scale Structure of Recombination Rate Variation in the Human Genome , 2004, Science.

[8]  J. Strominger,et al.  Structural organization of the DR subregion of the human major histocompatibility complex. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[9]  G. Abecasis,et al.  Genetic variation in the 22q11 locus and susceptibility to schizophrenia , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[10]  E. Lindsay Chromosomal microdeletions: dissecting del22q11 syndrome , 2001, Nature reviews genetics.

[11]  L. Guyant‐Maréchal APP locus duplication causes Alzheimer disease with cerebral amyloid angiopathy , 2009, Journal of Neurological Sciences.

[12]  J. Stockman Association between Microdeletion and Microduplication at 16p11.2 and Autism , 2009 .

[13]  Ellen M Wijsman,et al.  Presence of large deletions in kindreds with autism. , 2002, American journal of human genetics.

[14]  S. Shete,et al.  Statistical Methods for Anomalous Discrete Time Series Based on Minimum Cell Count , 2008, Biometrical journal. Biometrische Zeitschrift.

[15]  J. Wall,et al.  When did the human population size start increasing? , 2000, Genetics.

[16]  S. Gabriel,et al.  Calibrating a coalescent simulation of human genome sequence variation. , 2005, Genome research.

[17]  S. Shete,et al.  Positional Identification of Microdeletions with Genetic Markers , 2003, Human Heredity.

[18]  P. Gregersen,et al.  The genetics revolution and the assault on rheumatoid arthritis. , 1999, Arthritis and rheumatism.

[19]  Thomas W. Mühleisen,et al.  Large recurrent microdeletions associated with schizophrenia , 2008, Nature.

[20]  C I Amos,et al.  A genomewide screen in multiplex rheumatoid arthritis families suggests genetic overlap with other autoimmune diseases. , 2001, American journal of human genetics.

[21]  D. Cutler,et al.  Simultaneous discovery and testing of deletions for disease association in SNP genotyping studies. , 2007, American journal of human genetics.

[22]  D. Conrad,et al.  Global variation in copy number in the human genome , 2006, Nature.

[23]  R. Ophoff,et al.  Detection, imputation, and association analysis of small deletions and null alleles on oligonucleotide arrays. , 2008, American journal of human genetics.

[24]  G. Gyapay,et al.  New susceptibility locus for rheumatoid arthritis suggested by a genome-wide linkage study. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[25]  D. Conrad,et al.  A high-resolution survey of deletion polymorphism in the human genome , 2006, Nature Genetics.

[26]  S Wallenstein,et al.  An approximation for the distribution of the scan statistic. , 1987, Statistics in medicine.

[27]  Pardis C Sabeti,et al.  Common deletion polymorphisms in the human genome , 2006, Nature Genetics.

[28]  R. Grimson,et al.  A Method for Detecting Current Temporal Clusters of Toxic Events Through Data Monitoring by Poison Control Centers , 2000, Journal of toxicology. Clinical toxicology.

[29]  Paul Wordsworth,et al.  Whole-genome linkage analysis of rheumatoid arthritis susceptibility loci in 252 affected sibling pairs in the United Kingdom. , 2002, Arthritis and rheumatism.

[30]  Xiangli Xiao,et al.  Screening the genome for rheumatoid arthritis susceptibility genes: a replication study and combined analysis of 512 multicase families. , 2003, Arthritis and rheumatism.

[31]  K. Frazer,et al.  Common deletions and SNPs are in linkage disequilibrium in the human genome , 2006, Nature Genetics.

[32]  R. Redon,et al.  Copy Number Variation: New Insights in Genome Diversity References , 2006 .

[33]  E. Eichler,et al.  Fine-scale structural variation of the human genome , 2005, Nature Genetics.

[34]  Anat Levit,et al.  A complete genetic association scan of the 22q11 deletion region and functional evidence reveal an association between DGCR2 and schizophrenia , 2006, Human Genetics.

[35]  G. Galbraith,et al.  TRAF1-C5 as a Risk Locus for Rheumatoid Arthritis—A Genomewide Study , 2008 .

[36]  R. Grimson Disease clusters, exact distributions of maxima, and P-values. , 1993, Statistics in medicine.

[37]  Carolyn J. Brown,et al.  A comprehensive analysis of common copy-number variations in the human genome. , 2007, American journal of human genetics.

[38]  Kenny Q. Ye,et al.  Strong Association of De Novo Copy Number Mutations with Autism , 2007, Science.