XWAS: A Software Toolset for Genetic Data Analysis and Association Studies of the X Chromosome.

XWAS is a new software suite for the analysis of the X chromosome in association studies and similar genetic studies. The X chromosome plays an important role in human disease and traits of many species, especially those with sexually dimorphic characteristics. Special attention needs to be given to its analysis due to the unique inheritance pattern, which leads to analytical complications that have resulted in the majority of genome-wide association studies (GWAS) either not considering X or mishandling it with toolsets that had been designed for non-sex chromosomes. We hence developed XWAS to fill the need for tools that are specially designed for analysis of X. Following extensive, stringent, and X-specific quality control, XWAS offers an array of statistical tests of association, including: 1) the standard test between a SNP (single nucleotide polymorphism) and disease risk, including after first stratifying individuals by sex, 2) a test for a differential effect of a SNP on disease between males and females, 3) motivated by X-inactivation, a test for higher variance of a trait in heterozygous females as compared with homozygous females, and 4) for all tests, a version that allows for combining evidence from all SNPs across a gene. We applied the toolset analysis pipeline to 16 GWAS datasets of immune-related disorders and 7 risk factors of coronary artery disease, and discovered several new X-linked genetic associations. XWAS will provide the tools and incentive for others to incorporate the X chromosome into GWAS and similar studies in any species with an XX/XY system, hence enabling discoveries of novel loci implicated in many diseases and in their sexual dimorphism.

[1]  Alon Keinan,et al.  X-inactivation informs variance-based testing for X-linked association of a quantitative trait , 2015, BMC Genomics.

[2]  Aaron J. Sams,et al.  Accounting for eXentricities: Analysis of the X Chromosome in GWAS Reveals X-Linked Genes Implicated in Autoimmune Diseases , 2014, bioRxiv.

[3]  J. Abbott,et al.  Sex differences in disease genetics: evidence, evolution, and detection. , 2014, Trends in genetics : TIG.

[4]  A. Siepel,et al.  Contrasting X-linked and autosomal diversity across 14 human populations. , 2014, American journal of human genetics.

[5]  Matti Pirinen,et al.  Chromosome X-Wide Association Study Identifies Loci for Fasting Insulin and Height and Evidence for Incomplete Dosage Compensation , 2014, PLoS genetics.

[6]  Peggy Hall,et al.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..

[7]  Masato Kimura,et al.  NCBI’s Database of Genotypes and Phenotypes: dbGaP , 2013, Nucleic Acids Res..

[8]  Jonathan M. Eby,et al.  A central role for inducible heat‐shock protein 70 in autoimmune vitiligo , 2013, Experimental dermatology.

[9]  Xi Jin,et al.  Association between Helicobacter Pylori Infection and Ulcerative Colitis-A Case Control Study from China , 2013, International journal of medical sciences.

[10]  A. Abdou,et al.  Immunohistochemical expression of heat shock protein 70 in vitiligo. , 2013, Annals of diagnostic pathology.

[11]  T. Manolio,et al.  eXclusion: toward integrating the X chromosome in genome-wide association analyses. , 2013, American journal of human genetics.

[12]  Andrew G. Clark,et al.  Gene-Based Testing of Interactions in Association Studies of Quantitative Traits , 2013, PLoS genetics.

[13]  Carl D Langefeld,et al.  Age of onset of amyotrophic lateral sclerosis is modulated by a locus on 1p34.1 , 2013, Neurobiology of Aging.

[14]  C. Disteche Dosage compensation of the sex chromosomes. , 2012, Annual review of genetics.

[15]  J. Marchini,et al.  Fast and accurate genotype imputation in genome-wide association studies through pre-phasing , 2012, Nature Genetics.

[16]  T. Thornton,et al.  XM: Association Testing on the X‐Chromosome in Case‐Control Samples With Related Individuals , 2012, Genetic epidemiology.

[17]  Peter Kraft,et al.  Analysis of case-control association studies with known risk variants , 2012, Bioinform..

[18]  Peter H. R. Green,et al.  Association Analysis of the Extended MHC Region in Celiac Disease Implicates Multiple Independent Susceptibility Loci , 2012, PloS one.

[19]  Jo Lambert,et al.  Genome-wide association analyses identify 13 new susceptibility loci for generalized vitiligo , 2012, Nature Genetics.

[20]  I. L. Le Poole,et al.  HSP70i is a critical component of the immune response leading to vitiligo , 2012, Pigment cell & melanoma research.

[21]  Paul Weston,et al.  Interaction between ERAP1 and HLA-B27 in ankylosing spondylitis implicates peptide handling in the mechanism for HLA-B27 in disease susceptibility , 2011, Nature Genetics.

[22]  Simon C. Potter,et al.  Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis , 2011, Nature.

[23]  Yijun Zuo,et al.  A powerful truncated tail strength method for testing multiple null hypotheses in one dataset. , 2011, Journal of theoretical biology.

[24]  Ian M. Wilson,et al.  Chromosome-wide DNA methylation analysis predicts human tissue-specific X inactivation , 2011, Human Genetics.

[25]  Andreas Ziegler,et al.  Association Tests for X-Chromosomal Markers – A Comparison of Different Test Statistics , 2011, Human Heredity.

[26]  R. Spritz,et al.  Comprehensive association analysis of candidate genes for generalized vitiligo supports XBP1, FOXP3, and TSLP. , 2011, The Journal of investigative dermatology.

[27]  Eric Boerwinkle,et al.  The Atherosclerosis Risk in Communities (ARIC) Study , 2011 .

[28]  R. Voskuhl Sex differences in autoimmune diseases , 2011, Biology of Sex Differences.

[29]  J. Felsenstein,et al.  Estimators of the human effective sex ratio detect sex biases on different timescales. , 2010, American journal of human genetics.

[30]  David Heckerman,et al.  Chromosome 9p21 in amyotrophic lateral sclerosis in Finland: a genome-wide association study , 2010, The Lancet Neurology.

[31]  August E. Woerner,et al.  The ratio of human X chromosome to autosome diversity is positively correlated with genetic distance from genes , 2010, Nature Genetics.

[32]  D. Reich,et al.  Can a sex-biased human demography account for the reduced effective population size of chromosome X in non-Africans? , 2010, Molecular biology and evolution.

[33]  P. Visscher,et al.  A versatile gene-based test for genome-wide association studies. , 2010, American journal of human genetics.

[34]  Yun Li,et al.  METAL: fast and efficient meta-analysis of genomewide association scans , 2010, Bioinform..

[35]  Peter Kraft,et al.  Genetic variants at 2q24 are associated with susceptibility to type 2 diabetes. , 2010, Human molecular genetics.

[36]  Jeremiah D. Degenhardt,et al.  Sex-averaged recombination and mutation rates on the X chromosome: a comment on Labuda et al. , 2010, American journal of human genetics.

[37]  M. Dave,et al.  Association between Helicobacter pylori infection and inflammatory bowel disease: A meta‐analysis and systematic review of the literature† , 2010, Inflammatory bowel diseases.

[38]  Jo Lambert,et al.  Variant of TYR and autoimmunity susceptibility loci in generalized vitiligo. , 2010, The New England journal of medicine.

[39]  R. Qiu,et al.  Highly punctuated patterns of population structure on the X chromosome and implications for African evolutionary history. , 2010, American journal of human genetics.

[40]  D. Clayton,et al.  Sex chromosomes and genetic association studies , 2009, Genome Medicine.

[41]  P. Donnelly,et al.  Genome-wide association study of ulcerative colitis identifies three new susceptibility loci, including the HNF4A region , 2010 .

[42]  Melissa A. Wilson,et al.  Genomic analyses of sex chromosome evolution. , 2009, Annual review of genomics and human genetics.

[43]  Michael R. Johnson,et al.  Genome-wide association analysis of susceptibility and clinical phenotype in multiple sclerosis. , 2009, Human molecular genetics.

[44]  Pui-Yan Kwok,et al.  Genomewide Scan Reveals Association of Psoriasis with IL-23 and NF-κB Pathways , 2008, Nature Genetics.

[45]  Yoav Gilad,et al.  Sex-specific genetic architecture of human disease , 2008, Nature Reviews Genetics.

[46]  Joshua M. Korn,et al.  Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs , 2008, Nature Genetics.

[47]  August E. Woerner,et al.  Sex-Biased Evolutionary Forces Shape Genomic Patterns of Human Diversity , 2008, PLoS genetics.

[48]  D. Clayton,et al.  Testing for association on the X chromosome , 2008, Biostatistics.

[49]  Orla Hardiman,et al.  A genome-wide association study of sporadic ALS in a homogenous Irish population. , 2007, Human molecular genetics.

[50]  Gang Zheng,et al.  Testing association for markers on the X chromosome , 2007, Genetic epidemiology.

[51]  K. Sirotkin,et al.  The NCBI dbGaP database of genotypes and phenotypes , 2007, Nature Genetics.

[52]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[53]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[54]  C. Eun A Genome-Wide Association Study Identifies IL23R as an Inflammatory Bowel Disease Gene. , 2007 .

[55]  Judy H. Cho,et al.  A Genome-Wide Association Study Identifies IL23R as an Inflammatory Bowel Disease Gene , 2006, Science.

[56]  G. Matanoski,et al.  Demographics and tumor characteristics of colorectal cancers in the United States, 1998–2001 , 2006, Cancer.

[57]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[58]  R. Cummings,et al.  Protein glycosylation: Chaperone mutation in Tn syndrome , 2005, Nature.

[59]  H. Willard,et al.  X-inactivation profile reveals extensive variability in X-linked gene expression in females , 2005, Nature.

[60]  R. Fisher Statistical methods for research workers , 1927, Protoplasma.

[61]  B S Weir,et al.  Truncated product method for combining P‐values , 2002, Genetic epidemiology.

[62]  E. Verdin,et al.  BCoR, a novel corepressor involved in BCL-6 repression. , 2000, Genes & development.

[63]  B. Diamond,et al.  Autoimmune diseases , 2000, Bone Marrow Transplantation.

[64]  P. L. Reddy,et al.  Alpha thalassemia and its impact on other clinical conditions. , 1997, Clinics in laboratory medicine.

[65]  E. Wynder,et al.  Gender differences in smoking and risk for oral cancer. , 1996, Cancer research.

[66]  A. Folsom,et al.  The Atherosclerosis Risk in Communities (ARIC) Study: design and objectives. The ARIC investigators. , 1989, American journal of epidemiology.

[67]  W. Kannel,et al.  Patterns of coronary heart disease morbidity and mortality in the sexes: a 26-year follow-up of the Framingham population. , 1986, American heart journal.

[68]  Morton B. Brown,et al.  Robust Tests for the Equality of Variances , 1974 .

[69]  H. Levene Robust tests for equality of variances , 1961 .

[70]  Glenn L. McConagha,et al.  The American soldier , 1949 .

[71]  M. Kendall Statistical Methods for Research Workers , 1937, Nature.

[72]  Billy I. Ross,et al.  The American Soldier. , 1898 .