Epigenome-wide association studies without the need for cell-type composition

In epigenome-wide association studies, cell-type composition often differs between cases and controls, yielding associations that simply tag cell type rather than reveal fundamental biology. Current solutions require actual or estimated cell-type composition—information not easily obtainable for many samples of interest. We propose a method, FaST-LMM-EWASher, that automatically corrects for cell-type composition without the need for explicit knowledge of it, and then validate our method by comparison with the state-of-the-art approach. Corresponding software is available from http://www.microsoft.com/science/.

[1]  R. Pearl Biometrics , 1914, The American Naturalist.

[2]  Timo Malm,et al.  Validation by analysis , 1998 .

[3]  K. Roeder,et al.  Genomic Control for Association Studies , 1999, Biometrics.

[4]  L. Wasserman,et al.  Genomic control, a new approach to genetic-based association studies. , 2001, Theoretical population biology.

[5]  Gui-yuan Li,et al.  Expression of LRRC4 has the potential to decrease the growth rate and tumorigenesis of glioblastoma cell line U251. , 2003, Ai zheng = Aizheng = Chinese journal of cancer.

[6]  D. Balding A tutorial on statistical methods for population association studies , 2006, Nature Reviews Genetics.

[7]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[8]  G. Watkins,et al.  Expression of Interleukin 11 and Its Receptor and Their Prognostic Value in Human Breast Cancer , 2006, Annals of Surgical Oncology.

[9]  BMC Bioinformatics , 2005 .

[10]  John D. Storey,et al.  Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis , 2007, PLoS genetics.

[11]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[12]  N. Ahuja,et al.  Is There a Difference in Survival Between Right- Versus Left-Sided Colon Cancers? , 2008, Annals of Surgical Oncology.

[13]  S. Zukin,et al.  Epigenetics , 2009, Alzheimer's & Dementia.

[14]  W. V. van IJcken,et al.  CITED2 and NCOR2 in anti-oestrogen resistance and progression of breast cancer , 2009, British Journal of Cancer.

[15]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[16]  D. Philpott,et al.  NLRC5 Limits the Activation of Inflammatory Pathways , 2010, Journal of Immunology.

[17]  Alkes L. Price,et al.  New approaches to population stratification in genome-wide association studies , 2010, Nature Reviews Genetics.

[18]  H. Kang,et al.  Variance component model to account for sample structure in genome-wide association studies , 2010, Nature Genetics.

[19]  William M. Grady,et al.  Epigenetics and colorectal cancer , 2011, Nature Reviews Gastroenterology &Hepatology.

[20]  D. Balding,et al.  Epigenome-wide association studies for common human diseases , 2011, Nature Reviews Genetics.

[21]  Ying Liu,et al.  FaST linear mixed models for genome-wide association studies , 2011, Nature Methods.

[22]  Mark I McCarthy,et al.  Genomic inflation factors under polygenic inheritance , 2011, European Journal of Human Genetics.

[23]  Yingsong Wu,et al.  Aberrant methylation and downregulation of sall3 in human hepatocellular carcinoma. , 2012, World journal of gastroenterology.

[24]  K. Heichman,et al.  DNA methylation biomarkers and their utility for solid cancer diagnostics , 2012, Clinical chemistry and laboratory medicine.

[25]  Eldon Emberly,et al.  Factors underlying variable DNA methylation in a human community cohort , 2012, Proceedings of the National Academy of Sciences.

[26]  Gaoxi Xiao,et al.  RUNX3 acts as a tumor suppressor in breast cancer by targeting estrogen receptor α , 2012, Oncogene.

[27]  Nan Wang,et al.  Prognostic impact of Metadherin–SND1 interaction in colon cancer , 2012, Molecular Biology Reports.

[28]  R. Millikan,et al.  &mgr;-Opioid Receptor Gene A118G Polymorphism Predicts Survival in Patients with Breast Cancer , 2012, Anesthesiology.

[29]  J. Kere,et al.  Differential DNA Methylation in Purified Human Blood Cells: Implications for Cell Lineage and Studies on Disease Susceptibility , 2012, PloS one.

[30]  Devin C. Koestler,et al.  DNA methylation arrays as surrogate measures of cell mixture distribution , 2012, BMC Bioinformatics.

[31]  Eleazar Eskin,et al.  Improved linear mixed models for genome-wide association studies , 2012, Nature Methods.

[32]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumors , 2012, Nature.

[33]  Martin J. Aryee,et al.  Epigenome-wide association data implicate DNA methylation as an intermediary of genetic risk in Rheumatoid Arthritis , 2013, Nature Biotechnology.

[34]  Margaret R Karagas,et al.  Blood-based profiles of DNA methylation predict the underlying distribution of cell types , 2013, Epigenetics.

[35]  David K. Meyerholz,et al.  Ectopic expression of Zmiz1 induces cutaneous squamous cell malignancies in a mouse model of cancer , 2013, The Journal of investigative dermatology.

[36]  David Heckerman,et al.  The benefits of selecting phenotype-specific variants for applications of mixed models in genomics , 2013, Scientific Reports.