Identifying differentially methylated genes using mixed effect and generalized least square models

BackgroundDNA methylation plays an important role in the process of tumorigenesis. Identifying differentially methylated genes or CpG islands (CGIs) associated with genes between two tumor subtypes is thus an important biological question. The methylation status of all CGIs in the whole genome can be assayed with differential methylation hybridization (DMH) microarrays. However, patient samples or cell lines are heterogeneous, so their methylation pattern may be very different. In addition, neighboring probes at each CGI are correlated. How these factors affect the analysis of DMH data is unknown.ResultsWe propose a new method for identifying differentially methylated (DM) genes by identifying the associated DM CGI(s). At each CGI, we implement four different mixed effect and generalized least square models to identify DM genes between two groups. We compare four models with a simple least square regression model to study the impact of incorporating random effects and correlations.ConclusionsWe demonstrate that the inclusion (or exclusion) of random effects and the choice of correlation structures can significantly affect the results of the data analysis. We also assess the false discovery rate of different models using CGIs associated with housekeeping genes.

[1]  Wen-Lin Kuo,et al.  A collection of breast cancer cell lines for the study of functionally distinct cancer subtypes. , 2006, Cancer cell.

[2]  Tim Hui-Ming Huang,et al.  Applications of CpG island microarrays for high-throughput analysis of DNA methylation. , 2002, The Journal of nutrition.

[3]  Sandya Liyanarachchi,et al.  Breast cancer-associated fibroblasts confer AKT1-mediated epigenetic silencing of Cystatin M in epithelial cells. , 2008, Cancer research.

[4]  Carsten Denkert,et al.  Hyperactivation of the insulin-like growth factor receptor I signaling pathway is an essential event for cisplatin resistance of ovarian cancer cells. , 2009, Cancer research.

[5]  T. Huang,et al.  Methylation profiling of CpG islands in human breast cancer cells. , 1999, Human molecular genetics.

[6]  J. Gray,et al.  Gamma-Normal-Gamma Mixture Model for Detecting Differentially Methylated Loci in Three Breast Cancer Cell Lines , 2007, Cancer informatics.

[7]  Tim Hui-Ming Huang,et al.  Differential Methylation Hybridization Array of Endometrial Cancers Reveals Two Novel Cancer-Specific Methylation Markers , 2007, Clinical Cancer Research.

[8]  Peter A. Jones,et al.  The Epigenomics of Cancer , 2007, Cell.

[9]  Stephan Preibisch,et al.  Specific and nonspecific hybridization of oligonucleotide probes on microarrays. , 2004, Biophysical journal.

[10]  Shicai Fan,et al.  CpG island methylation pattern in different human tissues and its correlation with gene expression. , 2009, Biochemical and biophysical research communications.

[11]  David Edwards,et al.  Non-linear Normalization and Background Correction in One-channel CDNA Microarray Studies , 2003, Bioinform..

[12]  Jian Su,et al.  The -271 G>A polymorphism of kinase insert domain-containing receptor gene regulates its transcription level in patients with non-small cell lung cancer , 2009, BMC Cancer.

[13]  Li Hsu,et al.  Tissue-specific variation in DNA methylation levels along human chromosome 1 , 2009, Epigenetics & Chromatin.

[14]  E. Levanon,et al.  Human housekeeping genes are compact. , 2003, Trends in genetics : TIG.

[15]  J. Rogers,et al.  DNA methylation profiling of human chromosomes 6, 20 and 22 , 2006, Nature Genetics.

[16]  Yee Hwa Yang,et al.  Normalization for two-color cDNA microarray data , 2003 .

[17]  D. Delia,et al.  Role of erythropoietin receptor expression in malignant melanoma , 2008 .

[18]  Igor Zwir,et al.  Profile analysis and prediction of tissue-specific CpG island methylation classes , 2009, BMC Bioinformatics.

[19]  D. Bates,et al.  Mixed-Effects Models in S and S-PLUS , 2001 .

[20]  Henrik Bjørn Nielsen,et al.  Improving comparability between microarray probe signals by thermodynamic intensity correction. , 2007, Nucleic acids research.

[21]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[22]  Xue-lin Zhang,et al.  [Evaluation of angiogenesis in the tumorigenesis and progression of breast cancer]. , 2009, Zhonghua wai ke za zhi [Chinese journal of surgery].

[23]  Joel Greshock,et al.  Integrative Genomic Analysis of Phosphatidylinositol 3′-Kinase Family Identifies PIK3R3 as a Potential Therapeutic Target in Epithelial Ovarian Cancer , 2007, Clinical Cancer Research.

[24]  W. Kamps,et al.  Evidence Based Selection of Housekeeping Genes , 2007, PloS one.

[25]  J. Castle,et al.  Definition, conservation and epigenetics of housekeeping and tissue-enriched genes , 2009, BMC Genomics.

[26]  Yu-Quan Wei,et al.  Genetic Vaccines and Therapy , 2009 .

[27]  Hongzhe Li,et al.  A Markov random field model for network-based analysis of genomic data , 2007, Bioinform..

[28]  C. Li,et al.  Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[29]  E. Sauter,et al.  Quantitative evaluation of DNA hypermethylation in malignant and benign breast tissue and fluids , 2010, International journal of cancer.

[30]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[31]  Peter A. Jones,et al.  Chromatin, cancer and drug therapies. , 2008, Mutation research.

[32]  S. Baylin,et al.  Frequent epigenetic silencing of the bone morphogenetic protein 2 gene through methylation in gastric carcinomas , 2006, Oncogene.

[33]  A. Porwit,et al.  MPLW515L mutation in acute megakaryoblastic leukaemia , 2009, Leukemia.

[34]  Wei Pan,et al.  Incorporating prior knowledge of gene functional groups into regularized discriminant analysis of microarray data , 2007, Bioinform..

[35]  Shili Lin,et al.  Differential methylation hybridization: profiling DNA methylation with a high-density CpG island microarray. , 2009, Methods in molecular biology.

[36]  Terry Speed,et al.  Normalization of cDNA microarray data. , 2003, Methods.

[37]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[38]  M. Frommer,et al.  CpG islands in vertebrate genomes. , 1987, Journal of molecular biology.

[39]  A. Feinberg,et al.  Genome-wide methylation analysis of human colon cancer reveals similar hypo- and hypermethylation at conserved tissue-specific CpG island shores , 2008, Nature Genetics.

[40]  Alessandra Marini,et al.  Role of erythropoietin receptor expression in malignant melanoma. , 2010, The Journal of investigative dermatology.

[41]  D. R. Goldstein,et al.  Science and Statistics: A Festschrift for Terry Speed , 2003 .