Integrative analysis of gene expression and methylation data for breast cancer cell lines

BackgroundThe deadly costs of cancer and necessity for an accurate method of early cancer detection have demanded the identification of genetic and epigenetic factors associated with cancer. DNA methylation, an epigenetic event, plays an important role in cancer susceptibility. In this paper, we use DNA methylation and gene expression data integration and pathway analysis to further explore and understand the complex relationship between methylation and gene expression.ResultsThrough linear modeling and analysis of variance, we obtain genes that show a significant correlation between methylation and gene expression. We then examine the functions and relationships of these genes using bioinformatic tools and databases. In particular, using ConsensusPathDB, we analyze the networks of statistically significant genes to identify hub genes, genes with a large number of links to other genes. We identify eight major hub genes, all in strong association with cancer susceptibility. Through further analysis of the function, gene expression level, and methylation level of these hub genes, we conclude that they are novel potential biomarkers for breast cancer.ConclusionsOur findings have various implications for cancer screening, early detection methods, and potential novel treatments for cancer. Researchers can also use our results to develop more effective methods for cancer study.

[1]  Ralf Herwig,et al.  ConsensusPathDB: toward a more complete picture of cell biology , 2010, Nucleic Acids Res..

[2]  Wen-Lin Kuo,et al.  A collection of breast cancer cell lines for the study of functionally distinct cancer subtypes. , 2006, Cancer cell.

[3]  angesichts der Corona-Pandemie,et al.  UPDATE , 1973, The Lancet.

[4]  Regina M. Santella,et al.  Promoter Hypermethylation in White Blood Cell DNA and Breast Cancer Risk , 2015, Journal of Cancer.

[5]  E. Maher,et al.  DNA methylation: a form of epigenetic control of gene expression , 2010 .

[6]  Lana X. Garmire,et al.  More Is Better: Recent Progress in Multi-Omics Data Integration Methods , 2017, Front. Genet..

[7]  Tim Hui-Ming Huang,et al.  Applications of CpG island microarrays for high-throughput analysis of DNA methylation. , 2002, The Journal of nutrition.

[8]  Pearlly Yan,et al.  Preprocessing differential methylation hybridization microarray data , 2011, BioData Mining.

[9]  Raj Chari,et al.  An integrative multi-dimensional genetic and epigenetic strategy to identify aberrant genes and pathways in cancer , 2010, BMC Systems Biology.

[10]  Ralf Herwig,et al.  ConsensusPathDB—a database for integrating human functional interaction networks , 2008, Nucleic Acids Res..

[11]  T. Huang,et al.  Methylation profiling of CpG islands in human breast cancer cells. , 1999, Human molecular genetics.

[12]  Shili Lin,et al.  Differential methylation hybridization: profiling DNA methylation with a high-density CpG island microarray. , 2009, Methods in molecular biology.

[13]  Terry Speed,et al.  Normalization of cDNA microarray data. , 2003, Methods.

[14]  Krishna R. Kalari,et al.  Integrated Analysis of Gene Expression, CpG Island Methylation, and Gene Copy Number in Breast Cancer Cells by Deep Sequencing , 2011, PloS one.

[15]  Sampsa Hautaniemi,et al.  CNAmet: an R package for integrating copy number, methylation and expression data , 2011, Bioinform..

[16]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Guanghua Xiao,et al.  Epigenetic change detection and pattern recognition via Bayesian hierarchical hidden Markov models , 2013, Statistics in medicine.

[18]  Susan J. Clark,et al.  Breast cancer epigenetics: normal human mammary epithelial cells as a model system , 2008, Journal of Molecular Medicine.

[19]  Shuying Sun,et al.  Identifying DNA Methylation Variation Patterns to Obtain Potential Breast Cancer Biomarker Genes , 2015 .

[20]  David Gomez-Cabrero,et al.  Data integration in the era of omics: current and future challenges , 2014, BMC Systems Biology.

[21]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[22]  Guanghua Xiao,et al.  Modeling Three-Dimensional Chromosome Structures Using Gene Expression Data , 2011, Journal of the American Statistical Association.

[23]  Ralf Herwig,et al.  The ConsensusPathDB interaction database: 2013 update , 2012, Nucleic Acids Res..

[24]  N. Davidson,et al.  DNA methylation in breast cancer. , 2001, Endocrine-related cancer.

[25]  Shi-Hua Zhang,et al.  Identifying multi-layer gene regulatory modules from multi-dimensional genomic data , 2012, Bioinform..