Heritable clustering and pathway discovery in breast cancer integrating epigenetic and phenotypic data

BackgroundIn order to recapitulate tumor progression pathways using epigenetic data, we developed novel clustering and pathway reconstruction algorithms, collectively referred to as heritable clustering. This approach generates a progression model of altered DNA methylation from tumor tissues diagnosed at different developmental stages. The samples act as surrogates for natural progression in breast cancer and allow the algorithm to uncover distinct epigenotypes that describe the molecular events underlying this process. Furthermore, our likelihood-based clustering algorithm has great flexibility, allowing for incomplete epigenotype or clinical phenotype data and also permitting dependencies among variables.ResultsUsing this heritable clustering approach, we analyzed methylation data obtained from 86 primary breast cancers to recapitulate pathways of breast tumor progression. Detailed annotation and interpretation are provided to the optimal pathway recapitulated. The result confirms the previous observation that aggressive tumors tend to exhibit higher levels of promoter hypermethylation.ConclusionOur results indicate that the proposed heritable clustering algorithms are a useful tool for stratifying both methylation and clinical variables of breast cancer. The application to the breast tumor data illustrates that this approach can select meaningful progression models which may aid the interpretation of pathways having biological and clinical significance. Furthermore, the framework allows for other types of biological data, such as microarray gene expression or array CGH data, to be integrated.

[1]  Andrew P Feinberg,et al.  The epigenetics of cancer etiology. , 2004, Seminars in cancer biology.

[2]  W. Lam,et al.  Chromosome-wide and promoter-specific analyses identify sites of differential DNA methylation in normal and transformed human cells , 2005, Nature Genetics.

[3]  C. Isaacs,et al.  BRCA1 in hormonal carcinogenesis: basic and clinical research. , 2005, Endocrine-related cancer.

[4]  Peter A. Jones,et al.  A blueprint for a Human Epigenome Project: the AACR Human Epigenome Workshop. , 2005, Cancer research.

[5]  Carl T. Bergstrom,et al.  A population-epigenetic model to infer site-specific methylation rates from double-stranded DNA methylation patterns. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Victor X Jin,et al.  Loss of Estrogen Receptor Signaling Triggers Epigenetic Silencing of Downstream Targets in Breast Cancer , 2004, Cancer Research.

[7]  S. Baylin,et al.  DNA methylation and gene silencing in cancer , 2005, Nature Clinical Practice Oncology.

[8]  Martin Widschwendter,et al.  DNA methylation and breast carcinogenesis , 2002, Oncogene.

[9]  Huidong Shi,et al.  (Cancer Res., 61(23):8375-8380)Disecting complex epigenetic alterations in breast cancer using CpG island microarrays , 2001 .

[10]  Susmita Datta,et al.  Comparisons and validation of statistical clustering techniques for microarray gene expression data , 2003, Bioinform..

[11]  J. Franklin,et al.  The elements of statistical learning: data mining, inference and prediction , 2005 .

[12]  Tim Hui-Ming Huang,et al.  Methylation target array for rapid analysis of CpG island hypermethylation in multiple tissue genomes. , 2003, The American journal of pathology.

[13]  C. Streuli,et al.  Apoptosis regulation in the mammary gland , 2004, Cellular and Molecular Life Sciences.

[14]  Feng Jiang,et al.  Distance-Based Reconstruction of Tree Models for Oncogenesis , 2000, J. Comput. Biol..

[15]  C. M. Chen,et al.  Dissecting complex epigenetic alterations in breast cancer using CpG island microarrays. , 2001, Cancer research.

[16]  P. Laird Cancer epigenetics. , 2005, Human molecular genetics.

[17]  Peter A. Jones,et al.  The fundamental role of epigenetic events in cancer , 2002, Nature Reviews Genetics.

[18]  J. Costello,et al.  Comparative epigenomics of leukemia , 2005, Nature Genetics.

[19]  Ellen R. Laird,et al.  Molecular basis for interaction of the protein tyrosine kinase ZAP-70 with the T-cell receptor , 2007, Nature.

[20]  Vijay V. Raghavan,et al.  BitCube: A Three-Dimensional Bitmap Indexing for XML Documents , 2004, Journal of Intelligent Information Systems.

[21]  M. Newton Discovering Combinations of Genomic Aberrations Associated With Cancer , 2002 .

[22]  Jun S. Liu,et al.  Clustering analysis of SAGE data using a Poisson approach , 2004, Genome Biology.

[23]  司履生 Cancer epigenetics , 2006 .

[24]  P. Laird,et al.  CpG island methylator phenotype underlies sporadic microsatellite instability and is tightly associated with BRAF mutation in colorectal cancer , 2006, Nature Genetics.

[25]  A. Bird,et al.  Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals , 2003, Nature Genetics.

[26]  K. Robertson DNA methylation and human disease , 2005, Nature Reviews Genetics.

[27]  Peter W. Laird,et al.  A comparison of cluster analysis methods using DNA methylation data , 2004, Bioinform..