Identification of putative cancer genes through data integration and comparative genomics between plants and humans

Coordination of cell division with growth and development is essential for the survival of organisms. Mistakes made during replication of genetic material can result in cell death, growth defects, or cancer. Because the molecular machinery that controls DNA replication and mitosis plays an essential role during development, its high degree of conservation among organisms is not surprising. Mammalian cell cycle genes have orthologues in plants, and vice versa. However, besides the many known and characterized proliferation genes, additional, as yet undiscovered regulatory genes with conserved functions in plants and humans are expected to exist. Starting from genome-wide Arabidopsis thaliana microarray data, an integrative strategy based on coexpression, functional enrichment analysis, and cis-regulatory element annotation was combined with a comparative genomics approach between plants and humans to detect conserved cell cycle genes involved in DNA replication and/or DNA repair. With this systemic strategy, a set of 339 genes was identified as potentially conserved proliferation genes. Experimental analysis confirmed that 20 out of 40 selected genes had an impact on plant cell proliferation; likewise, an evolutionarily conserved role in cell division was corroborated for two human orthologues. Moreover, association analysis integrating Homo sapiens gene expression data with clinical information revealed that, for 45 genes, altered transcript levels correlated clearly with relapse risk. Our results illustrate how a systematic exploration of the A. thaliana genome can contribute to the experimental identification of new cell cycle regulators that might represent novel oncogenes and/or tumor suppressors.
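The coexpression and functional enrichment steps named above can be illustrated with a minimal sketch. It is not the authors' pipeline: the abstract does not state which correlation measure, threshold, or enrichment statistic was used, so the Pearson correlation, the 0.9 cutoff, the hypergeometric test, and all function and gene names below are assumptions chosen for illustration.

```python
from math import comb, sqrt


def pearson(x, y):
    """Pearson correlation between two equally long expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return 0.0  # a flat profile carries no coexpression signal
    return cov / (sx * sy)


def coexpressed_with(seed, expression, threshold=0.9):
    """Genes whose profile correlates with the seed gene at or above threshold.

    `expression` maps gene name -> list of expression values (hypothetical
    layout); `threshold` is an assumed cutoff, not a value from the paper.
    """
    ref = expression[seed]
    return {
        gene
        for gene, profile in expression.items()
        if gene != seed and pearson(ref, profile) >= threshold
    }


def enrichment_p_value(k, n, K, N):
    """Hypergeometric upper tail P(X >= k).

    N: genes in the genome, K: genes carrying the annotation,
    n: genes in the coexpression cluster, k: annotated genes in the cluster.
    """
    upper = min(n, K)
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return hits / comb(N, n)


# Toy data: four hypothetical genes measured in four conditions.
expression = {
    "seed": [1, 2, 3, 4],
    "geneA": [2, 4, 6, 8],   # rises with the seed gene
    "geneB": [4, 3, 2, 1],   # anticorrelated with the seed gene
    "geneC": [1, 1, 2, 1],   # weakly correlated
}
cluster = coexpressed_with("seed", expression)
p_value = enrichment_p_value(k=1, n=1, K=1, N=4)
```

In a real analysis the resulting cluster would additionally be filtered by cis-regulatory element annotation and by orthology between plant and human genes, as the abstract describes; those steps depend on external annotation resources and are not sketched here.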

[37]  F. Couch,et al.  Structural analysis of the 17q22-23 amplicon identifies several independent targets of amplification in breast cancer cell lines and tumors. , 2001, Cancer research.

[38]  Jonathan D. G. Jones,et al.  The plant immune system , 2006, Nature.

[39]  D. Inzé,et al.  Control of proliferation, endoreduplication and differentiation by the Arabidopsis E2Fa–DPa transcription factor , 2002, The EMBO journal.

[40]  E. Liu,et al.  Expression genomics in breast cancer research: microarrays at the crossroads of biology and medicine , 2007, Breast Cancer Research.

[41]  Rosario M. Piro,et al.  Prediction of Human Disease Genes by Human-Mouse Conserved Coexpression Analysis , 2008, PLoS Comput. Biol..

[42]  Kathleen Marchal,et al.  A Gibbs sampling method to detect over-represented motifs in the upstream regions of co-expressed genes , 2001, RECOMB.

[43]  Dirk Inzé,et al.  The MCM-Binding Protein ETG1 Aids Sister Chromatid Cohesion Required for Postreplicative Homologous Recombination Repair , 2010, PLoS genetics.

[44]  Peter A. Jones,et al.  Cancer-epigenetics comes of age , 1999, Nature Genetics.

[45]  A. Jemal,et al.  Cancer Statistics, 2010 , 2010, CA: a cancer journal for clinicians.

[46]  Y. Ben-Neriah,et al.  NF-κB functions as a tumour promoter in inflammation-associated cancer , 2004, Nature.

[47]  N. Dubrawsky Cancer statistics , 1989, CA: a cancer journal for clinicians.

[48]  Hong Ma GTP-binding proteins in plants: new members of an old family , 1994, Plant Molecular Biology.

[49]  Rebecca L Poole The TAIR database. , 2007, Methods in molecular biology.

[50]  D. Inzé,et al.  The DNA replication checkpoint aids survival of plants deficient in the novel replisome factor ETG1 , 2008, The EMBO journal.

[51]  H. Pehamberger,et al.  NF-kappaB is essential for epithelial-mesenchymal transition and metastasis in a model of breast cancer progression. , 2004, The Journal of clinical investigation.

[52]  T. Hudson,et al.  A missense mutation (R565W) in cirhin (FLJ14728) in North American Indian childhood cirrhosis. , 2002, American journal of human genetics.

[53]  P. Brown,et al.  Large-scale meta-analysis of cancer microarray data identifies common transcriptional profiles of neoplastic transformation and progression. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[54]  Jaume Bertranpetit,et al.  Comparative analysis of cancer genes in the human and chimpanzee genomes , 2006, BMC Genomics.

[55]  G. Mills,et al.  Genome-wide association scan of tag SNPs identifies a susceptibility locus for lung cancer at 15q25.1 , 2008, Nature Genetics.

[56]  J. Xu,et al.  Ribosomal proteins and colorectal cancer. , 2007, Current genomics.

[57]  J. Nevins,et al.  The Rb/E2F pathway and cancer. , 2001, Human molecular genetics.

[58]  J. G. Cory,et al.  Use of an aqueous soluble tetrazolium/formazan assay for cell growth assays in culture. , 1991, Cancer communications.

[59]  Y. Pekarsky,et al.  FHIT: from gene discovery to cancer treatment and prevention. , 2002, The Lancet. Oncology.

[60]  M. Méchali,et al.  MCM-BP regulates unloading of the MCM2-7 helicase in late S phase. , 2011, Genes & development.

[61]  M. Verma,et al.  Proteomics for cancer biomarker discovery. , 2002, Clinical chemistry.

[62]  L. Hartwell Role of yeast in cancer research , 1992 .

[63]  Klaas Vandepoele,et al.  Unraveling Transcriptional Control in Arabidopsis Using cis-Regulatory Elements and Coexpression Networks1[C][W] , 2009, Plant Physiology.

[64]  L. Liotta,et al.  Reduced Nm23/Awd protein in tumour metastasis and aberrant Drosophila development , 1989, Nature.

[65]  K. Bhat,et al.  An ARF-independent c-MYC-activated tumor suppression pathway mediated by ribosomal protein-Mdm2 Interaction. , 2010, Cancer cell.

[66]  Lester L. Peters,et al.  Genome-wide association study identifies novel breast cancer susceptibility loci , 2007, Nature.

[67]  F. McCormick,et al.  The RB and p53 pathways in cancer. , 2002, Cancer cell.

[68]  D. Greiner,et al.  Humanized SCID mouse models for biomedical research. , 2008, Current topics in microbiology and immunology.

[69]  M. Roizen,et al.  Hallmarks of Cancer: The Next Generation , 2012 .

[70]  Alan M. Jones,et al.  The Impact of Arabidopsis on Human Health: Diversifying Our Portfolio , 2008, Cell.

[71]  M. Matzke,et al.  RNA-based silencing strategies in plants. , 2001, Current opinion in genetics & development.

[72]  Daniel A. Haber,et al.  Archipelago regulates Cyclin E levels in Drosophila and is mutated in human cancer cell lines , 2001, Nature.

[73]  G. Evan,et al.  Proliferation, cell cycle and apoptosis in cancer , 2001, Nature.

[74]  Dirk Inzé,et al.  Genome-Wide Identification of Potential Plant E2F Target Genes1[w] , 2005, Plant Physiology.

[75]  P. Fearnhead,et al.  Genome-wide association study of prostate cancer identifies a second risk locus at 8q24 , 2007, Nature Genetics.

[76]  E. Caussinus,et al.  Induction of tumor growth by altered stem-cell asymmetric division in Drosophila melanogaster , 2005, Nature Genetics.