Biological and functional analysis of statistically significant pathways deregulated in colon cancer by using gene expression profiles

Gene expression profiling offers a great opportunity for studying multi-factor diseases and for understanding the key role of genes in mechanisms which drive a normal cell to a cancer state. Single gene analysis is insufficient to describe the complex perturbations responsible for cancer onset, progression and invasion. A deeper understanding of the mechanisms of tumorigenesis can be reached focusing on deregulation of gene sets or pathways rather than on individual genes. We apply two known and statistically well founded methods for finding pathways and biological processes deregulated in pathological conditions by analyzing gene expression profiles. In particular, we measure the amount of deregulation and assess the statistical significance of predefined pathways belonging to a curated collection (Molecular Signature Database) in a colon cancer data set. We find that pathways strongly involved in different tumors are strictly connected with colon cancer. Moreover, our experimental results show that the study of complex diseases through pathway analysis is able to highlight genes weakly connected to the phenotype which may be difficult to detect by using classical univariate statistics. Our study shows the importance of using gene sets rather than single genes for understanding the main biological processes and pathways involved in colorectal cancer. Our analysis evidences that many of the genes involved in these pathways are strongly associated to colorectal tumorigenesis. In this new perspective, the focus shifts from finding differentially expressed genes to identifying biological processes, cellular functions and pathways perturbed in the phenotypic conditions by analyzing genes co-expressed in a given pathway as a whole, taking into account the possible interactions among them and, more importantly, the correlation of their expression with the phenotypical conditions.

[1]  G. Kouraklis,et al.  Replication protein A is an independent prognostic indicator with potential therapeutic implications in colon cancer , 2007, Modern Pathology.

[2]  N. Tsuchida,et al.  Transcriptional activation of the human stress-inducible transcriptional repressor ATF3 gene promoter by p53. , 2002, Biochemical and biophysical research communications.

[3]  R Pingitore,et al.  Neoangiogenesis in colon cancer: correlation between vascular density, vascular endothelial growth factor (VEGF) and p53 protein expression. , 2002, Oncology reports.

[4]  S. Berndt,et al.  Mismatch repair polymorphisms and the risk of colorectal cancer , 2007, International journal of cancer.

[5]  Jing Peng,et al.  SVM vs regularized least squares classification , 2004, ICPR 2004.

[6]  D. Coradini,et al.  Modulation of angiogenesis-related proteins synthesis by sodium butyrate in colon cancer cell line HT29. , 2002, Carcinogenesis.

[7]  D. Hanahan,et al.  The Hallmarks of Cancer , 2000, Cell.

[8]  W. Park,et al.  Chk1 frameshift mutation in sporadic and hereditary non-polyposis colorectal cancers with microsatellite instability. , 2007, European journal of surgical oncology : the journal of the European Society of Surgical Oncology and the British Association of Surgical Oncology.

[9]  N. Petrelli,et al.  aCGH local copy number aberrations associated with overall copy number genomic instability in colorectal cancer: coordinate involvement of the regions including BCR and ABL. , 2007, Mutation research.

[10]  Jeffrey T. Chang,et al.  Oncogenic pathway signatures in human cancers as a guide to targeted therapies , 2006, Nature.

[11]  Y. Nakajima,et al.  Annexin II overexpression correlates with stromal tenascin‐C overexpression , 2001, Cancer.

[12]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[13]  T. Fujita,et al.  Testing of multiple samples increases the sensitivity of stool decay-accelerating factor test for the detection of colorectal cancer , 2003, American Journal of Gastroenterology.

[14]  Teizo Fujita,et al.  Difference in Ulex europaeus agglutinin I-binding activity of decay-accelerating factor detected in the stools of patients with colorectal cancer and ulcerative colitis. , 2004, The Journal of laboratory and clinical medicine.

[15]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[16]  Qi Dong,et al.  Induction of HSF1 expression is associated with sporadic colorectal cancer. , 2004, World journal of gastroenterology.

[17]  M. Bryś,et al.  Genotyping of p53 codon 175 in colorectal cancer. , 2003, Medical science monitor : international medical journal of experimental and clinical research.

[18]  Graziano Pesole,et al.  Statistical assessment of functional categories of genes deregulated in pathological conditions by using microarray data , 2007, Bioinform..

[19]  V. Stigliano,et al.  Activation of c-MYC and c-MYB proto-oncogenes is associated with decreased apoptosis in tumor colon progression. , 2001, Anticancer research.

[20]  G. F. Wagner,et al.  Hypoxia-inducible factor-1-mediated activation of stanniocalcin-1 in human cancer cells. , 2005, Endocrinology.

[21]  J. García,et al.  The GADD45, ZBRK1 and BRCA1 pathway: quantitative analysis of mRNA expression in colon carcinomas , 2005, The Journal of pathology.

[22]  John D. Storey,et al.  Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[23]  L. Aaltonen,et al.  CHEK2 I157T associates with familial and sporadic colorectal cancer , 2005, Journal of Medical Genetics.

[24]  N. Risch Searching for genetic determinants in the new millennium , 2000, Nature.

[25]  T. Fujita,et al.  ADVANCES IN THE DEVELOPMENT OF A RELIABLE ASSAY FOR THE MEASUREMENT OF STOOL DECAY-ACCELERATING FACTOR IN THE DETECTION OF COLORECTAL CANCER , 2002, Journal of immunoassay & immunochemistry.

[26]  Ronald W. Davis,et al.  Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray , 1995, Science.

[27]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[28]  P. Park,et al.  Discovering statistically significant pathways in expression profiling studies. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[29]  Graziano Pesole,et al.  On the statistical assessment of classifiers using DNA microarray data , 2006, BMC Bioinformatics.

[30]  T. Fujita,et al.  Polymorphic expression of decay‐accelerating factor in human colorectal cancer , 2001, Journal of gastroenterology and hepatology.

[31]  Sayan Mukherjee,et al.  Modeling Cancer Progression via Pathway Dependencies , 2008, PLoS Comput. Biol..

[32]  P. Blache,et al.  Expression of the carcinoembryonic antigen gene is inhibited by SOX9 in human colon carcinoma cells. , 2005, Cancer research.

[33]  J. Turnay,et al.  Differentiation of human colon adenocarcinoma cells alters the expression and intracellular localization of annexins A1, A2, and A5 , 2005, Journal of cellular biochemistry.

[34]  D. Boyd,et al.  ATF3 Regulates the Stability of p53: A Link to Cancer , 2006, Cell cycle.

[35]  Adam Elzagheid,et al.  E-cadherin expression pattern in primary colorectal carcinomas and their metastases reflects disease outcome. , 2006, World journal of gastroenterology.

[36]  Jin Han,et al.  [Expression of vascular endothelial growth factor in colorectal cancer and its clinical significance]. , 2002, Zhonghua yi xue za zhi.

[37]  Hong-fu Zhang,et al.  Mad2 and p27 expression profiles in colorectal cancer and its clinical significance. , 2004, World journal of gastroenterology.

[38]  D. Powe,et al.  Distribution of collagenase and tissue inhibitor of metalloproteinases (TIMP) in colorectal tumours , 1991, International journal of cancer.

[39]  P. Good,et al.  Permutation Tests: A Practical Guide to Resampling Methods for Testing Hypotheses , 1995 .

[40]  G. Baretton,et al.  Overexpression of the insulin‐like growth factor I receptor in human colon carcinomas , 2002, Cancer.

[41]  A. Schatzkin,et al.  Loss of insulin-like growth factor-II imprinting and the presence of screen-detected colorectal adenomas in women. , 2004, Journal of the National Cancer Institute.

[42]  S. Ye Polymorphism in matrix metalloproteinase gene promoters: implication in regulation of gene expression and susceptibility of various diseases. , 2000, Matrix biology : journal of the International Society for Matrix Biology.

[43]  Purvesh Khatri,et al.  Ontological analysis of gene expression data: current tools, limitations, and open problems , 2005, Bioinform..

[44]  Phil Quirke,et al.  Expression of DNA Double-Strand Break Repair Proteins ATM and BRCA1 Predicts Survival in Colorectal Cancer , 2006, Clinical Cancer Research.

[45]  M. McArthur,et al.  Epidermal hyperplasia and oral carcinoma in mice overexpressing the transcription factor ATF3 in basal epithelial cells , 2007, Molecular carcinogenesis.

[46]  J. Mariadason,et al.  Genetic reprogramming in pathways of colonic cell maturation induced by short chain fatty acids: comparison with trichostatin A, sulindac, and curcumin and implications for chemoprevention of colon cancer. , 2000, Cancer research.

[47]  Minho Won,et al.  Sustained activation of protein kinase C downregulates nuclear factor-κB signaling by dissociation of IKK-γ and Hsp90 complex in human colonic epithelial cells , 2007 .

[48]  C. Creighton Multiple Oncogenic Pathway Signatures Show Coordinate Expression Patterns in Human Prostate Tumors , 2008, PloS one.

[49]  Stefan Michiels,et al.  Prediction of cancer outcome with microarrays: a multiple random validation strategy , 2005, The Lancet.

[50]  Seong-ho Lee,et al.  Indole-3-carbinol and 3,3'-diindolylmethane induce expression of NAG-1 in a p53-independent manner. , 2005, Biochemical and biophysical research communications.

[51]  Sayan Mukherjee,et al.  Estimating Dataset Size Requirements for Classifying DNA Microarray Data , 2003, J. Comput. Biol..

[52]  D. Buckley,et al.  Enhanced expression of the complement regulatory protein CD55 predicts a poor prognosis in colorectal cancer patients , 2003, Cancer Immunology, Immunotherapy.

[53]  M. Loda,et al.  HRad17, a human homologue of the Schizosaccharomyces pombe checkpoint gene rad17, is overexpressed in colon carcinoma. , 1999, Cancer research.

[54]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[55]  Hiroyuki Yamamoto,et al.  Interplay of Insulin-Like Growth Factor-II, Insulin-Like Growth Factor-I, Insulin-Like Growth Factor-I Receptor, COX-2, and Matrix Metalloproteinase-7, Play Key Roles in the Early Stage of Colorectal Carcinogenesis , 2004, Clinical Cancer Research.

[56]  G. Schinzari,et al.  Expression of vascular endothelial growth factor, mitogen-activated protein kinase and p53 in human colorectal cancer. , 2002, Anticancer research.

[57]  M. Kikuchi,et al.  Expressions of two adenomatous polyposis coli and E-cadherin proteins on human colorectal cancers , 2003, Virchows Archiv.

[58]  Susumu Goto,et al.  The KEGG databases at GenomeNet , 2002, Nucleic Acids Res..

[59]  I. Screpanti,et al.  Mutations of an intronic repeat induce impaired MRE11 expression in primary human cancer with microsatellite instability , 2004, Oncogene.

[60]  Y. Bignon,et al.  Allelic imbalance at NBS1 is frequent in both proximal and distal colorectal carcinoma. , 2000, Oncology reports.

[61]  Sakae Nagaoka,et al.  Expression level of Wnt signaling components possibly influences the biological behavior of colorectal cancer in different age groups. , 2004, Experimental and molecular pathology.

[62]  Graziano Pesole,et al.  Selection of relevant genes in cancer diagnosis based on their prediction accuracy , 2007, Artif. Intell. Medicine.

[63]  Graziano Pesole,et al.  Regularized Least Squares Cancer Classifiers from DNA microarray data , 2005, BMC Bioinformatics.

[64]  Motoharu Seiki,et al.  Membrane-type 1 matrix metalloproteinase: a key enzyme for tumor invasion. , 2003, Cancer letters.

[65]  Puneeth Iyengar,et al.  Adipocyte-secreted factors synergistically promote mammary tumorigenesis through induction of anti-apoptotic transcriptional programs and proto-oncogene stabilization , 2003, Oncogene.

[66]  S. Dudoit,et al.  Stage II colon cancer prognosis prediction by tumor gene expression profiling. , 2006, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[67]  P. Khatri,et al.  Profiling gene expression using onto-express. , 2002, Genomics.