Gene Selection for the Discrimination of Colorectal Cancer.

BACKGROUND Colorectal cancer (CRC) is the third most common cancer worldwide. Cancer discrimination is a typical application of gene expression analysis using microarray technique. However, microarray data suffers from the curse of dimensionality and usual imbalanced class distribution between majority (tumor samples) and minority (normal samples) classes. Feature gene selection is necessary and important for cancer discrimination. OBJECTIVES To select feature gene for the discrimination of CRC. METHODS We improve the feature selection algorithm based on differential evolution, DEFSw by using RUSBoost classifier and weight accuracy instead of the common classifier and evaluation measure for selecting feature genes from imbalance data. We firstly extract differently expressed genes (DEGs) from the CRC dataset of the TCGA and then select the feature genes from the DEGs using the improved DEFSw algorithm. Finally, we validate the selected feature gene sets using independent datasets and retrieve the cancer related information for these genes based on text mining through the Coremine Medical online database. RESULTS We select out 16 single-gene feature sets for colorectal cancer discrimination and 19 single-gene feature sets only for colon cancer discriminationConclusions: In summary, we find a series of high potential candidate biomarkers or signatures, which can discriminate either or both of colon cancer and rectal cancer with high sensitivity and specificity.

[1]  S. Pileri,et al.  Relationship between Dyskerin Expression and Telomerase Activity in Human Breast Cancer , 2008, Cellular oncology : the official journal of the International Society for Cellular Oncology.

[2]  Mira Ayadi,et al.  Gene Expression Classification of Colon Cancer into Molecular Subtypes: Characterization, Validation, and Prognostic Value , 2013, PLoS medicine.

[3]  Somatic mutations of amino acid metabolism-related genes in gastric and colorectal cancers and their regional heterogeneity - a short report , 2014, Cellular Oncology.

[4]  Zhaoqing Yang,et al.  IL-6 Inhibits the Targeted Modulation of PDCD4 by miR-21 in Prostate Cancer , 2015, PloS one.

[5]  Y. Adachi,et al.  Neurotransmitter Transporter Family Including SLC6A6 and SLC6A13 Contributes to the 5‐Aminolevulinic Acid (ALA)‐Induced Accumulation of Protoporphyrin IX and Photodamage, through Uptake of ALA by Cancerous Cells , 2014, Photochemistry and photobiology.

[6]  Xiangjiao Meng,et al.  PDCD4 as a predictor of sensitivity to neoadjuvant chemoradiotherapy in locally advanced rectal cancer patients. , 2014, Asian Pacific journal of cancer prevention : APJCP.

[7]  R Engers,et al.  DKC1 overexpression associated with prostate cancer progression , 2009, British Journal of Cancer.

[8]  Yong Xu,et al.  RPCA-Based Tumor Classification Using Gene Expression Data , 2015, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[9]  B. Wiedenmann,et al.  Activin A stimulates vascular endothelial growth factor gene transcription in human hepatocellular carcinoma cells. , 2004, Gastroenterology.

[10]  S. Tomida,et al.  KIAA1199 interacts with glycogen phosphorylase kinase β-subunit (PHKB) to promote glycogen breakdown and cancer cell survival , 2014, Oncotarget.

[11]  F. Petraglia,et al.  Activin, inhibin and the human breast , 2004, Molecular and Cellular Endocrinology.

[12]  G. G. Van den Eynden,et al.  P‐cadherin in adhesion and invasion: Opposite roles in colon and bladder carcinoma , 2011, International journal of cancer.

[13]  H. Tsuda,et al.  Reduced Expression of GNA11 and Silencing of MCT1 in Human Breast Cancers , 2003, Oncology.

[14]  C. S. St. Hill,et al.  C2-O-sLeX Glycoproteins Are E-Selectin Ligands that Regulate Invasion of Human Colon and Hepatic Carcinoma Cells , 2011, PloS one.

[15]  K. Shroyer,et al.  Hypoxia promotes colon cancer dissemination through up-regulation of cell migration-inducing protein (CEMIP) , 2015, Oncotarget.

[16]  Li-Yeh Chuang,et al.  Improved binary PSO for feature selection using gene expression data , 2008, Comput. Biol. Chem..

[17]  Jianfeng Jin,et al.  MicroRNA-107: a novel promoter of tumor progression that targets the CPEB3/EGFR axis in human hepatocellular carcinoma , 2015, Oncotarget.

[18]  J. Scheller,et al.  The IL-6/sIL-6R complex as a novel target for therapeutic approaches , 2007, Expert opinion on therapeutic targets.

[19]  Lixin Sun,et al.  P-cadherin promotes liver metastasis and is associated with poor prognosis in colon cancer. , 2011, The American journal of pathology.

[20]  Y. Nakamura,et al.  Overexpressing PKIB in prostate cancer promotes its aggressiveness by linking between PKA and Akt pathways , 2009, Oncogene.

[21]  B. Dickson,et al.  Familial pheochromocytoma and renal cell carcinoma syndrome: TMEM127 as a novel candidate gene for the association , 2015, Virchows Archiv.

[22]  Antonio J. Nebro,et al.  Molecular Docking Optimization in the Context of Multi-Drug Resistant and Sensitive EGFR Mutants , 2016, Molecules.

[23]  W. Jiang,et al.  KIAA1199 and its biological role in human cancer and cancer cells (review). , 2014, Oncology reports.

[24]  K. Mimori,et al.  Significance of INHBA expression in human colorectal cancer. , 2013, Oncology reports.

[25]  Masato Terashima,et al.  Activin signal promotes cancer progression and is involved in cachexia in a subset of pancreatic cancer. , 2015, Cancer letters.

[26]  Y. Matsumura,et al.  Role of SLC6A6 in promoting the survival and multidrug resistance of colorectal cancer , 2014, Scientific Reports.

[27]  P. Vivas-Mejia,et al.  Upregulation of miR-21 in Cisplatin Resistant Ovarian Cancer via JNK-1/c-Jun Pathway , 2014, PloS one.

[28]  W. Ahn,et al.  An 8-gene signature, including methylated and down-regulated glutathione peroxidase 3, of gastric cancer. , 2009, International journal of oncology.

[29]  D. Lodygin,et al.  IL-6R/STAT3/miR-34a feedback loop promotes EMT-mediated colorectal cancer invasion and metastasis. , 2014, The Journal of clinical investigation.

[30]  H. Nemoto,et al.  Frequent CDH3 demethylation in advanced gastric carcinoma. , 2009, Anticancer research.

[31]  Wengang Zhou,et al.  A novel class dependent feature selection method for cancer biomarker discovery , 2014, Comput. Biol. Medicine.

[32]  G. Seki,et al.  Localization of NBC-1 variants in human kidney and renal cell carcinoma. , 2003, Biochemical and biophysical research communications.

[33]  A. Oberg,et al.  Highly Methylated Genes in Colorectal Neoplasia: Implications for Screening , 2007, Cancer Epidemiology Biomarkers & Prevention.

[34]  P. Sadłecki,et al.  Serum Inhibin A and Inhibin B Levels in Epithelial Ovarian Cancer Patients , 2014, PloS one.

[35]  Maode Lai,et al.  Identification of differentially expressed proteins in colorectal cancer by proteomics: Down‐regulation of secretagogin , 2006, Proteomics.

[36]  K. Lawson,et al.  P-cadherin potentiates ligand-dependent EGFR and IGF-1R signaling in dysplastic and malignant oral keratinocytes. , 2014, Oncology reports.

[37]  A. Tejerina,et al.  Relationship between Genotypes Sult1a2 and Cyp2d6 and Tamoxifen Metabolism in Breast Cancer Patients , 2013, PloS one.

[38]  M. Furia,et al.  Real-time PCR quantification of human DKC1 expression in colorectal cancer , 2008, Acta oncologica.

[39]  Michael Krawczak,et al.  Genome-wide association study for colorectal cancer identifies risk polymorphisms in German familial cases and implicates MAPK signalling pathways in disease susceptibility. , 2010, Carcinogenesis.

[40]  Wei Xie,et al.  Accurate Cancer Classification Using Expressions of Very Few Genes , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[41]  Taghi M. Khoshgoftaar,et al.  RUSBoost: A Hybrid Approach to Alleviating Class Imbalance , 2010, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[42]  Jie Liu,et al.  miR-183 induces cell proliferation, migration, and invasion by regulating PDCD4 expression in the SW1990 pancreatic cancer cell line. , 2015, Biomedicine & pharmacotherapy = Biomedecine & pharmacotherapie.

[43]  Hala Alshamlan,et al.  mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling , 2015, BioMed research international.

[44]  Anirban Mukherjee,et al.  Cancer Classification from Gene Expression Data by NPPC Ensemble , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[45]  R. Seruca,et al.  CCAAT/Enhancer Binding Protein β (C/EBPβ) Isoforms as Transcriptional Regulators of the Pro-Invasive CDH3/P-Cadherin Gene in Human Breast Cancer Cells , 2013, PloS one.

[46]  Nitesh V. Chawla,et al.  SMOTEBoost: Improving Prediction of the Minority Class in Boosting , 2003, PKDD.

[47]  Rami N. Khushaba,et al.  Feature subset selection using differential evolution and a wheel based search strategy , 2013, Swarm Evol. Comput..

[48]  L. Wȩglarz,et al.  Transcriptional regulation of interleukin 6 and its receptor in colon cancer cells by phytic acid. , 2010, Acta poloniae pharmaceutica.

[49]  Daniel Heim,et al.  Copy number changes of clinically actionable genes in melanoma, non‐small cell lung cancer and colorectal cancer—A survey across 822 routine diagnostic cases , 2016, Genes, chromosomes & cancer.

[50]  N Chawla,et al.  SMOTEBoost:ブースティングにおけるマイノリティクラスの予測改善(原標題は英語) , 2003 .

[51]  Joobae Park,et al.  Association of interindividual differences in p14ARF promoter methylation with single nucleotide polymorphism in primary colorectal cancer , 2008, Cancer.

[52]  Jan Komorowski,et al.  BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm486 Data and text mining Monte Carlo , 2022 .

[53]  A. Koong,et al.  Epigenetic Regulation of Gene Expression in Cervical Cancer Cells by the Tumor Microenvironment 1 , 2000 .

[54]  Hui Liu,et al.  Dyskerin Overexpression in Human Hepatocellular Carcinoma Is Associated with Advanced Clinical Stage and Poor Patient Prognosis , 2012, PloS one.

[55]  H. Allgayer,et al.  Loss of programmed cell death 4 expression marks adenoma‐carcinoma transition, correlates inversely with phosphorylated protein kinase B, and is an independent prognostic factor in resected colorectal cancer , 2007, Cancer.

[56]  G. Ryffel,et al.  HNF4α reduces proliferation of kidney cells and affects genes deregulated in renal cell carcinoma , 2005, Oncogene.

[57]  Yanqin Sun,et al.  Overexpression of secretagogin inhibits cell apoptosis and induces chemoresistance in small cell lung cancer under the regulation of miR-494 , 2014, Oncotarget.

[58]  Steven S. Chang,et al.  Inactivation of the Tumor Suppressor Genes Causing the Hereditary Syndromes Predisposing to Head and Neck Cancer via Promoter Hypermethylation in Sporadic Head and Neck Cancers , 2010, ORL.

[59]  M. Suiko,et al.  Sulfation of benzyl alcohol by the human cytosolic sulfotransferases (SULTs): a systematic analysis , 2016, Journal of applied toxicology : JAT.

[60]  S. Ingvarsson,et al.  MicroRNA-451 suppresses tumor cell growth by down-regulating IL6R gene expression. , 2014, Cancer epidemiology.

[61]  Chen Jiang,et al.  Genomic profiling in locally advanced and inflammatory breast cancer and its link to DCE-MRI and overall survival , 2015, International journal of hyperthermia : the official journal of European Society for Hyperthermic Oncology, North American Hyperthermia Group.

[62]  PKIB expression strongly correlated with phosphorylated Akt expression in breast cancers and also with triple-negative breast cancer subtype , 2012, Medical Molecular Morphology.

[63]  K. Kelsey,et al.  Interaction between the bone morphogenetic proteins and Ras/MAP-kinase signalling pathways in lung cancer , 2005, British Journal of Cancer.

[64]  Zhen Ji,et al.  Iterative ensemble feature selection for multiclass classification of imbalanced microarray data , 2016, Journal of Biological Research-Thessaloniki.

[65]  Anne-Mette K. Hein,et al.  Alternative Splicing in Colon, Bladder, and Prostate Cancer Identified by Exon Array Analysis*S , 2008, Molecular & Cellular Proteomics.

[66]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[67]  J. Hackermüller,et al.  CD31, EDNRB and TSPAN7 are promising prognostic markers in clear‐cell renal cell carcinoma revealed by genome‐wide expression analyses of primary tumors and metastases , 2012, International journal of cancer.

[68]  A Min Tjoa,et al.  Performance Comparison between Naïve Bayes, Decision Tree and k-Nearest Neighbor in Searching Alternative Design in an Energy Simulation Tool , 2013 .

[69]  L. Fliegel,et al.  Defining the Na+/H+ exchanger NHE1 interactome in triple-negative breast cancer cells. , 2017, Cellular signalling.

[70]  J. Souverijn,et al.  Multitarget stool DNA testing for colorectal-cancer screening. , 2014, The New England journal of medicine.

[71]  Yusuke Nakamura,et al.  Identification of a Novel Tumor-Associated Antigen, Cadherin 3/P-Cadherin, as a Possible Target for Immunotherapy of Pancreatic, Gastric, and Colorectal Cancers , 2008, Clinical Cancer Research.

[72]  Erik D. Goodman,et al.  Swarmed feature selection , 2004, 33rd Applied Imagery Pattern Recognition Workshop (AIPR'04).

[73]  H. Allgayer Pdcd4, a colon cancer prognostic that is regulated by a microRNA. , 2007, Critical reviews in oncology/hematology.

[74]  R. Russell,et al.  Expression of bone morphogenetic proteins in human prostatic adenocarcinoma and benign prostatic hyperplasia. , 1992, British Journal of Cancer.

[75]  N. Miller,et al.  MicroRNA-21 and PDCD4 expression in colorectal cancer. , 2011, European journal of surgical oncology : the journal of the European Society of Surgical Oncology and the British Association of Surgical Oncology.

[76]  Jill S Barnholtz-Sloan,et al.  Induction of KIAA1199/CEMIP is associated with colon cancer phenotype and poor patient survival , 2015, Oncotarget.

[77]  Yate-Ching Yuan,et al.  Down-regulation of programmed cell death 4 (PDCD4) is associated with aromatase inhibitor resistance and a poor prognosis in estrogen receptor-positive breast cancer , 2015, Breast Cancer Research and Treatment.

[78]  T. Matsuda,et al.  Bystin in human cancer cells: intracellular localization and function in ribosome biogenesis. , 2007, The Biochemical journal.

[79]  I. Pastan,et al.  Expression of the interleukin 6 receptor and interleukin 6 in prostate carcinoma cells. , 1990, Cancer research.

[80]  Z. Ye,et al.  Role of BMP3 in progression of gastric carcinoma in Chinese people. , 2010, World journal of gastroenterology.

[81]  Jing-he Li,et al.  N-cadherin and P-cadherin are biomarkers for invasion, metastasis, and poor prognosis of gallbladder carcinomas. , 2014, Pathology, research and practice.

[82]  T. Dingermann,et al.  An Interstitial Deletion at 3p21.3 Results in the Genetic Fusion of MLH1 and ITGA9 in a Lynch Syndrome Family , 2009, Clinical Cancer Research.

[83]  Liviu Badea,et al.  Identification of potential biomarkers for early and advanced gastric adenocarcinoma detection. , 2010, Hepato-gastroenterology.

[84]  André F. Vieira,et al.  The basal epithelial marker P-cadherin associates with breast cancer cell populations harboring a glycolytic and acid-resistant phenotype , 2014, BMC Cancer.

[85]  E. Sauter,et al.  Increased shedding of soluble fragments of P‐cadherin in nipple aspirate fluids from women with breast cancer , 2008, Cancer science.

[86]  H. Bhat,et al.  Natural Antioxidants Exhibit Chemopreventive Characteristics through the Regulation of CNC b‐Zip Transcription Factors in Estrogen‐Induced Breast Carcinogenesis , 2014, Journal of biochemical and molecular toxicology.

[87]  Byung Ro Moon,et al.  Hybrid Genetic Algorithms for Feature Selection , 2004, IEEE Trans. Pattern Anal. Mach. Intell..

[88]  B. Minsky Unique considerations in the patient with rectal cancer. , 2011, Seminars in oncology.

[89]  B. Li,et al.  miR-150 Modulates Cisplatin Chemosensitivity and Invasiveness of Muscle-Invasive Bladder Cancer Cells via Targeting PDCD4 In Vitro , 2014, Medical science monitor : international medical journal of experimental and clinical research.

[90]  Shiquan Sun,et al.  A Kernel-Based Multivariate Feature Selection Method for Microarray Data Classification , 2014, PloS one.

[91]  Namita Srivastava,et al.  A fuzzy based feature selection from independent component subspace for machine learning classification of microarray data , 2016, Genomics data.

[92]  Q. Zou,et al.  Hierarchical Classification of Protein Folds Using a Novel Ensemble Classifier , 2013, PloS one.

[93]  C. Mathers,et al.  Cancer incidence and mortality worldwide: Sources, methods and major patterns in GLOBOCAN 2012 , 2015, International journal of cancer.

[94]  L. Crinò,et al.  DKC1 gene mutations in human sporadic cancer. , 2013, Histology and histopathology.

[95]  Xin Huang,et al.  Functional proteomic analysis reveals the involvement of KIAA1199 in breast cancer growth, motility and invasiveness , 2014, BMC Cancer.

[96]  Adel Al-Jumaily,et al.  Differential evolution based feature subset selection , 2008, 2008 19th International Conference on Pattern Recognition.

[97]  Hua-mei Tang,et al.  Screening of new tumor suppressor genes in sporadic colorectal cancer patients. , 2008, Hepato-gastroenterology.

[98]  C. Hellerbrand,et al.  Downregulation of P-cadherin expression in hepatocellular carcinoma induces tumorigenicity. , 2013, International journal of clinical and experimental pathology.

[99]  Y. Li,et al.  Specific N-glycans of Hepatocellular Carcinoma Cell Surface and the Abnormal Increase of Core-α-1, 6-fucosylated Triantennary Glycan via N-acetylglucosaminyltransferases-IVa Regulation , 2015, Scientific Reports.

[100]  J. Ko,et al.  Overexpression and β-1,6-N-Acetylglucosaminylation-initiated Aberrant Glycosylation of TIMP-1 , 2012, The Journal of Biological Chemistry.

[101]  Karen A Gelmon,et al.  P-cadherin expression as a prognostic biomarker in a 3992 case tissue microarray series of breast cancer , 2011, Modern Pathology.

[102]  Toshiaki Watanabe,et al.  Prognostic significance of PDCD4 expression and association with microRNA-21 in each Dukes' stage of colorectal cancer patients. , 2012, Oncology reports.

[103]  V. Prasolov,et al.  Altered Expression of Multiple Genes Involved in Retinoic Acid Biosynthesis in Human Colorectal Cancer , 2014, Pathology & Oncology Research.

[104]  W. Isaacs,et al.  P-Cadherin is a basal cell-specific epithelial marker that is not expressed in prostate cancer. , 1997, Clinical cancer research : an official journal of the American Association for Cancer Research.

[105]  Xudong Wang,et al.  Association between KIAA1199 overexpression and tumor invasion, TNM stage, and poor prognosis in colorectal cancer. , 2015, International journal of clinical and experimental pathology.

[106]  Xi-shun Cheng,et al.  Hypermethylation and Expression Silencing of PDCD4 Gene in Hepatocellular Carcinoma , 2016, Medicine.

[107]  J. Jessurun,et al.  The high affinity selectin glycan ligand C2-O-sLex and mRNA transcripts of the core 2 β-1,6-N-acetylglusaminyltransferase (C2GnT1) gene are highly expressed in human colorectal adenocarcinomas , 2009, BMC Cancer.

[108]  Chang-Yi Hsieh,et al.  Activin A Enhances Prostate Cancer Cell Migration Through Activation of Androgen Receptor and Is Overexpressed in Metastatic Prostate Cancer , 2009, Journal of bone and mineral research : the official journal of the American Society for Bone and Mineral Research.

[109]  I. Csabai,et al.  Aberrant DNA methylation of WNT pathway genes in the development and progression of CIMP-negative colorectal cancer , 2016, Epigenetics.

[110]  J. Pouysségur,et al.  The Na+/HCO3− Co‐Transporter SLC4A4 Plays a Role in Growth and Migration of Colon and Breast Cancer Cells , 2015, Journal of cellular physiology.

[111]  Yun Chen,et al.  Regulation of the Expression of Cytoplasmic Polyadenylation Element Binding Proteins for the Treatment of Cancer. , 2016, Anticancer research.

[112]  P. Conesa‐Zamora,et al.  Immunohistochemical expression profile of β-catenin, E-cadherin, P-cadherin, laminin-5γ2 chain, and SMAD4 in colorectal serrated adenocarcinoma. , 2012, Human pathology.

[113]  M. Cairns,et al.  Regulation of the tumour suppressor PDCD4 by miR-499 and miR-21 in oropharyngeal cancers , 2016, BMC Cancer.

[114]  S. Wakabayashi,et al.  Crystallization and preliminary crystallographic analysis of the human calcineurin homologous protein CHP2 bound to the cytoplasmic region of the Na+/H+ exchanger NHE1. , 2005, Acta crystallographica. Section F, Structural biology and crystallization communications.

[115]  Wenfu Wang,et al.  Helicobacter pylori Promotes Epithelial–Mesenchymal Transition in Gastric Cancer by Downregulating Programmed Cell Death Protein 4 (PDCD4) , 2014, PloS one.

[116]  W. Fang,et al.  Reduced PDCD4 Expression Promotes Cell Growth Through PI3K/Akt Signaling in Non-Small Cell Lung Cancer , 2016, Oncology research.

[117]  R. Domingues,et al.  Coexistence of paraganglioma/pheochromocytoma and papillary thyroid carcinoma: a four-case series analysis , 2015, Familial Cancer.

[118]  D. Latchman,et al.  Identification and characterization of the promoter region of the Nav1.7 voltage-gated sodium channel gene (SCN9A) , 2008, Molecular and Cellular Neuroscience.

[119]  T. Campbell,et al.  Functional expression of the voltage-gated Na+-channel Nav1.7 is necessary for EGF-mediated invasion in human non-small cell lung cancer cells , 2013, Journal of Cell Science.

[120]  S. Srikantan,et al.  The tumor susceptibility gene TMEM127 is mutated in renal cell carcinomas and modulates endolysosomal function. , 2014, Human molecular genetics.

[121]  Li Lin,et al.  Voltage‐gated sodium channel Nav1.7 promotes gastric cancer progression through MACC1‐mediated upregulation of NHE1 , 2016, International journal of cancer.

[122]  N. Cho,et al.  Panel of candidate biomarkers for renal cell carcinoma. , 2010, Journal of proteome research.

[123]  G. Chen,et al.  Concomitant high expression of BRAFV600E, P‐cadherin and cadherin 6 is associated with High TNM stage and lymph node metastasis in conventional papillary thyroid carcinoma , 2016, Clinical endocrinology.

[124]  W. El-Rifai,et al.  Activin a signaling regulates cell invasion and proliferation in esophageal adenocarcinoma , 2015, Oncotarget.