Specific Biomarkers: Detection of Cancer Biomarkers Through High-Throughput Transcriptomics Data

Cancer is a systemic disease involving dysregulated biological processes of cell proliferation, metabolism, and apoptosis. Patients with some types of cancer survive longer than others, and some cancers are even curable if they are diagnosed and treated properly at an early stage. It is therefore essential to find biomarkers that detect these cancers early. With the rapid development of high-throughput microarray and sequencing technologies, many biomarker-based assays for early cancer diagnosis have been proposed, and some are already available on the market. Most cancer biomarkers are detected by comparing cancer samples with normal samples within a single cancer type, and most have not been compared against other cancer types. In this research, we propose a novel computational method to comprehensively detect highly accurate cancer biomarkers for different groups of cancer types, with a special emphasis on detection specificity against control samples that include both samples from healthy persons and samples from other cancer types. We call such biomarkers specific biomarkers for a given cancer group. A cancer group may be defined as cancers of the same type; cancers with similar survival rates, grades, or development stages; or cancers in the same human body system. The proposed algorithm is extensively evaluated across eight cancer types, and the detection performance shows that the specific biomarkers have reasonable sensitivities and very high specificities. The main contributions of this work are (a) the detection of highly specific biomarkers for eight cancer types and (b) the detection of specific biomarkers for cancers with similar survival rates. The proposed algorithm may also be used to detect specific biomarkers for cancers of given stages or grades, or for cancers belonging to a given body system.
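To make the evaluation criterion concrete, the sketch below shows how sensitivity and specificity might be computed for a single candidate biomarker when the control set pools healthy samples with samples from other cancer types. It is an illustration only, not the algorithm proposed in this work: the function name, the simple "expression above a threshold" decision rule, the threshold, and all expression values are hypothetical.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(
    target: Sequence[float],
    controls: Sequence[float],
    threshold: float,
) -> Tuple[float, float]:
    """Score one candidate biomarker with a fixed expression cutoff.

    Assumption (hypothetical rule): a sample is called positive when its
    expression value is strictly greater than `threshold`.
    """
    if not target or not controls:
        raise ValueError("both sample groups must be non-empty")
    # True positives: target-group samples called positive.
    true_pos = sum(1 for v in target if v > threshold)
    # True negatives: control samples called negative.
    true_neg = sum(1 for v in controls if v <= threshold)
    return true_pos / len(target), true_neg / len(controls)


# Made-up expression values for one gene (illustrative only).
target_cancer = [8.1, 7.9, 6.2, 8.4]
healthy = [5.0, 5.3, 4.8]
other_cancers = [5.9, 6.1, 7.2]

# The control set combines healthy samples and other cancer types,
# so specificity penalizes markers that also fire in other cancers.
sens, spec = sensitivity_specificity(
    target_cancer, healthy + other_cancers, threshold=7.0
)
print(f"sensitivity={sens:.2f}, specificity={spec:.2f}")
```

In this toy data the marker looks perfectly specific against healthy samples alone, but one sample from another cancer type exceeds the cutoff, which lowers the specificity once the controls are pooled.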

[33]  Ernst Hafen,et al.  The Drosophila homolog of human tumor suppressor TSC-22 promotes cellular growth, proliferation, and survival , 2008, Proceedings of the National Academy of Sciences.

[34]  Noushin Ghaffari,et al.  Biomarker discovery across annotated and unannotated microarray datasets using semi-supervised learning , 2008, BMC Genomics.

[35]  Jin-Ming Yang,et al.  The individualization of cancer therapy: the unexpected role of p53. , 2006, Transactions of the American Clinical and Climatological Association.

[36]  Peng Huang,et al.  Keratinocyte growth factor/fibroblast growth factor-7-regulated cell migration and invasion through activation of NF-kappaB transcription factors. , 2007, The Journal of biological chemistry.

[37]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[38]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[39]  A. Samadani,et al.  Molecular Insight in Gastric Cancer Induction: An Overview of Cancer Stemness Genes , 2013, Cell Biochemistry and Biophysics.

[40]  Amaury Lendasse,et al.  A Two-Stage Methodology Using K-NN and False-Positive Minimizing ELM for Nominal Data Classification , 2014, Cognitive Computation.

[41]  X. Hou,et al.  A new role of NUAK1: directly phosphorylating p53 and regulating cell proliferation , 2011, Oncogene.

[42]  Yan Gao,et al.  Calpain 2 Regulates Akt-FoxO-p27Kip1 Protein Signaling Pathway in Mammary Carcinoma* , 2012, The Journal of Biological Chemistry.

[43]  F. Liu,et al.  MiTF links Erk1/2 kinase and p21 activation after UVC radiation in normal human melanocytes and melanoma cells , 2010 .

[44]  D. Nomura,et al.  Monoacylglycerol Lipase Regulates a Fatty Acid Network that Promotes Cancer Pathogenesis , 2010, Cell.

[45]  P. Wilkinson,et al.  Ovarian cancer antigen CA125: a prospective clinical assessment of its role as a tumour marker. , 1984, British Journal of Cancer.

[46]  Akhilesh Pandey,et al.  Identification of novel highly expressed genes in pancreatic ductal adenocarcinomas through a bioinformatics analysis of expressed sequence tags , 2004, Cancer biology & therapy.

[47]  M. Duffy,et al.  Carcinoembryonic antigen as a marker for colorectal cancer: is it clinically useful? , 2001, Clinical chemistry.

[48]  Chen Zhang,et al.  A novel multi-stage feature selection method for microarray expression data analysis , 2013, Int. J. Data Min. Bioinform..

[49]  Juan Cui,et al.  A Comparative Study of Gene-Expression Data of Basal Cell Carcinoma and Melanoma Reveals New Insights about the Two Cancers , 2012, PloS one.

[50]  Brian J. Smith,et al.  Extracellular Matrix 1 (ECM1) Expression Is a Novel Prognostic Marker for Poor Long-Term Survival in Breast Cancer: A Hospital-Based Cohort Study in Iowa , 2009, Annals of Surgical Oncology.

[51]  D. DeMets,et al.  Biomarkers and surrogate endpoints: Preferred definitions and conceptual framework , 2001, Clinical pharmacology and therapeutics.

[52]  Eeva Kettunen,et al.  Changes in gene expression during progression of ovarian carcinoma , 2001, Cancer genetics and cytogenetics.

[53]  P. Philip,et al.  Carbohydrate antigen 19‐9 is a prognostic and predictive biomarker in patients with advanced pancreatic cancer who receive gemcitabine‐containing chemotherapy , 2013, Cancer.

[54]  S. Tanaka,et al.  Cloning of a human gene closely related to the genes coding for the c-myc single-strand binding proteins. , 2000, Gene.

[55]  R. Pio,et al.  Expression of the adrenomedullin binding protein, complement factor H, in the pancreas and its physiological impact on insulin secretion. , 2001, The Journal of endocrinology.

[56]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[57]  Celine S. Hong,et al.  A Computational Method for Prediction of Excretory Proteins and Application to Identification of Gastric Cancer Markers in Urine , 2011, PloS one.

[58]  R. Pollack,et al.  Cancer biology. , 1978, Science.

[59]  Sundaram Suresh,et al.  A Meta-Cognitive Learning Algorithm for an Extreme Learning Machine Classifier , 2013, Cognitive Computation.

[60]  Louise C. Showe,et al.  Recursive Cluster Elimination (RCE) for classification and feature selection from gene expression data , 2007, BMC Bioinformatics.

[61]  Jaakko Astola,et al.  Comparison of Affymetrix data normalization methods using 6,926 experiments across five array generations , 2009, BMC Bioinformatics.

[62]  F. Speleman,et al.  The alphaE-catenin gene (CTNNA1) acts as an invasion-suppressor gene in human colon cancer cells. , 1999, Oncogene.

[63]  Xuefeng Bruce Ling,et al.  Multiclass cancer classification and biomarker discovery using GA-based algorithms , 2005, Bioinform..

[64]  Jie Zheng,et al.  [Study of genes related to gastric cancer and its premalignant lesions with fluorescent differential display]. , 2004, Ai zheng = Aizheng = Chinese journal of cancer.

[65]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[66]  Angela Russo,et al.  Intersectin 1 is required for neuroblastoma tumorigenesis , 2011, Oncogene.

[67]  Taesung Park,et al.  Robust imputation method for missing values in microarray data , 2007, BMC Bioinformatics.

[68]  Peihua Lu,et al.  Interleukin-1β mediates proliferation and differentiation of multipotent neural precursor cells through the activation of SAPK/JNK pathway , 2007, Molecular and Cellular Neuroscience.