Immunosignature Screening for Multiple Cancer Subtypes Based on Expression Rule

Liquid biopsy (i.e., fluid biopsy) involves a series of clinical examination approaches. Monitoring of cancer immunological status by the “immunosignature” of patients presents a novel method for tumor-associated liquid biopsy. The major work content and the core technological difficulties for the monitoring of cancer immunosignature are the recognition of cancer-related immune-activating antigens by high-throughput screening approaches. Currently, one key task of immunosignature-based liquid biopsy is the qualitative and quantitative identification of typical tumor-specific antigens. In this study, we reused two sets of peptide microarray data that detected the expression level of potential antigenic peptides derived from tumor tissues to avoid the detection differences induced by chip platforms. Several machine learning algorithms were applied on these two sets. First, the Monte Carlo Feature Selection (MCFS) method was used to analyze features in two sets. A feature list was obtained according to the MCFS results on each set. Second, incremental feature selection method incorporating one classification algorithm (support vector machine or random forest) followed to extract optimal features and construct optimal classifiers. On the other hand, the repeated incremental pruning to produce error reduction, a rule learning algorithm, was applied on key features yielded by the MCFS method to extract quantitative rules for accurate cancer immune monitoring and pathologic diagnosis. Finally, obtained key features and quantitative rules were extensively analyzed.

[1]  M. Huibers,et al.  Molecular analysis in liquid biopsies for diagnostics of primary central nervous system lymphoma: Review of literature and future opportunities. , 2018, Critical reviews in oncology/hematology.

[2]  K. Braune,et al.  Characterization of Alstrom Syndrome 1 (ALMS1) Transcript Variants in Hodgkin Lymphoma Cells , 2017, PloS one.

[3]  Michael Seifert,et al.  Comparative transcriptomics reveals similarities and differences between astrocytoma grades , 2015, BMC Cancer.

[4]  Jing Lu,et al.  A similarity-based method for prediction of drug side effects with heterogeneous information. , 2018, Mathematical biosciences.

[5]  Jiannis Ragoussis,et al.  Direct reprogramming of fibroblasts into endothelial cells capable of angiogenesis and reendothelialization in tissue-engineered vessels , 2012, Proceedings of the National Academy of Sciences.

[6]  Q. Wei,et al.  Associations of PI3KR1 and mTOR Polymorphisms with Esophageal Squamous Cell Carcinoma Risk and Gene-Environment Interactions in Eastern Chinese Populations , 2015, Scientific Reports.

[7]  R. McLendon,et al.  Laminin alpha 2 enables glioblastoma stem cell growth , 2012, Annals of neurology.

[8]  Peter Nürnberg,et al.  Mutations in PLK4, encoding a master regulator of centriole biogenesis, cause microcephaly, growth failure and retinopathy , 2014, Nature Genetics.

[9]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[10]  Xiaoyong Pan,et al.  Identification of the Gene Expression Rules That Define the Subtypes in Glioma , 2018, Journal of clinical medicine.

[11]  Cong Sun,et al.  Aberrant CXCR4 and β-catenin expression in osteosarcoma correlates with patient survival , 2015, Oncology letters.

[12]  Andrew L. Miller,et al.  Ca(2+) coding and decoding strategies for the specification of neural and renal precursor cells during development. , 2016, Cell calcium.

[13]  Shuaiqun Wang,et al.  Recognizing Novel Tumor Suppressor Genes Using a Network Machine Learning Strategy , 2019, IEEE Access.

[14]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[15]  M. Fu,et al.  Confronting the unexpected: temporal, situational, and attributive dimensions of distressing symptom experience for breast cancer survivors. , 2010, Oncology nursing forum.

[16]  V. Mansouri,et al.  Protein profiling of infected human gastric epithelial cells with an Iranian Helicobacter pylori clinical isolate , 2017, Gastroenterology and hepatology from bed to bench.

[17]  T. Timmusk,et al.  Structure, alternative splicing, and expression of the human and mouse KCNIP gene family. , 2005, Genomics.

[18]  Sanjay V. Boddul,et al.  SNAP‐23 and VAMP‐3 contribute to the release of IL‐6 and TNFα from a human synovial sarcoma cell line , 2014, The FEBS journal.

[19]  Lei Chen,et al.  Predicting Drug Side Effects with Compact Integration of Heterogeneous Networks , 2019 .

[20]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[21]  O. Grzybowska-Izydorczyk,et al.  VEGF, ANGPT1, ANGPT2, and MMP-9 expression in the autologous hematopoietic stem cell transplantation and its impact on the time to engraftment , 2017, Annals of Hematology.

[22]  Helmut Krcmar,et al.  ProteomicsDB , 2017, Nucleic Acids Res..

[23]  A. Spencer,et al.  Analysis of Circulating Tumor DNA. , 2018, Methods in molecular biology.

[24]  Chung-Pin Li,et al.  Genes involved in angiogenesis and mTOR pathways are frequently mutated in Asian patients with pancreatic neuroendocrine tumors , 2016, International journal of biological sciences.

[25]  Farid Najafi,et al.  A Novel Hybrid Classification Model of Genetic Algorithms, Modified k-Nearest Neighbor and Developed Backpropagation Neural Network , 2014, PloS one.

[26]  K. Bae,et al.  Magnolol inhibits angiogenesis by regulating ROS-mediated apoptosis and the PI3K/AKT/mTOR signaling pathway in mES/EB-derived endothelial-like cells. , 2013, International journal of oncology.

[27]  Lei Chen,et al.  A Binary Classifier for the Prediction of EC Numbers of Enzymes , 2019, Current Proteomics.

[28]  Lin Lu,et al.  Identification of synthetic lethality based on a functional network by using machine learning algorithms , 2018, Journal of cellular biochemistry.

[29]  H. Schöler,et al.  Zfp296 Is a Novel, Pluripotent-Specific Reprogramming Factor , 2012, PloS one.

[30]  J. Trent,et al.  Targeting 6-Phosphofructo-2-Kinase (PFKFB3) as a Therapeutic Strategy against Cancer , 2013, Molecular Cancer Therapeutics.

[31]  David W Mount,et al.  Using the Basic Local Alignment Search Tool (BLAST). , 2007, CSH protocols.

[32]  D. Tollervey,et al.  Trf4 targets ncRNAs from telomeric and rDNA spacer regions and functions in rDNA copy number control , 2007, The EMBO journal.

[33]  F. Basolo,et al.  Analysis of circulating tumor DNA does not improve the clinical management of patients with locally advanced and metastatic papillary thyroid carcinoma , 2018, Head & neck.

[34]  C. Tu,et al.  Short hairpin RNA (shRNA) of type 2 interleukin-1 receptor (IL1R2) inhibits the proliferation of human osteosarcoma U-2 OS cells , 2014, Medical Oncology.

[35]  M. Lesina,et al.  The immune network in pancreatic cancer development and progression , 2014, Oncogene.

[36]  O. Medalia,et al.  Experimental analysis of co‐evolution within protein complexes: The yeast exosome as a model , 2013, Proteins.

[37]  Jan Komorowski,et al.  BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm486 Data and text mining Monte Carlo , 2022 .

[38]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[39]  F. Opperdoes,et al.  Tissue Distribution and Evolution of Fructosamine 3-Kinase and Fructosamine 3-Kinase-related Protein* , 2004, Journal of Biological Chemistry.

[40]  Jialiang Yang,et al.  Identify Key Sequence Features to Improve CRISPR sgRNA Efficacy , 2017, IEEE Access.

[41]  S. Stöckler‐Ipsiroglu,et al.  Guanidinoacetate methyltransferase (GAMT) deficiency: non-invasive enzymatic diagnosis of a newly recognized inborn error of metabolism. , 2000, Clinica chimica acta; international journal of clinical chemistry.

[42]  Huan Liu,et al.  Incremental Feature Selection , 1998, Applied Intelligence.

[43]  C. Scrideli,et al.  Altered expression of noncanonical Wnt pathway genes in paediatric and adult adrenocortical tumours , 2014, Clinical endocrinology.

[44]  Aleksander Øhrn,et al.  Discernibility and Rough Sets in Medicine: Tools and Applications , 2000 .

[45]  Expression of T/NK-Cell and Plasma Cell Antigens in Nonhematopoietic Epithelioid Neoplasms , 2003 .

[46]  Ziteng Zhang,et al.  RasGRP3, a Ras guanyl releasing protein 3 that contributes to malignant proliferation and aggressiveness in human esophageal squamous cell carcinoma , 2018, Clinical and experimental pharmacology & physiology.

[47]  J. Kim,et al.  Angiopoietin-2 promotes ER+ breast cancer cell survival in bone marrow niche , 2016, Endocrine-related cancer.

[48]  S. Wiemann,et al.  Differential Response to α-Oxoaldehydes in Tamoxifen Resistant MCF-7 Breast Cancer Cells , 2014, PloS one.

[49]  M. Katoh,et al.  Identification and characterization of human PRICKLE1 and PRICKLE2 genes as well as mouse Prickle1 and Prickle2 genes homologous to Drosophila tissue polarity gene prickle. , 2003, International journal of molecular medicine.

[50]  Bernd Thiede,et al.  Consolidation of proteomics data in the Cancer Proteomics database , 2015, Proteomics.

[51]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[52]  Lei Chen,et al.  iATC-NRAKEL: an efficient multi-label classifier for recognizing anatomical therapeutic chemical classes of drugs , 2019, Bioinform..

[53]  P. Chu,et al.  Expression of T/NK-cell and plasma cell antigens in nonhematopoietic epithelioid neoplasms. An immunohistochemical study of 447 cases. , 2003, American journal of clinical pathology.

[54]  K. Ogawa,et al.  Myocardial norepinephrine and cyclic amp concentration following myocardial ischemia--relation to ventricular fibrillation and sudden death. , 1983, Japanese circulation journal.

[55]  Michael Schmuker,et al.  A neuromorphic network for generic multivariate data classification , 2014, Proceedings of the National Academy of Sciences.

[56]  B. Kuster,et al.  Mass-spectrometry-based draft of the human proteome , 2014, Nature.

[57]  B. Mellström,et al.  Kcnip1 a Ca²⁺-dependent transcriptional repressor regulates the size of the neural plate in Xenopus. , 2015, Biochimica et biophysica acta.

[58]  T. Fujita,et al.  Complement regulatory proteins in normal human esophagus and esophageal squamous cell carcinoma , 2004, Journal of gastroenterology and hepatology.

[59]  Phillip Stafford,et al.  Immunosignature system for diagnosis of cancer , 2014, Proceedings of the National Academy of Sciences.

[60]  Xiaoyong Pan,et al.  Gene expression differences among different MSI statuses in colorectal cancer , 2018, International journal of cancer.

[61]  Xiaoyong Pan,et al.  Tissue Expression Difference between mRNAs and lncRNAs , 2018, International journal of molecular sciences.

[62]  Melissa J. Landrum,et al.  RefSeq: an update on mammalian reference sequences , 2013, Nucleic Acids Res..

[63]  Jianming Wang,et al.  Accumulated promoter methylation as a potential biomarker for esophageal cancer , 2016, Oncotarget.

[64]  Jan Gorodkin,et al.  Comparing two K-category assignments by a K-category correlation coefficient , 2004, Comput. Biol. Chem..

[65]  B. Gallie,et al.  Expression analysis of 6p22 genomic gain in retinoblastoma , 2006, Genes, chromosomes & cancer.

[66]  E. Ma,et al.  Plasma DNA tissue mapping by genome-wide methylation sequencing for noninvasive prenatal, cancer, and transplantation assessments , 2015, Proceedings of the National Academy of Sciences.

[67]  K. Cibulskis,et al.  Systematic identification of personal tumor-specific neoantigens in chronic lymphocytic leukemia. , 2014, Blood.

[68]  Jan Gorodkin,et al.  Transcriptomic landscape of lncRNAs in inflammatory bowel disease , 2015, Genome Medicine.

[69]  F. Nicolantonio,et al.  Liquid biopsy: monitoring cancer-genetics in the blood , 2013, Nature Reviews Clinical Oncology.

[70]  Lei Chen,et al.  Identification of Drug-Drug Interactions Using Chemical Interactions , 2017 .

[71]  Z. Ram,et al.  Angiogenic Factors in the Cerebrospinal Fluid of Patients with Astrocytic Brain Tumors , 2004, Neurosurgery.

[72]  David S. Johnson,et al.  Approximation algorithms for combinatorial problems , 1973, STOC.

[73]  Lei Chen,et al.  Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms , 2022 .

[74]  M. Manjili,et al.  Tumor immunoediting and immunosculpting pathways to cancer progression. , 2007, Seminars in cancer biology.

[75]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[76]  J. Wrana,et al.  Mink1 Regulates β-Catenin-Independent Wnt Signaling via Prickle Phosphorylation , 2011, Molecular and Cellular Biology.

[77]  Lei Chen,et al.  Identification of Human Membrane Protein Types by Incorporating Network Embedding Methods , 2019, IEEE Access.

[78]  Hongbin Shen,et al.  Large-scale prediction of human protein-protein interactions from amino acid sequence based on latent topic features. , 2010, Journal of proteome research.

[79]  Hong-Bin Shen,et al.  Robust prediction of B-factor profile from sequence using two-stage SVR based on random forest feature selection. , 2009, Protein and peptide letters.