Machine Learning Identifies a Signature of Nine Exosomal RNAs That Predicts Hepatocellular Carcinoma

Simple Summary Hepatocellular carcinoma (HCC) is the third leading cause of cancer-related death worldwide. HCC is often diagnosed at a late stage when treatment effectiveness is limited and its prognosis remains dire. Exosomes are secreted by all living cells, including cancer cells, and contain biological material with potential to highlight disease conditions or dysregulated pathways involved in tumourigenesis. This study employs artificial intelligence methods to identify a signature of exosomal RNAs from 114,602 exosomal RNAs in 118 HCC patients and 112 healthy individuals that can predict HCC. A signature of nine exosomal RNAs, mainly in the immune, platelet/neutrophil and cytoskeletal pathways, was identified to predict HCC with an accuracy of ~85%. Hence, these nine exosomal RNAs have potential to be developed as clinically useful minimally invasive biomarkers for HCC. Abstract Hepatocellular carcinoma (HCC) is the third leading cause of cancer-related death worldwide. Although alpha fetoprotein (AFP) remains a commonly used serological marker of HCC, the sensitivity and specificity of AFP in detecting HCC is often limited. Exosomal RNA has emerged as a promising diagnostic tool for various cancers, but its use in HCC detection has yet to be fully explored. Here, we employed Machine Learning on 114,602 exosomal RNAs to identify a signature that can predict HCC. The exosomal expression data of 118 HCC patients and 112 healthy individuals were stratified split into Training, Validation and Unseen Test datasets. Feature selection was then performed on the initial training dataset using permutation importance, and the predictive performance of the selected features were tested on the validation dataset using Support Vector Machine (SVM) Classifier. A minimum of nine features were identified to be predictive of HCC and these nine features were then evaluated across six different models in an unseen test set. These features, mainly in the immune, platelet/neutrophil and cytoskeletal pathways, exhibited good predictive performance with ROC-AUC from 0.79–0.88 in the unseen test set. Hence, these nine exosomal RNAs have potential to be clinically useful minimally invasive biomarkers for HCC.

[1]  Jiatao Yan,et al.  Machine learning identifies exosome features related to hepatocellular carcinoma , 2022, Frontiers in Cell and Developmental Biology.

[2]  J. Ferlay,et al.  Global burden of primary liver cancer in 2020 and predictions to 2040 , 2022, Journal of hepatology.

[3]  Xuepeng Wang,et al.  Molecular mechanisms and clinical applications of exosomes in prostate cancer , 2022, Biomarker Research.

[4]  Zeeshan Ahmed,et al.  Artificial intelligence and machine learning approaches using gene expression and variant data for personalized medicine , 2022, Briefings Bioinform..

[5]  K. Ghaedi,et al.  NEAT1 can be a diagnostic biomarker in the breast cancer and gastric cancer patients by targeting XIST, hsa-miR-612, and MTRNR2L8: integrated RNA targetome interaction and experimental expression analysis , 2022, Genes and environment : the official journal of the Japanese Environmental Mutagen Society.

[6]  J. Berlanga-Acosta,et al.  Diagnostic Accuracy of Blood-Based Biomarker Panels: A Systematic Review , 2022, Frontiers in Aging Neuroscience.

[7]  M. Khan,et al.  Update on the applications and limitations of alpha-fetoprotein for hepatocellular carcinoma , 2022, World journal of gastroenterology.

[8]  Yan Li,et al.  exoRBase 2.0: an atlas of mRNA, lncRNA and circRNA in extracellular vesicles from human biofluids , 2021, Nucleic Acids Res..

[9]  Mariana Recamonde Mendoza,et al.  Machine learning methods for prediction of cancer driver genes: a survey paper , 2021, Briefings Bioinform..

[10]  Z. Lu,et al.  Integrative analysis of long extracellular RNAs reveals a detection panel of noncoding RNAs for liver cancer , 2021, Theranostics.

[11]  A. Singal,et al.  Update on the Diagnosis and Treatment of Hepatocellular Carcinoma. , 2020, Gastroenterology & hepatology.

[12]  L. Liang,et al.  Y disruption, autosomal hypomethylation and poor male lung cancer survival , 2020, Scientific Reports.

[13]  Farid García,et al.  A comprehensive survey on support vector machine classification: Applications, challenges and trends , 2020, Neurocomputing.

[14]  C. Garrido,et al.  Membrane-bound exosomal HSP70 as a biomarker for detection and monitoring of malignant solid tumours: a pilot study , 2020, Pilot and Feasibility Studies.

[15]  M. Dehmer,et al.  An Introductory Review of Deep Learning for Prediction Models With Big Data , 2020, Frontiers in Artificial Intelligence.

[16]  Amy Makler,et al.  Exosomal biomarkers for cancer diagnosis and patient monitoring , 2020, Expert review of molecular diagnostics.

[17]  Y. Ye,et al.  The threshold of alpha-fetoprotein (AFP) for the diagnosis of hepatocellular carcinoma: A systematic review and meta-analysis , 2020, PloS one.

[18]  Weiyi Wang,et al.  Advances in the early diagnosis of hepatocellular carcinoma , 2020, Genes & diseases.

[19]  Hsin-I Tsai,et al.  Normal Alpha-Fetoprotein Hepatocellular Carcinoma: Are They Really Normal? , 2019, Journal of clinical medicine.

[20]  Shuyang Zhang,et al.  Identification and validation of four hub genes involved in the plaque deterioration of atherosclerosis , 2019, Aging.

[21]  Michael Zhuo Wang,et al.  Overview of Extracellular Vesicles, Their Origin, Composition, Purpose, and Methods for Exosome Isolation and Analysis , 2019, Cells.

[22]  Huajun Zhao,et al.  HCC-Derived Exosomes: Critical Player and Target for Cancer Immune Escape , 2019, Cells.

[23]  Leili Shahriyari,et al.  Effect of normalization methods on the performance of supervised learning algorithms applied to HTSeq-FPKM-UQ data sets: 7SK RNA expression as a predictor of survival in patients with colon adenocarcinoma , 2019, Briefings Bioinform..

[24]  A. Möller,et al.  Breast Cancer‐Derived Exosomes Reflect the Cell‐of‐Origin Phenotype , 2019, Proteomics.

[25]  Y. Zhang,et al.  Exosomes: biogenesis, biologic function and clinical potential , 2019, Cell & Bioscience.

[26]  H. Binder,et al.  The immune contexture of hepatocellular carcinoma predicts clinical outcome , 2018, Scientific Reports.

[27]  Tao Jiang,et al.  Serum exosomal microRNAs combined with alpha‐fetoprotein as diagnostic markers of hepatocellular carcinoma , 2018, Cancer medicine.

[28]  Bing Chen,et al.  exoRBase: a database of circRNA, lncRNA and mRNA in human blood exosomes , 2017, Nucleic Acids Res..

[29]  De-Shuang Huang,et al.  Multi-task ranking SVM for image cosegmentation , 2017, Neurocomputing.

[30]  Jasmin A. Tiro,et al.  An assessment of benefits and harms of hepatocellular carcinoma surveillance in patients with cirrhosis , 2017, Hepatology.

[31]  M. Mittelbrunn,et al.  Role of exosomes in the protection of cellular homeostasis , 2017, Cell adhesion & migration.

[32]  X. Breakefield,et al.  Introduction to Extracellular Vesicles: Biogenesis, RNA Cargo Selection, Content, Release, and Uptake , 2016, Cellular and Molecular Neurobiology.

[33]  J. Raheja,et al.  Indian sign language recognition using SVM , 2016, Pattern Recognition and Image Analysis.

[34]  José Sergio Ruiz Castilla,et al.  Data selection based on decision tree for SVM classification on large data sets , 2015, Appl. Soft Comput..

[35]  So Hee Kang,et al.  Serum exosomal microRNAs as novel biomarkers for hepatocellular carcinoma , 2015, Experimental & Molecular Medicine.

[36]  Yijun Yang,et al.  Neoplastic Reprogramming of Patient‐Derived Adipose Stem Cells by Prostate Cancer Cell‐Associated Exosomes , 2014, Stem cells.

[37]  Michael Eickenberg,et al.  Machine learning for neuroimaging with scikit-learn , 2014, Front. Neuroinform..

[38]  Zuhong Lu,et al.  miRNA in Plasma Exosome is Stable under Different Storage Conditions , 2014, Molecules.

[39]  F. Court,et al.  Schwann cell‐derived exosomes enhance axonal regeneration in the peripheral nervous system , 2013, Glia.

[40]  Roberta Galli,et al.  MicroRNAs bind to Toll-like receptors to induce prometastatic inflammatory response , 2012, Proceedings of the National Academy of Sciences.

[41]  Susumu Goto,et al.  KEGG for integration and interpretation of large-scale molecular data sets , 2011, Nucleic Acids Res..

[42]  Sudhir Srivastava,et al.  A Framework for Evaluating Biomarkers for Early Detection: Validation of Biomarker Panels for Ovarian Cancer , 2011, Cancer Prevention Research.

[43]  Ralf Herwig,et al.  ConsensusPathDB: toward a more complete picture of cell biology , 2010, Nucleic Acids Res..

[44]  Lincoln Stein,et al.  Reactome: a database of reactions, pathways and biological processes , 2010, Nucleic Acids Res..

[45]  Anandarup Roy,et al.  SVM-based hierarchical architectures for handwritten Bangla character recognition , 2009, International Journal on Document Analysis and Recognition (IJDAR).

[46]  Bartek Wilczynski,et al.  Biopython: freely available Python tools for computational molecular biology and bioinformatics , 2009, Bioinform..

[47]  Ralf Herwig,et al.  ConsensusPathDB—a database for integrating human functional interaction networks , 2008, Nucleic Acids Res..

[48]  P. Finn,et al.  Molecular Systems Biology 3; Article number 98; doi:10.1038/msb4100138 Citation: Molecular Systems Biology 3:98 , 2022 .

[49]  Hau-San Wong,et al.  Prediction of protein B-factors using multi-class bounded SVM. , 2007, Protein and peptide letters.

[50]  M. Steinbach,et al.  Introduction to Data Mining , 2019, Scalable Comput. Pract. Exp..

[51]  De-Shuang Huang,et al.  Lidar signal denoising using least-squares support vector machine , 2005, IEEE Signal Processing Letters.

[52]  J. Zucman‐Rossi,et al.  Hepatocellular carcinoma , 1998, Nature Reviews Disease Primers.

[53]  OUP accepted manuscript , 2021, Nucleic Acids Research.

[54]  Danilo Bzdok,et al.  Points of Significance: Statistics versus machine learning , 2018, Nature Methods.

[55]  Manuel V. Hermenegildo,et al.  An Overview of , 2011 .

[56]  J. Sannella,et al.  Alpha-Fetoprotein , 1992, Definitions.