A Stack-based Ensemble Framework for Detecting Cancer MicroRNA Biomarkers

MicroRNAs (miRNAs) play vital roles in biological processes such as RNA splicing and the regulation of gene expression. Studies suggest links between oncogenesis and the expression profiles of some miRNAs, which are differentially expressed between normal and tumor tissues. However, the automatic classification of miRNAs into categories based on the similarity of their expression values has rarely been addressed. This article proposes a framework for solving real-life classification problems on cancer, miRNA, and mRNA expression datasets. In the first stage, a multiobjective optimization framework based on the non-dominated sorting genetic algorithm II (NSGA-II) automatically determines the appropriate classifier type, together with suitable parameter and feature combinations, for classifying a given dataset. In the second stage, a stack-based ensemble technique combines the set of solutions obtained in the first stage into a single solution. The performance of the proposed two-stage approach is evaluated on several cancer and RNA expression profile datasets. Compared with several state-of-the-art approaches, our method achieves higher classification accuracy on these datasets.
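To make the first stage more concrete, the following minimal sketch illustrates the Pareto-dominance test that underlies non-dominated sorting in NSGA-II. It is not the authors' implementation: the two objectives (maximize accuracy, minimize the number of selected features), the `Candidate` type, and all values in `population` are assumptions chosen for illustration only, since the abstract does not state which objectives are optimized.

```python
from typing import List, NamedTuple


class Candidate(NamedTuple):
    """One hypothetical first-stage solution (all fields are illustrative)."""
    classifier: str    # assumed label for the classifier type
    n_features: int    # number of selected features (assumed objective: minimize)
    accuracy: float    # validation accuracy (assumed objective: maximize)


def dominates(a: Candidate, b: Candidate) -> bool:
    """Return True if `a` is no worse than `b` on both objectives
    and strictly better on at least one."""
    no_worse = a.accuracy >= b.accuracy and a.n_features <= b.n_features
    strictly_better = a.accuracy > b.accuracy or a.n_features < b.n_features
    return no_worse and strictly_better


def first_front(pop: List[Candidate]) -> List[Candidate]:
    """Return the candidates that no other candidate dominates
    (the first non-dominated front), preserving input order."""
    return [c for c in pop if not any(dominates(o, c) for o in pop)]


# Made-up population; the numbers are not results from the paper.
population = [
    Candidate("svm", 12, 0.91),
    Candidate("knn", 30, 0.88),
    Candidate("tree", 8, 0.85),
    Candidate("svm", 20, 0.93),
    Candidate("knn", 8, 0.80),
]

front = first_front(population)
for c in front:
    print(c)
```

In a two-stage scheme like the one described, the members of such a front would be the natural inputs to the second-stage ensemble, because each represents a different trade-off that no other candidate improves on in every objective.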
