Identification of leukemia stem cell expression signatures through Monte Carlo feature selection strategy and support vector machine

Acute myeloid leukemia (AML) is a type of blood cancer characterized by the rapid growth of immature white blood cells from the bone marrow. Therapy resistance resulting from the persistence of leukemia stem cells (LSCs) are found in numerous patients. Comparative transcriptome studies have been previously conducted to analyze differentially expressed genes between LSC + and LSC − cells. However, these studies mainly focused on a limited number of genes with the most obvious expression differences between the two cell types. We developed a computational approach incorporating several machine learning algorithms, including Monte Carlo feature selection (MCFS), incremental feature selection (IFS), support vector machine (SVM), Repeated Incremental Pruning to Produce Error Reduction (RIPPER), to identify gene expression features specific to LSCs. One thousand 0ne hudred fifty-nine features (genes) were first identified, which can be used to build the optimal SVM classifier for distinguishing LSC + and LSC − cells. Among these 1159 genes, the top 17 genes were identified as LSC-specific biomarkers. In addition, six classification rules were produced by RIPPER algorithm. The subsequent literature review on these features/genes and the classification rules and functional enrichment analyses of the 1159 features/genes confirmed the relevance of extracted genes and rules to the characteristics of LSCs.

[1]  Differential long noncoding RNA/mRNA expression profiling and functional network analysis during osteogenic differentiation of human bone marrow mesenchymal stem cells , 2017, Stem Cell Research & Therapy.

[2]  R. Majeti,et al.  Biology and relevance of human acute myeloid leukemia stem cells. , 2017, Blood.

[3]  Ka Yee Yeung,et al.  The derivation of diagnostic markers of chronic myeloid leukemia progression from microarray data. , 2009, Blood.

[4]  Jan Komorowski,et al.  BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm486 Data and text mining Monte Carlo , 2022 .

[5]  Lei Chen,et al.  Identification of Drug-Drug Interactions Using Chemical Interactions , 2017 .

[6]  F. Tang,et al.  ATF6 safeguards organelle homeostasis and cellular aging in human mesenchymal stem cells , 2018, Cell Discovery.

[7]  Kuan-Teh Jeang,et al.  A Genome-wide Short Hairpin RNA Screening of Jurkat T-cells for Human Proteins Contributing to Productive HIV-1 Replication* , 2009, The Journal of Biological Chemistry.

[8]  S. Winter,et al.  RasGRP1 overexpression in T-ALL increases basal nucleotide exchange on Ras rendering the Ras/PI3K/Akt pathway responsive to protumorigenic cytokines , 2016, Oncogene.

[9]  Seung-Hoon Lee,et al.  CPEB1 modulates differentiation of glioma stem cells via downregulation of HES1 and SIRT1 expression , 2014, Oncotarget.

[10]  A. Harada,et al.  Rab8b Regulates Transport of West Nile Virus Particles from Recycling Endosomes* , 2016, The Journal of Biological Chemistry.

[11]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[12]  Huamei Zhang,et al.  Endogenous and Synthetic ABHD5 Ligands Regulate ABHD5-Perilipin Interactions and Lipolysis in Fat and Muscle. , 2015, Cell metabolism.

[13]  E. Fredlund,et al.  Mass Cytometry and Topological Data Analysis Reveal Immune Parameters Associated with Complications after Allogeneic Stem Cell Transplantation. , 2017, Cell reports.

[14]  S. Fröhling,et al.  BCAT1 restricts αKG levels in AML stem cells leading to IDHmut-like DNA hypermethylation , 2017, Nature.

[15]  Lei Chen,et al.  Predicting Drug Side Effects with Compact Integration of Heterogeneous Networks , 2019 .

[16]  Claude Preudhomme,et al.  A 17-gene stemness score for rapid determination of risk in acute leukaemia , 2016, Nature.

[17]  S. Sugano,et al.  Analysis of RNA decay factor mediated RNA stability contributions on RNA abundance , 2015, BMC Genomics.

[18]  ChenLei,et al.  Analysis of cancer-related lncRNAs using gene ontology and KEGG pathways , 2017 .

[19]  M. Copland,et al.  CD93 Is a Novel Biomarker of Leukemia Stem Cells in Chronic Myeloid Leukemia , 2015 .

[20]  C. Bunker,et al.  Genetic contribution of SCARB1 variants to lipid traits in African Blacks: a candidate gene association study , 2015, BMC Medical Genetics.

[21]  A. Weng,et al.  Leukemia stem cells in T-ALL require active Hif1α and Wnt signaling. , 2015, Blood.

[22]  N. Chung,et al.  Low bone mineral density in adolescents with leukemia after hematopoietic stem cell transplantation: prolonged steroid therapy for GvHD and endocrinopathy after hematopoietic stem cell transplantation might be major concerns? , 2017, Bone Marrow Transplantation.

[23]  Jialiang Yang,et al.  Identify Key Sequence Features to Improve CRISPR sgRNA Efficacy , 2017, IEEE Access.

[24]  Yusuke Nakamura,et al.  DDX31 regulates the p53-HDM2 pathway and rRNA gene transcription through its interaction with NPM1 in renal cell carcinomas. , 2012, Cancer research.

[25]  Xiaohua Hu,et al.  Identifying Patients with Atrioventricular Septal Defect in Down Syndrome Populations by Using Self-Normalizing Neural Networks and Feature Selection , 2018, Genes.

[26]  Hui Xu,et al.  Evidence that high-migration drug-surviving MOLT4 leukemia cells exhibit cancer stem cell-like properties. , 2016, International journal of oncology.

[27]  Gabriele Rossi,et al.  Familial platelet disorder with propensity to acute myelogenous leukemia: Genetic heterogeneity and progression to leukemia via acquisition of clonal chromosome anomalies , 2004, Genes, chromosomes & cancer.

[28]  Ezh2 Controls an Early Hematopoietic Program and Growth and Survival Signaling in Early T Cell Precursor Acute Lymphoblastic Leukemia , 2016, Cell reports.

[29]  Jing Lu,et al.  Analysis and Identification of Aptamer-Compound Interactions with a Maximum Relevance Minimum Redundancy and Nearest Neighbor Algorithm , 2016, BioMed research international.

[30]  Yu-Dong Cai,et al.  Prediction of Protein Cleavage Site with Feature Selection by Random Forest , 2012, PloS one.

[31]  X. Xu,et al.  The viral oncogene Np9 acts as a critical molecular switch for co-activating β-catenin, ERK, Akt and Notch1 and promoting the growth of human leukemia stem/progenitor cells , 2013, Leukemia.

[32]  M. Zöller CD44, Hyaluronan, the Hematopoietic Stem Cell, and Leukemia-Initiating Cells , 2015, Front. Immunol..

[33]  S. Mineishi,et al.  In Vitro Pre-Clinical Validation of Suicide Gene Modified Anti-CD33 Redirected Chimeric Antigen Receptor T-Cells for Acute Myeloid Leukemia , 2016, PloS one.

[34]  Yue Zhang,et al.  ABHD 5 Interacts with BECN 1 to Regulate Autophagy and Tumorigenesis of Colon Cancer Independent of PNPLA 2 , 2016 .

[35]  J. Corbo,et al.  High-density lipoprotein receptor SCARB1 is required for carotenoid coloration in birds , 2017, Proceedings of the National Academy of Sciences.

[36]  D. Beighton,et al.  Utilization of Sialic Acid by Viridans Streptococci , 1996, Journal of dental research.

[37]  E. Morel,et al.  Triglyceride-rich lipoproteins and cytosolic lipid droplets in enterocytes: key players in intestinal physiology and metabolic disorders. , 2014, Biochimie.

[38]  O. Stephens,et al.  GWAS of 972 autologous stem cell recipients with multiple myeloma identifies 11 genetic variants associated with chemotherapy-induced oral mucositis , 2015, Supportive Care in Cancer.

[39]  Lei Chen,et al.  Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms , 2022 .

[40]  Yue Zhang,et al.  ABHD5 interacts with BECN1 to regulate autophagy and tumorigenesis of colon cancer independent of PNPLA2 , 2016, Autophagy.

[41]  Lin Lu,et al.  Identification of synthetic lethality based on a functional network by using machine learning algorithms , 2018, Journal of cellular biochemistry.

[42]  Zhenbo Wang,et al.  Rab3A, Rab27A, and Rab35 regulate different events during mouse oocyte meiotic maturation and activation , 2016, Histochemistry and Cell Biology.

[43]  Ratana Somrongthong,et al.  The Influence of Chronic Illness and Lifestyle Behaviors on Quality of Life among Older Thais , 2016, BioMed research international.

[44]  I. Weissman,et al.  CD96 is a leukemic stem cell-specific marker in human acute myeloid leukemia , 2007, Proceedings of the National Academy of Sciences.

[45]  G. Chan,et al.  Favorable outcomes of unrelated cord blood transplant for pediatric acute myeloid leukemia in Hong Kong , 2013 .

[46]  L. Crews,et al.  Selective elimination of leukemia stem cells: hitting a moving target. , 2013, Cancer letters.

[47]  J. Lafuente,et al.  Co-Administration of TiO2 Nanowired Mesenchymal Stem Cells with Cerebrolysin Potentiates Neprilysin Level and Reduces Brain Pathology in Alzheimer’s Disease , 2017, Molecular Neurobiology.

[48]  K. Ballen,et al.  Cord blood transplant for acute myeloid leukaemia , 2016, British journal of haematology.

[49]  T. Holyoake,et al.  Targeting survival pathways in chronic myeloid leukaemia stem cells , 2013, British journal of pharmacology.

[50]  T. Kitamura,et al.  Aberrant expression of RasGRP1 cooperates with gain-of-function NOTCH1 mutations in T-cell leukemogenesis , 2012, Leukemia.

[51]  E. Schwartz,et al.  Activation of 2',5'-oligoadenylate synthetase activity on induction of HL-60 leukemia cell differentiation , 1989, Molecular and cellular biology.

[52]  L. Brissette,et al.  Gender- and region-specific alterations in bone metabolism in Scarb1-null female mice. , 2014, The Journal of endocrinology.

[53]  S. Vishnubhatla,et al.  Correlation of Serum Immunoglobulins with Infection-Related Parameters During Induction Chemotherapy of Pediatric Acute Myeloid Leukemia: A Prospective Study , 2015, Pediatric hematology and oncology.

[54]  O. Reina,et al.  Circadian- and UPR-dependent control of CPEB4 mediates a translational response to counteract hepatic steatosis under ER stress , 2017, Nature Cell Biology.

[55]  R. Luo,et al.  The microRNA-1246 promotes metastasis in non-small cell lung cancer by targeting cytoplasmic polyadenylation element-binding protein 4 , 2015, Diagnostic Pathology.

[56]  Marco Y. Hein,et al.  A Human Interactome in Three Quantitative Dimensions Organized by Stoichiometries and Abundances , 2015, Cell.

[57]  B. Jastorff,et al.  ATP depletion, purine riboside triphosphate accumulation and rat thymocyte death induced by purine riboside. , 1999, Toxicology letters.

[58]  Lei Chen,et al.  Identification of gene expression signatures across different types of neural stem cells with the Monte‐Carlo feature selection method , 2018, Journal of cellular biochemistry.

[59]  Jing Li,et al.  Aberrant Expression of Splicing Factors in Newly Diagnosed Acute Myeloid Leukemia , 2012, Oncology Research and Treatment.

[60]  H. Leonard,et al.  Bone Mineral Content and Density in Rett Syndrome and Their Contributing Factors , 2011, Pediatric Research.

[61]  Ondřej Polanský,et al.  Different roles of CD4, CD8 and γδ T‐lymphocytes in naive and vaccinated chickens during Salmonella Enteritidis infection , 2017, Proteomics.

[62]  E. Campo,et al.  Common variants at 2q37.3, 8q24.21, 15q21.3, and 16q24.1 influence chronic lymphocytic leukemia risk , 2010, Nature Genetics.

[63]  R. Bhatia,et al.  Role of SIRT1 in the growth and regulation of normal hematopoietic and leukemia stem cells , 2015, Current opinion in hematology.

[64]  H. Meijer,et al.  Specificity factors in cytoplasmic polyadenylation , 2013, Wiley interdisciplinary reviews. RNA.

[65]  Lincoln D. Stein,et al.  Identification of pre-leukemic hematopoietic stem cells in acute leukemia , 2014, Nature.

[66]  D. Steinhilber,et al.  5-Lipoxygenase is a candidate target for therapeutic management of stem cell-like cells in acute myeloid leukemia. , 2014, Cancer research.

[67]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[68]  Z. Weng,et al.  The H3K4-Methyl Epigenome Regulates Leukemia Stem Cell Oncogenic Potential. , 2015, Cancer cell.

[69]  D. Gudbjartsson,et al.  Rare SCARB1 mutations associate with high-density lipoprotein cholesterol but not with coronary artery disease , 2018, European heart journal.

[70]  C. Ichim,et al.  Kinase‐Independent Mechanisms of Resistance of Leukemia Stem Cells to Tyrosine Kinase Inhibitors , 2014, Stem cells translational medicine.

[71]  L. Kaderali,et al.  ABHD5/CGI-58, the Chanarin-Dorfman Syndrome Protein, Mobilises Lipid Stores for Hepatitis C Virus Production , 2016, PLoS pathogens.

[72]  G. Nolan,et al.  Jak1 Integrates Cytokine Sensing to Regulate Hematopoietic Stem Cell Function and Stress Hematopoiesis. , 2017, Cell stem cell.

[73]  Gang Huang,et al.  Leukaemogenic effects of Ptpn11 activating mutations in the stem cell microenvironment , 2016, Nature.

[74]  Dajiang J. Liu,et al.  Rare variant in scavenger receptor BI raises HDL cholesterol and increases risk of coronary heart disease , 2016, Science.

[75]  Ian H. Witten,et al.  Data mining in bioinformatics using Weka , 2004, Bioinform..

[76]  G. Massonnet,et al.  Erosion of the chronic myeloid leukaemia stem cell pool by PPARγ agonists , 2013, Nature.

[77]  Yu-Dong Cai,et al.  Analysis and Prediction of Nitrated Tyrosine Sites with the mRMR Method and Support Vector Machine Algorithm , 2016 .

[78]  H. Saitoh,et al.  Establishment of a human cell line stably overexpressing mouse Nip45 and characterization of Nip45 subcellular localization. , 2013, Biochemical and biophysical research communications.

[79]  Yaping Wang,et al.  LncRNA NALT interaction with NOTCH1 promoted cell proliferation in pediatric T cell acute lymphoblastic leukemia , 2015, Scientific Reports.

[80]  Tao Huang,et al.  Analysis of cancer-related lncRNAs using gene ontology and KEGG pathways , 2017, Artif. Intell. Medicine.

[81]  Sandip Kar Unraveling Cell-Cycle Dynamics in Cancer. , 2016, Cell systems.

[82]  S. Aras,et al.  Loss of ABHD5 promotes the aggressiveness of prostate cancer cells , 2017, Scientific Reports.

[83]  Xiaoyong Pan,et al.  Gene expression differences among different MSI statuses in colorectal cancer , 2018, International journal of cancer.

[84]  M. Labopin,et al.  Comparison of umbilical cord blood allogeneic stem cell transplantation vs. auto‐SCT for adult acute myeloid leukemia patients in second complete remission at transplant: a retrospective study on behalf of the SFGM‐TC , 2015, European journal of haematology.

[85]  Jing Lu,et al.  A similarity-based method for prediction of drug side effects with heterogeneous information. , 2018, Mathematical biosciences.

[86]  A. Jauch,et al.  Reduced hematopoietic stem cell frequency predicts outcome in acute myeloid leukemia , 2017, Haematologica.

[87]  O. Margalit,et al.  Gene expression profiles of AML derived stem cells; similarity to hematopoietic stem cells , 2006, Leukemia.

[88]  J. Tyson,et al.  Cell-cycle transitions: a common role for stoichiometric inhibitors , 2017, Molecular biology of the cell.

[89]  Ø. Bruserud,et al.  The Complexity of Targeting PI3K-Akt-mTOR Signalling in Human Acute Myeloid Leukaemia: The Importance of Leukemic Cell Heterogeneity, Neighbouring Mesenchymal Stem Cells and Immunocompetent Cells , 2016, Molecules.

[90]  Karl Rihaczek,et al.  1. WHAT IS DATA MINING? , 2019, Data Mining for the Social Sciences.

[91]  R I Richards,et al.  Genetic heterogeneity in familial acute myelogenous leukemia: evidence for a second locus at chromosome 16q21-23.2. , 1997, American journal of human genetics.

[92]  S. Tohda [Biomarker for Hematopoietic Tumors--Aiming for Personalized Diagnosis of Leukemia Stem Cells]. , 2015, Rinsho byori. The Japanese journal of clinical pathology.

[93]  Shuo Xiao,et al.  Progesterone Receptor-Mediated Regulation of N-Acetylneuraminate Pyruvate Lyase (NPL) in Mouse Uterine Luminal Epithelium and Nonessential Role of NPL in Uterine Function , 2013, PloS one.

[94]  G. Dini,et al.  Rare viral infections in children receiving hemopoietic stem cell transplant , 2008, Bone Marrow Transplantation.

[95]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[96]  Lei Chen,et al.  A Binary Classifier for the Prediction of EC Numbers of Enzymes , 2019, Current Proteomics.

[97]  H. Klein,et al.  Endoplasmic reticulum protein GliPR1 regulates G protein signaling and the cell cycle and is overexpressed in AML. , 2013, Oncology reports.

[98]  Tao Huang,et al.  Identification of the core regulators of the HLA I-peptide binding process , 2017, Scientific Reports.

[99]  Lai Wei,et al.  Analysis and prediction of drug–drug interaction by minimum redundancy maximum relevance and incremental feature selection , 2017, Journal of biomolecular structure & dynamics.

[100]  Michiko Hirata,et al.  The low-density lipoprotein receptor-related protein 10 is a negative regulator of the canonical Wnt/beta-catenin signaling pathway. , 2010, Biochemical and biophysical research communications.

[101]  Nousheen Zaidi,et al.  Leukemia cells display lower levels of intracellular cholesterol irrespective of the exogenous cholesterol availability. , 2016, Clinica chimica acta; international journal of clinical chemistry.

[102]  Lei Chen,et al.  Gene expression profiling gut microbiota in different races of humans , 2016, Scientific Reports.

[103]  Xi Zhang,et al.  CAR-T cells and allogeneic hematopoietic stem cell transplantation for relapsed/refractory B-cell acute lymphoblastic leukemia. , 2017, Immunotherapy.

[104]  Lin Lu,et al.  Predicting Citrullination Sites in Protein Sequences Using mRMR Method and Random Forest Algorithm. , 2017, Combinatorial chemistry & high throughput screening.

[105]  Mark A. Dawson,et al.  BET inhibitor resistance emerges from leukaemia stem cells , 2015, Nature.

[106]  S. Choe,et al.  Aberrant proteomic expression of NSRP70 and its clinical implications and connection to the transcriptional level in adult acute leukemia. , 2014, Leukemia research.

[107]  Agnieszka Nowak-Brzezińska,et al.  The Monte Carlo feature selection and interdependency discovery is unbiased , 2011 .