Grouped gene selection and multi-classification of acute leukemia via new regularized multinomial regression.

Diagnosing acute leukemia is a necessary prerequisite to treating it. Multi-class classification of acute leukemia gene expression data helps with this diagnosis by distinguishing three subtypes: B-cell acute lymphoblastic leukemia (BALL), T-cell acute lymphoblastic leukemia (TALL) and acute myeloid leukemia (AML). However, selecting cancer-causing genes remains a challenging problem in multi-class classification. In this paper, weighted gene co-expression networks are employed to divide the genes into groups. Based on the resulting groups, a new regularized multinomial regression with an overlapping group lasso penalty (MROGL) is presented to perform multi-class classification and gene group selection simultaneously. Applying this method to three-class acute leukemia data identifies grouped genes that work synergistically and highlights the overlapping genes shared by different groups. Moreover, MROGL outperforms five other methods in multi-class classification accuracy.
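
To make the idea of a multinomial regression with an overlapping group lasso penalty more concrete, the following is a minimal sketch of how such an objective might be evaluated. It is not the authors' formulation, which the abstract does not state. The function names, the softmax parameterization, the sum-of-group-norms form of the penalty, and the square-root group-size weights are all assumptions made for illustration. The gene groups are taken as given (for example, from a co-expression network analysis) and may share genes.

```python
import numpy as np

def multinomial_nll(X, y, B, b0):
    """Mean negative log-likelihood of a softmax (multinomial) model.

    X: (n, p) expression matrix, y: (n,) integer labels in 0..K-1,
    B: (p, K) coefficient matrix, b0: (K,) intercepts.
    """
    scores = X @ B + b0
    scores = scores - scores.max(axis=1, keepdims=True)  # numerical stability
    log_probs = scores - np.log(np.exp(scores).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(y)), y].mean()

def overlapping_group_penalty(B, groups, weights=None):
    """Sum of weighted Frobenius norms over (possibly overlapping) gene groups.

    groups: list of lists of gene (row) indices; a gene may appear in
    several groups. weights default to sqrt(group size), an assumed choice.
    """
    if weights is None:
        weights = [np.sqrt(len(g)) for g in groups]
    return sum(w * np.linalg.norm(B[g, :]) for g, w in zip(groups, weights))

def objective(X, y, B, b0, groups, lam):
    """Penalized objective: data fit plus lam times the group penalty."""
    return multinomial_nll(X, y, B, b0) + lam * overlapping_group_penalty(B, groups)

# Toy usage: 4 genes, 3 classes, two groups that share gene 1.
rng = np.random.default_rng(0)
X = rng.normal(size=(6, 4))
y = np.array([0, 1, 2, 0, 1, 2])
groups = [[0, 1], [1, 2, 3]]
B = np.zeros((4, 3))
b0 = np.zeros(3)
value_at_zero = objective(X, y, B, b0, groups, lam=0.1)
```

Because each group enters the penalty through an unsquared norm, minimizing an objective of this kind tends to set the coefficients of whole groups to zero at once, which is how group selection and classification can be carried out together. A gene that belongs to several groups, such as gene 1 above, contributes to more than one penalty term.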
