A Multi-classifier Model to Identify Mitochondrial Respiratory Gene Signatures in Human Cancer

Whether alteration of mitochondrial gene expression can serve as an effective molecular signature for cancer classification currently remains controversy. To tackle this challenge, here we present a multi-classifier model to identify mitochondrial aberrant gene signatures and then assess the effectiveness on cancer classification. Specifically, we first applied a supervised learning model, Empirical Bayes statistics, to detect differentially expressed genes from a liver cancer mitochondrial gene expression dataset (GEO accession: GSE64505). Next, we applied two well-known classifiers, Prediction Analysis of Microarrays (PAM) and Random Forest (RF), with several folds of cross-validation, to find molecular signatures that can classify liver cancer samples from normal samples. We obtained a mitochondrial molecular signature comprising 25 genes. Classification accuracy was 87.5% by PAM classifier (5 fold cross-validation with 30 repeats), while it was 87.5% and 75.0% by Random Forest with 5- and 2-fold cross-validations (30 repeats), respectively. We further performed literature mining and Gene Set Enrichment Analysis (GSEA) to evaluate the biological significance and novelty of genes in this gene signature.

[1]  E. Robin,et al.  Mitochondrial DNA molecules and virtual number of mitochondria per cell in mammalian cells , 1988, Journal of cellular physiology.

[2]  B F Lang,et al.  Mitochondrial genome evolution and the origin of eukaryotes. , 1999, Annual review of genetics.

[3]  E. Shoubridge Nuclear genetic defects of oxidative phosphorylation. , 2001, Human molecular genetics.

[4]  S. Dimauro,et al.  Mitochondrial respiratory-chain diseases. , 2003, The New England journal of medicine.

[5]  J. Wiebe,et al.  Expression of progesterone metabolizing enzyme genes (AKR1C1, AKR1C2, AKR1C3, SRD5A1, SRD5A2) is altered in human breast carcinoma , 2004, BMC Cancer.

[6]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[7]  M. Bertrand,et al.  Comparative expression analysis of the MAGED genes during embryogenesis and brain development , 2004, Developmental dynamics : an official publication of the American Association of Anatomists.

[8]  Laura C. Greaves,et al.  Mitochondrial DNA mutations in human disease , 2006, IUBMB life.

[9]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[10]  T. Godfrey,et al.  Whole genome exon arrays identify differential expression of alternatively spliced, cancer-related genes in lung cancer , 2008, Nucleic acids research.

[11]  V. Rotter,et al.  Modulated expression of WFDC1 during carcinogenesis and cellular senescence , 2008, Carcinogenesis.

[12]  Wu-ling Zhu,et al.  Abnormal expression of fibrinogen gamma (FGG) and plasma level of fibrinogen in patients with hepatocellular carcinoma. , 2009, Anticancer research.

[13]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[14]  F. Lehner,et al.  Inhibition of the Liver Enriched Protein FOXA2 Recovers HNF6 Activity in Human Colon Carcinoma and Liver Hepatoma Cells , 2010, PloS one.

[15]  Mitchell Ho Advances in Liver Cancer Antibody Therapies : A Focus on Glypican-3 and Mesothelin , 2011 .

[16]  A. Visel,et al.  A liver enhancer in the fibrinogen gene cluster. , 2011, Blood.

[17]  J. Schulze,et al.  Basal and Regulatory Promoter Studies of the AKR1C3 Gene in Relation to Prostate Cancer , 2012, Front. Pharmacol..

[18]  Zhaoyu Li,et al.  Foxa1 and Foxa2 Are Essential for Sexual Dimorphism in Liver Cancer , 2012, Cell.

[19]  Eva Falck,et al.  Expression patterns of Phf5a/PHF5A and Gja1/GJA1 in rat and human endometrial cancer , 2013, Cancer Cell International.

[20]  Vincenzo Lagani,et al.  Biomarker signature identification in “omics” data with multi-class outcome , 2013, Computational and structural biotechnology journal.

[21]  Caroline Mollevi,et al.  Specific Extracellular Matrix Remodeling Signature of Colon Hepatic Metastases , 2013, PloS one.

[22]  Chen-Ching Lin,et al.  Dynamic protein interaction modules in human hepatocellular carcinoma progression , 2013, BMC Systems Biology.

[23]  Charity W. Law,et al.  voom: precision weights unlock linear model analysis tools for RNA-seq read counts , 2014, Genome Biology.

[24]  Lei Yang,et al.  Quantitative Evaluation of Aldo–keto Reductase Expression in Hepatocellular Carcinoma (HCC) Cell Lines , 2013, Genom. Proteom. Bioinform..

[25]  S. Dakhel,et al.  S100P antibody-mediated therapy as a new promising strategy for the treatment of pancreatic cancer , 2014, Oncogenesis.

[26]  Anirban Mukhopadhyay,et al.  A Survey and Comparative Study of Statistical Tests for Identifying Differential Expression from Microarray Data , 2014, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[27]  E. Giovannetti,et al.  Role of CYB5A in pancreatic cancer prognosis and autophagy modulation. , 2014, Journal of the National Cancer Institute.

[28]  Chiaki Nakarai,et al.  Expression of AKR1C3 and CNN3 as markers for detection of lymph node metastases in colorectal cancer , 2014, Clinical and Experimental Medicine.

[29]  Kristina L. Blanke,et al.  Polymorphisms in the carcinogen detoxification genes CYB5A and CYB5R3 and breast cancer risk in African American women , 2014, Cancer Causes & Control.

[30]  Young-Kyoung Lee,et al.  Identification of a mitochondrial defect gene signature reveals NUPR1 as a key regulator of liver cancer progression , 2015, Hepatology.

[31]  Ujjwal Maulik,et al.  RANWAR: Rank-Based Weighted Association Rule Mining From Gene Expression and Methylation Data , 2015, IEEE Transactions on NanoBioscience.

[32]  Kaori Tanaka,et al.  ALDH1A1-overexpressing cells are differentiated cells but not cancer stem or progenitor cells in human hepatocellular carcinoma , 2015, Oncotarget.

[33]  H. Klocker,et al.  Putative Prostate Cancer Risk SNP in an Androgen Receptor‐Binding Site of the Melanophilin Gene Illustrates Enrichment of Risk SNPs in Androgen Receptor Target Sites , 2015, Human mutation.

[34]  F. Sonohara,et al.  Prognostic significance of AKR1B10 gene expression in hepatocellular carcinoma and surrounding non-tumorous liver tissue. , 2016, Oncology letters.

[35]  B. Frycz,et al.  Transcript level of AKR1C3 is down-regulated in gastric cancer. , 2016, Biochemistry and cell biology = Biochimie et biologie cellulaire.

[36]  M. Tsuboi,et al.  The transcription factor HOXB7 regulates ERK kinase activity and thereby stimulates the motility and invasiveness of pancreatic cancer cells , 2017, The Journal of Biological Chemistry.

[37]  S. Pearce,et al.  Regulation of Mammalian Mitochondrial Gene Expression: Recent Advances , 2017, Trends in biochemical sciences.

[38]  Archibold Mposhi,et al.  Regulation of mitochondrial gene expression, the epigenetic enigma. , 2017, Frontiers in bioscience.

[39]  L. Niu,et al.  Detecting signatures of selection within the Tibetan sheep mitochondrial genome , 2017, Mitochondrial DNA. Part A, DNA mapping, sequencing, and analysis.

[40]  X. He,et al.  Mesothelin promotes epithelial-to-mesenchymal transition and tumorigenicity of human lung cancer and mesothelioma cells , 2017, Molecular Cancer.

[41]  M. Weerts,et al.  Sensitive detection of mitochondrial DNA variants for analysis of mitochondrial DNA-enriched extracts from frozen tumor tissue , 2018, Scientific Reports.

[42]  Qingchong Qiu,et al.  CHI3L1 promotes tumor progression by activating TGF-β signaling pathway in hepatocellular carcinoma , 2018, Scientific Reports.

[43]  R. Matsunuma,et al.  DPYSL3 modulates mitosis, migration, and epithelial-to-mesenchymal transition in claudin-low breast cancer , 2018, Proceedings of the National Academy of Sciences.

[44]  Saurav Mallik,et al.  Integrating Multiple Data Sources for Combinatorial Marker Discovery: A Study in Tumorigenesis , 2018, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[45]  G. Tiscia,et al.  Human Fibrinogen: Molecular and Genetic Aspects of Congenital Disorders , 2018, International journal of molecular sciences.

[46]  Wenyu Li,et al.  Identification of core genes in ovarian cancer by an integrative meta-analysis , 2018, Journal of Ovarian Research.

[47]  Z. Nwosu,et al.  Liver cancer cell lines distinctly mimic the metabolic gene expression pattern of the corresponding human tumours , 2018, Journal of Experimental & Clinical Cancer Research.

[48]  Zhongming Zhao,et al.  Identification of gene signatures from RNA-seq data using Pareto-optimal cluster algorithm , 2018, BMC Systems Biology.

[49]  Wen-Wen Lv,et al.  Effects of PKM2 on global metabolic changes and prognosis in hepatocellular carcinoma: from gene expression to drug discovery , 2018, BMC Cancer.

[50]  K. Mrozik,et al.  N-cadherin in cancer metastasis, its emerging role in haematological malignancies and potential as a therapeutic target in cancer , 2018, BMC Cancer.

[51]  Zhongming Zhao,et al.  Distance based knowledge retrieval through rule mining for complex biomarker recognition from tri-omics profiles , 2019 .

[52]  Zhongming Zhao,et al.  Multi-Objective Optimized Fuzzy Clustering for Detecting Cell Clusters from Single-Cell Expression Profiles , 2019, Genes.

[53]  Yanlin Huang,et al.  Promising diagnostic and prognostic value of E2Fs in human hepatocellular carcinoma , 2019, Cancer management and research.