Consistent gene signature of schizophrenia identified by a novel feature selection strategy from comprehensive sets of transcriptomic data

The etiology of schizophrenia (SCZ) is regarded as one of the most fundamental puzzles in current medical research, and its diagnosis is limited by the lack of objective molecular criteria. Although plenty of studies were conducted, SCZ gene signatures identified by these independent studies are found highly inconsistent. As one of the most important factors contributing to this inconsistency, the feature selection methods used currently do not fully consider the reproducibility among the signatures discovered from different datasets. Therefore, it is crucial to develop new bioinformatics tools of novel strategy for ensuring a stable discovery of gene signature for SCZ. In this study, a novel feature selection strategy (1) integrating repeated random sampling with consensus scoring and (2) evaluating the consistency of gene rank among different datasets was constructed. By systematically assessing the identified SCZ signature comprising 135 differentially expressed genes, this newly constructed strategy demonstrated significantly enhanced stability and better differentiating ability compared with the feature selection methods popular in current SCZ research. Based on a first-ever assessment on methods' reproducibility cross-validated by independent datasets from three representative studies, the new strategy stood out among the popular methods by showing superior stability and differentiating ability. Finally, 2 novel and 17 previously reported transcription factors were identified and showed great potential in revealing the etiology of SCZ. In sum, the SCZ signature identified in this study would provide valuable clues for discovering diagnostic molecules and potential targets for SCZ.

[1]  M. Barnes,et al.  Analysis of gene expression in two large schizophrenia cohorts identifies multiple changes associated with nerve terminal function , 2009, Molecular Psychiatry.

[2]  E. Thomas,et al.  Molecular profiling of antipsychotic drug function , 2006, Molecular Neurobiology.

[3]  B. Pickard,et al.  Schizophrenia biomarkers: Translating the descriptive into the diagnostic , 2015, Journal of psychopharmacology.

[4]  Norio Ozaki,et al.  Immunohistochemical evaluation of the GABAergic neuronal system in the prefrontal cortex of a DISC1 knockout mouse model of schizophrenia , 2016, Synapse.

[5]  Xing Chen,et al.  A novel relationship for schizophrenia, bipolar and major depressive disorder Part 3: Evidence from chromosome 3 high density association screen , 2018, The Journal of comparative neurology.

[6]  Martin J. Schmidt,et al.  Immune system gene dysregulation in autism and schizophrenia , 2012, Developmental neurobiology.

[7]  Dmitri Volfson,et al.  STEP Levels Are Unchanged in Pre-Frontal Cortex and Associative Striatum in Post-Mortem Human Brain Samples from Subjects with Schizophrenia, Bipolar Disorder and Major Depressive Disorder , 2015, PloS one.

[8]  David J. Porteous,et al.  Association Between Schizophrenia-Related Polygenic Liability and the Occurrence and Level of Mood-Incongruent Psychotic Symptoms in Bipolar Disorder , 2017, JAMA psychiatry.

[9]  Josef Parnas,et al.  Identity Disturbance, Feelings of Emptiness, and the Boundaries of the Schizophrenia Spectrum , 2018, Schizophrenia bulletin.

[10]  V. Haroutunian,et al.  Variations in differential gene expression patterns across multiple brain regions in schizophrenia , 2005, Schizophrenia Research.

[11]  Sirintra Nakjang,et al.  Novel molecular subgroups for clinical classification and outcome prediction in childhood medulloblastoma: a cohort study , 2017, The Lancet. Oncology.

[12]  Pedro Larrañaga,et al.  Filter versus wrapper gene selection approaches in DNA microarray domains , 2004, Artif. Intell. Medicine.

[13]  Feng Zhu,et al.  Clinical trials, progression-speed differentiating features and swiftness rule of the innovative targets of first-in-class drugs , 2019, Briefings Bioinform..

[14]  Sarven Sabunciyan,et al.  Putative psychosis genes in the prefrontal cortex: combined analysis of gene expression microarrays , 2008, BMC psychiatry.

[15]  Anat Levit,et al.  STRUCTURE OF THE D2 DOPAMINE RECEPTOR BOUND TO THE ATYPICAL ANTIPSYCHOTIC DRUG RISPERIDONE , 2018, Nature.

[16]  Murray J. Cairns,et al.  Upregulation of Dicer and MicroRNA Expression in the Dorsolateral Prefrontal Cortex Brodmann Area 46 in Schizophrenia , 2011, Biological Psychiatry.

[17]  Benjamin M. Bolstad,et al.  affy - analysis of Affymetrix GeneChip data at the probe level , 2004, Bioinform..

[18]  Jesse Gillis,et al.  Genome-wide expression profiling of schizophrenia using a large combined cohort , 2011, Molecular Psychiatry.

[19]  Jens Roat Kultima,et al.  Potential of fecal microbiota for early‐stage detection of colorectal cancer , 2014 .

[20]  Kazuya Iwamoto,et al.  Altered expression of mitochondria-related genes in postmortem brains of patients with bipolar disorder or schizophrenia, as revealed by large-scale DNA microarray analysis. , 2005, Human molecular genetics.

[21]  Pat Levitt,et al.  Molecular Evidence for Increased Expression of Genes Related to Immune and Chaperone Function in the Prefrontal Cortex in Schizophrenia , 2007, Biological Psychiatry.

[22]  Mao Sheng Yang,et al.  Association between schizophrenia and single nucleotide polymorphisms in lipoprotein lipase gene in a Han Chinese population , 2011, Psychiatric genetics.

[23]  L. Ein-Dor,et al.  Thousands of samples are needed to generate a robust gene list for predicting outcome in cancer. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Martin Kidd,et al.  Changes in erythrocyte membrane fatty acids during a clinical trial of eicosapentaenoic acid (EPA) supplementation in schizophrenia , 2009, Metabolic Brain Disease.

[25]  T. Hashimoto,et al.  Molecular mechanisms contributing to dendritic spine alterations in the prefrontal cortex of subjects with schizophrenia , 2006, Molecular Psychiatry.

[26]  Jacques Corbeil,et al.  Comparative gene expression analysis of blood and brain provides concurrent validation of SELENBP1 up-regulation in schizophrenia , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Sam Richman,et al.  An online database for brain disease research , 2006, BMC Genomics.

[28]  Bo Li,et al.  NOREVA: normalization and evaluation of MS-based metabolomics data , 2017, Nucleic Acids Res..

[29]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[30]  Susan Michie,et al.  Clinical and cost-effectiveness of an intervention for reducing cholesterol and cardiovascular risk for people with severe mental illness in English primary care: a cluster randomised controlled trial. , 2018, The lancet. Psychiatry.

[31]  Tingting Fu,et al.  Therapeutic target database update 2018: enriched resource for facilitating bench-to-clinic research of targeted therapeutics , 2017, Nucleic Acids Res..

[32]  Stelios Pavlidis,et al.  Unique Immune Gene Expression Patterns in Bronchoalveolar Lavage and Tumor Adjacent Non-Neoplastic Lung Tissue in Non-Small Cell Lung Cancer , 2018, Front. Immunol..

[33]  Yong-Gang Yao,et al.  SZDB: A Database for Schizophrenia Genetic Research , 2016, Schizophrenia bulletin.

[34]  Ronald L Klein,et al.  Cre‐dependent AAV vectors for highly targeted expression of disease‐related proteins and neurodegeneration in the substantia nigra , 2018, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[35]  David A Lewis,et al.  Transcriptome alterations in the prefrontal cortex of subjects with schizophrenia who committed suicide. , 2008, Neuropsychopharmacologia Hungarica : a Magyar Pszichofarmakologiai Egyesulet lapja = official journal of the Hungarian Association of Psychopharmacology.

[36]  Steven R. Head,et al.  Molecular profiles of schizophrenia in the CNS at different stages of illness , 2008, Brain Research.

[37]  M. Cairns,et al.  Imprinted DLK1-DIO3 region of 14q32 defines a schizophrenia-associated miRNA signature in peripheral blood mononuclear cells , 2011, Molecular Psychiatry.

[38]  Eric S. Lander,et al.  A polygenic burden of rare disruptive mutations in schizophrenia , 2014, Nature.

[39]  Bharti,et al.  fMRI based computer aided diagnosis of schizophrenia using fuzzy kernel feature extraction and hybrid feature selection , 2017, Multimedia Tools and Applications.

[40]  Hugo G. Schnack,et al.  Improving individual predictions: Machine learning approaches for detecting and attacking heterogeneity in schizophrenia (and other psychiatric diseases) , 2017, Schizophrenia Research.

[41]  David A. Lewis,et al.  Altered Cortical CDC42 Signaling Pathways in Schizophrenia: Implications for Dendritic Spine Deficits , 2010, Biological Psychiatry.

[42]  Feng Xu,et al.  Therapeutic target database update 2016: enriched resource for bench to clinical drug target and targeted pathway information , 2015, Nucleic Acids Res..

[43]  Feng Zhu,et al.  Identification of Key Long Non-Coding RNAs in the Pathology of Alzheimer's Disease and their Functions Based on Genome-Wide Associations Study, Microarray, and RNA-seq Data. , 2019, Journal of Alzheimer's disease : JAD.

[44]  Christopher H Woelk,et al.  A combined analysis of microarray gene expression studies of the human prefrontal cortex identifies genes implicated in schizophrenia. , 2012, Journal of psychiatric research.

[45]  Johan A. K. Suykens,et al.  Systematic benchmarking of microarray data classification: assessing the role of non-linearity and dimensionality reduction , 2004, Bioinform..

[46]  Simon C. Potter,et al.  Genome-wide Association Analysis Identifies 14 New Risk Loci for Schizophrenia , 2013, Nature Genetics.

[47]  Cangzhi Jia,et al.  4mCPred: machine learning methods for DNA N4‐methylcytosine sites prediction , 2018, Bioinform..

[48]  R. Yolken,et al.  Mitochondrial dysfunction in schizophrenia: evidence for compromised brain metabolism and oxidative stress , 2004, Molecular Psychiatry.

[49]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[50]  Pat Levitt,et al.  Molecular Characterization of Schizophrenia Viewed by Microarray Analysis of Gene Expression in Prefrontal Cortex , 2000, Neuron.

[51]  Dan Cohen,et al.  Candidate CSPG4 mutations and induced pluripotent stem cell modeling implicate oligodendrocyte progenitor cell dysfunction in familial schizophrenia , 2018, Molecular Psychiatry.

[52]  Juan Cui,et al.  Derivation of stable microarray cancer-differentiating signatures using consensus scoring of multiple random sampling and gene-ranking consistency evaluation. , 2007, Cancer research.

[53]  Feng Zhu,et al.  Clinical Success of Drug Targets Prospectively Predicted by In Silico Study. , 2017, Trends in pharmacological sciences.

[54]  Murray J. Cairns,et al.  MicroRNA and Posttranscriptional Dysregulation in Psychiatry , 2015, Biological Psychiatry.

[55]  P. Kinnunen,et al.  Increased serum phospholipase A2 activity in schizophrenia: a replication study. , 1990, Biological psychiatry.

[56]  K. Lindblad-Toh,et al.  Systematic discovery of regulatory motifs in human promoters and 3′ UTRs by comparison of several mammals , 2005, Nature.

[57]  M. Hill,et al.  Knockdown of the schizophrenia susceptibility gene TCF4 alters gene expression and proliferation of progenitor cells from the developing human neocortex , 2017, Journal of psychiatry & neuroscience : JPN.

[58]  Toshikazu Saito,et al.  The role of neural stem cells for in vitro models of schizophrenia: Neuroprotection via Akt/ERK signal regulation , 2010, Schizophrenia Research.

[59]  Hongyi Zhou,et al.  A knowledge-based approach for predicting gene-disease associations , 2016, Bioinform..

[60]  Sandra Romero-Steiner,et al.  Molecular signatures of antibody responses derived from a systems biology study of five human vaccines , 2022 .

[61]  Nobumasa Kato,et al.  Association between the neurofibromatosis‐1 (NF1) locus and autism in the Japanese population , 2004, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[62]  P. Mortensen,et al.  Excess early mortality in schizophrenia. , 2014, Annual review of clinical psychology.

[63]  Sylvia Tippmann,et al.  Programming tools: Adventures with R , 2014, Nature.

[64]  Stefan Michiels,et al.  Prediction of cancer outcome with microarrays: a multiple random validation strategy , 2005, The Lancet.

[65]  Hugues Bersini,et al.  Batch effect removal methods for microarray gene expression data integration: a survey , 2013, Briefings Bioinform..

[66]  Sinead M O’Donovan,et al.  Neuron-specific deficits of bioenergetic processes in the dorsolateral prefrontal cortex in schizophrenia , 2018, Molecular Psychiatry.

[67]  Jeffrey T Leek,et al.  qSVA framework for RNA quality correction in differential expression analysis , 2017, Proceedings of the National Academy of Sciences of the United States of America.

[68]  Charles L. Raison,et al.  Tyrosine metabolism during interferon-alpha administration: Association with fatigue and CSF dopamine concentrations , 2013, Brain, Behavior, and Immunity.

[69]  U. Rapp,et al.  The GABP-responsive Element of the Interleukin-2 Enhancer Is Regulated by JNK/SAPK-activating Pathways in T Lymphocytes* , 1998, The Journal of Biological Chemistry.

[70]  J. Chun,et al.  LPA signaling initiates schizophrenia-like brain and behavioral changes in a mouse model of prenatal brain hemorrhage , 2015, Translational Psychiatry.

[71]  M. Butler,et al.  Functional analysis of schizophrenia genes using GeneAnalytics program and integrated databases. , 2018, Gene.

[72]  Q. Zou,et al.  Gene2vec: gene subsequence embedding for prediction of mammalian N6-methyladenosine sites from mRNA , 2018, RNA.

[73]  Helena Brentani,et al.  Innate immune response is differentially dysregulated between bipolar disease and schizophrenia , 2015, Schizophrenia Research.

[74]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[75]  Xinzhi Zhao,et al.  Positive association between ALDH1A2 and schizophrenia in the Chinese population , 2009, Progress in Neuro-Psychopharmacology and Biological Psychiatry.

[76]  Károly Mirnics,et al.  Immune System Disturbances in Schizophrenia , 2014, Biological Psychiatry.

[77]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[78]  C. Spencer,et al.  Biological Insights From 108 Schizophrenia-Associated Genetic Loci , 2014, Nature.

[79]  Cameron S Carter,et al.  Levels of Cognitive Control: A Functional Magnetic Resonance Imaging-Based Test of an RDoC Domain Across Bipolar Disorder and Schizophrenia , 2018, Neuropsychopharmacology.

[80]  Ziyan Shi,et al.  STAT4 Polymorphisms are Associated with Neuromyelitis Optica Spectrum Disorders , 2017, NeuroMolecular Medicine.

[81]  Murray J Cairns,et al.  Optimal consistency in microRNA expression analysis using reference-gene-based normalization. , 2015, Molecular bioSystems.

[82]  Klaus Pantel,et al.  Data Normalization Strategies for MicroRNA Quantification. , 2015, Clinical chemistry.

[83]  Feng Zhu,et al.  Performance Evaluation and Online Realization of Data-driven Normalization Methods Used in LC/MS based Untargeted Metabolomics Analysis , 2016, Scientific Reports.

[84]  D. Rujescu,et al.  Schizophrenia shows a unique metabolomics signature in plasma , 2012, Translational Psychiatry.

[85]  Benjamin A. Logsdon,et al.  Gene Expression Elucidates Functional Impact of Polygenic Risk for Schizophrenia , 2016, Nature Neuroscience.

[86]  Noam Shomron,et al.  Molecular Risk Factors for Schizophrenia. , 2016, Trends in molecular medicine.

[87]  C. Spencer,et al.  A contribution of novel CNVs to schizophrenia from a genome-wide study of 41,321 subjects: CNV Analysis Group and the Schizophrenia Working Group of the Psychiatric Genomics Consortium , 2016, bioRxiv.

[88]  Patrick F. Sullivan,et al.  Comparative Genomic Evidence for the Involvement of Schizophrenia Risk Genes in Antipsychotic Effects , 2017, Molecular Psychiatry.

[89]  Ya-Mei Bai,et al.  The coding‐synonymous polymorphism rs1045280 (Ser280Ser) in β‐arrestin 2 (ARRB2) gene is associated with tardive dyskinesia in Chinese patients with schizophrenia , 2008, European journal of neurology.

[90]  Davide Heller,et al.  STRING v10: protein–protein interaction networks, integrated over the tree of life , 2014, Nucleic Acids Res..

[91]  T M Hyde,et al.  Altered gene expression in the dorsolateral prefrontal cortex of individuals with schizophrenia , 2013, Molecular Psychiatry.

[92]  P. D. Rijk,et al.  MIR137 variants identified in psychiatric patients affect synaptogenesis and neuronal transmission gene sets , 2014, Molecular Psychiatry.

[93]  Xiaofeng Li,et al.  ANPELA: analysis and performance assessment of the label-free quantification workflow for metaproteomic studies , 2019, Briefings Bioinform..

[94]  Humberto Nicolini,et al.  Comparative Analysis of Gene Expression Profiles Involved in Calcium Signaling Pathways Using the NLVH Animal Model of Schizophrenia , 2017, Journal of Molecular Neuroscience.

[95]  Colin Norman,et al.  What Don't We Know? , 2005, Science.