Discovering transnosological molecular basis of human brain diseases using biclustering analysis of integrated gene expression data

BackgroundIt has been reported that several brain diseases can be treated as transnosological manner implicating possible common molecular basis under those diseases. However, molecular level commonality among those brain diseases has been largely unexplored. Gene expression analyses of human brain have been used to find genes associated with brain diseases but most of those studies were restricted either to an individual disease or to a couple of diseases. In addition, identifying significant genes in such brain diseases mostly failed when it used typical methods depending on differentially expressed genes.ResultsIn this study, we used a correlation-based biclustering approach to find coexpressed gene sets in five neurodegenerative diseases and three psychiatric disorders. By using biclustering analysis, we could efficiently and fairly identified various gene sets expressed specifically in both single and multiple brain diseases. We could find 4,307 gene sets correlatively expressed in multiple brain diseases and 3,409 gene sets exclusively specified in individual brain diseases. The function enrichment analysis of those gene sets showed many new possible functional bases as well as neurological processes that are common or specific for those eight diseases.ConclusionsThis study introduces possible common molecular bases for several brain diseases, which open the opportunity to clarify the transnosological perspective assumed in brain diseases. It also showed the advantages of correlation-based biclustering analysis and accompanying function enrichment analysis for gene expression data in this type of investigation.

[1]  Min-Sung Kim,et al.  Combined Database System for Binary Protein Interaction and Co-complex Association , 2009, 2009 International Association of Computer Science and Information Technology - Spring Conference.

[2]  Donald Geman,et al.  Merging microarray data from separate breast cancer studies provides a robust prognostic test , 2008, BMC Bioinformatics.

[3]  R. Altman,et al.  Pharmacogenomics Knowledge for Personalized Medicine , 2012, Clinical pharmacology and therapeutics.

[4]  Martina Wiedau-Pazos,et al.  Integrative gene-tissue microarray-based approach for identification of human disease biomarkers: application to amyotrophic lateral sclerosis. , 2010, Human molecular genetics.

[5]  M. Webster,et al.  Gene expression analysis of bipolar disorder reveals downregulation of the ubiquitin cycle and alterations in synaptic genes , 2006, Molecular Psychiatry.

[6]  Taeho Hwang,et al.  CLIC: clustering analysis of large microarray datasets with individual dimension-based clustering , 2010, Nucleic Acids Res..

[7]  Martin Ester,et al.  Inferring cancer subnetwork markers using density-constrained biclustering , 2010, Bioinform..

[8]  Hugues Bersini,et al.  Batch effect removal methods for microarray gene expression data integration: a survey , 2013, Briefings Bioinform..

[9]  Min-Sung Kim,et al.  COFECO: composite function annotation enriched by protein complex data , 2009, Nucleic Acids Res..

[10]  Sean R. Davis,et al.  GEOquery: a bridge between the Gene Expression Omnibus (GEO) and BioConductor , 2007, Bioinform..

[11]  Alan R. Aronson,et al.  An overview of MetaMap: historical perspective and recent advances , 2010, J. Am. Medical Informatics Assoc..

[12]  C. Brayne,et al.  Microarray analysis of the astrocyte transcriptome in the aging brain: relationship to Alzheimer's pathology and APOE genotype , 2011, Neurobiology of Aging.

[13]  J. Davis Bioinformatics and Computational Biology Solutions Using R and Bioconductor , 2007 .

[14]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Susumu Goto,et al.  KEGG for representation and analysis of molecular networks involving diseases and drugs , 2009, Nucleic Acids Res..

[16]  Ki-Yeol Kim,et al.  Reuse of imputed data in microarray analysis increases imputation efficiency , 2004, BMC Bioinformatics.

[17]  Chunyu Liu,et al.  Removing Batch Effects in Analysis of Expression Microarray Data: An Evaluation of Six Batch Adjustment Methods , 2011, PloS one.

[18]  Cheng Li,et al.  Adjusting batch effects in microarray expression data using empirical Bayes methods. , 2007, Biostatistics.

[19]  E. Rolls,et al.  Cognitive dysfunction in psychiatric disorders: characteristics, causes and the quest for improved therapy , 2012, Nature Reviews Drug Discovery.

[20]  Gordon K. Smyth,et al.  limma: Linear Models for Microarray Data , 2005 .

[21]  Taeho Hwang,et al.  DynaMod: dynamic functional modularity analysis , 2010, Nucleic Acids Res..

[22]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[23]  M. Khoury,et al.  A navigator for human genome epidemiology , 2008, Nature Genetics.

[24]  Gwan-Su Yi,et al.  Biclustering for the comprehensive search of correlated gene expression patterns using clustered seed expansion , 2013, BMC Genomics.

[25]  L. Moran,et al.  Transcriptome analysis reveals link between proteasomal and mitochondrial pathways in Parkinson’s disease , 2006, Neurogenetics.

[26]  N. Campbell Genetic association database , 2004, Nature Reviews Genetics.

[27]  Taeho Hwang,et al.  FiGS: a filter-based gene selection workbench for microarray data , 2010, BMC Bioinformatics.

[28]  Rafael A. Irizarry,et al.  Bioinformatics and Computational Biology Solutions using R and Bioconductor , 2005 .

[29]  S. Horvath,et al.  Transcriptomic Analysis of Autistic Brain Reveals Convergent Molecular Pathology , 2011, Nature.

[30]  M. Rattray,et al.  Gene expression profiling in human neurodegenerative disease , 2012, Nature Reviews Neurology.