Detecting disease-associated genes with confounding variable adjustment and the impact on genomic meta-analysis: With application to major depressive disorder

Background: Detecting candidate markers in transcriptomic studies of complex diseases is often difficult, particularly when overall signals are weak and sample sizes are small. Covariates, including demographic, clinical and technical variables, are often confounded with the underlying disease effects, which further hampers accurate biomarker detection. Our motivating example came from an analysis of five microarray studies in major depressive disorder (MDD), a heterogeneous psychiatric illness with mostly uncharacterized genetic mechanisms.

Results: We applied a random intercept model to account for confounding variables and the case-control paired design. A variable selection scheme was developed to determine the effective confounders for each gene. Meta-analysis methods were used to integrate information from the five studies, and post hoc analyses enhanced biological interpretation. Simulations and application results showed that adjustment for confounding variables, combined with meta-analysis, improved detection of biomarkers and associated pathways.

Conclusions: The proposed framework simultaneously considers correction for confounding variables, selection of effective confounders, random effects from the paired design and integration by meta-analysis. The approach improved detection of disease-related biomarkers and pathways, which greatly enhanced understanding of MDD neurobiology. The statistical framework can be applied to similar experimental designs encountered in other complex and heterogeneous diseases.
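The abstract does not say which meta-analysis method was used to integrate the five studies. As a minimal sketch of the general idea, the example below combines per-study p-values for a single gene with Fisher's method, which is an assumption chosen for illustration and not necessarily the authors' choice. The function name `fisher_combined_pvalue` and the p-values are hypothetical, and the method assumes the studies are independent.

```python
import math


def fisher_combined_pvalue(pvalues):
    """Combine independent p-values with Fisher's method.

    The statistic X = -2 * sum(ln p_i) follows a chi-square distribution
    with 2k degrees of freedom under the null, where k is the number of
    studies. For even degrees of freedom the upper tail has a closed form:
        P(X >= x) = exp(-x/2) * sum_{i=0}^{k-1} (x/2)^i / i!
    so no statistics library is needed.
    """
    if not pvalues:
        raise ValueError("at least one p-value is required")
    if any(not (0.0 < p <= 1.0) for p in pvalues):
        raise ValueError("p-values must lie in (0, 1]")

    k = len(pvalues)
    half_stat = -sum(math.log(p) for p in pvalues)  # this is X / 2

    # Accumulate sum_{i=0}^{k-1} (X/2)^i / i! term by term.
    term = 1.0
    total = 1.0
    for i in range(1, k):
        term *= half_stat / i
        total += term

    return math.exp(-half_stat) * total


if __name__ == "__main__":
    # Hypothetical p-values for one gene, one per study (illustration only).
    study_pvalues = [0.04, 0.06, 0.03, 0.08, 0.05]
    print(fisher_combined_pvalue(study_pvalues))
```

In this sketch, five modestly significant studies yield a combined p-value smaller than any single one, which is the sense in which meta-analysis can recover weak but consistent signals. In a genome-wide setting the combined p-values would still need a multiple-testing correction across genes.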
