Associating quantitative behavioral traits with gene expression in the brain: searching for diamonds in the hay

Gene expression and phenotypic functionality can best be associated when they are measured quantitatively within the same experiment. We present the analysis of one such complex experiment, searching for associations between measures of exploratory behavior in mice and gene expression in brain regions. Such experiments raise several methodological problems. First and foremost, the pool of potential discoveries being screened is enormous, yet only a few biologically relevant findings are expected, which makes the multiple testing problem especially severe. We present solutions based on first screening by testing related hypotheses, then testing the hypotheses of interest. In one variant the subset is selected directly; in the other, a tree of hypotheses is tested hierarchically. Both variants control the False Discovery Rate (FDR). Further problems are that the level of data aggregation may differ between the quantitative traits (one per animal) and the gene expression measurements (pooled across animals), that the association may not be linear, and that only a few replications exist at the resolution of interest. We offer solutions to these problems as well. The hierarchical FDR testing strategies presented here can serve beyond the structure of our motivating example study, in any complex microarray study. Supplementary information: Supplementary data are available at Bioinformatics online.
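To make the screen-then-test idea concrete, the following is a minimal Python sketch, not the procedure used in the study. It assumes a two-level tree in which each parent hypothesis (for example, "this gene is associated with the trait in some region") has a family of child hypotheses (one per region), and it applies the Benjamini-Hochberg step-up procedure at level q to the parents and then to the children of each rejected parent. The function names, the data layout, and the toy p-values are all hypothetical, and the overall error guarantee of such a scheme depends on conditions not covered here.

```python
def benjamini_hochberg(pvalues, q=0.05):
    """Return a list of booleans, True where the BH step-up procedure rejects."""
    m = len(pvalues)
    if m == 0:
        return []
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    # Step-up rule: find the largest rank k (1-based) with p_(k) <= k * q / m.
    k_max = 0
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= rank * q / m:
            k_max = rank
    # Reject every hypothesis whose p-value is among the k_max smallest.
    rejected = [False] * m
    for idx in order[:k_max]:
        rejected[idx] = True
    return rejected


def hierarchical_bh(parent_pvalues, child_pvalues, q=0.05):
    """Two-level sketch: screen parents, then test children of rejected parents.

    parent_pvalues: dict mapping parent name -> p-value (hypothetical layout)
    child_pvalues:  dict mapping parent name -> {child name -> p-value}
    Returns a dict mapping each rejected parent to its rejected children.
    """
    names = list(parent_pvalues)
    parent_rejected = benjamini_hochberg([parent_pvalues[n] for n in names], q)
    discoveries = {}
    for name, is_rejected in zip(names, parent_rejected):
        if not is_rejected:
            continue  # children of a non-rejected parent are never tested
        children = child_pvalues.get(name, {})
        child_names = list(children)
        child_rejected = benjamini_hochberg([children[c] for c in child_names], q)
        discoveries[name] = [c for c, r in zip(child_names, child_rejected) if r]
    return discoveries


if __name__ == "__main__":
    # Toy p-values with made-up labels, for illustration only.
    parents = {"geneA": 0.001, "geneB": 0.04, "geneC": 0.6}
    children = {
        "geneA": {"striatum": 0.01, "cortex": 0.2, "hippocampus": 0.02},
        "geneB": {"striatum": 0.03, "cortex": 0.5, "hippocampus": 0.7},
    }
    print(hierarchical_bh(parents, children, q=0.05))
```

The point of the design is that child hypotheses are examined only under parents that pass the screen, so the number of hypotheses tested at the fine level shrinks from the full pool to a much smaller subset.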
