Methodological aspects of the genetic dissection of gene expression

MOTIVATION Dissection of the genetics underlying gene expression utilizes techniques from microarray analyses as well as quantitative trait loci (QTL) mapping. Available QTL mapping methods are not tailored for the highly automated analyses required to deal with the thousands of gene transcripts encountered in the mapping of QTL affecting gene expression (sometimes referred to as eQTL). This report focuses on the adaptation of QTL mapping methodology to perform automated mapping of QTL affecting gene expression.

RESULTS The analyses of expression data on > 12,000 gene transcripts in BXD recombinant inbred mice found, on average, 629 QTL exceeding the genome-wide 5% threshold. Using additional information on trait repeatabilities and QTL location, 168 of these were classified as 'high confidence' QTL. Current sample sizes of genetical genomics studies make it possible to detect a reasonable number of QTL using simple genetic models, but considerably larger studies are needed to evaluate more complex genetic models. After extensive analyses of real data and additional simulated data (altogether > 300,000 genome scans) we make the following recommendations for detection of QTL for gene expression:
(1) For populations with an unbalanced number of replicates on each genotype, weighted least squares should be preferred over ordinary least squares. Weights can be based on the repeatability of the trait and the number of replicates.
(2) A genome scan that uses multiple-marker information but is evaluated only at marker locations is a good approximation to a full interval mapping procedure.
(3) Significance testing should be based on empirical genome-wide significance thresholds that are derived for each trait separately.
(4) The significant QTL can be separated into high and low confidence QTL using a false discovery rate that incorporates prior information such as transcript repeatabilities and co-localization of gene transcripts and QTL.
(5) Including observations on the founder lines in the QTL analysis should be avoided, as it inflates the test statistic and increases the Type I error.
(6) To increase the computational efficiency of the study, use of parallel computing is advised.
These recommendations are summarized in a possible strategy for mapping of QTL in a least squares framework.

AVAILABILITY The software used for this study is available on request from the authors.
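As a rough illustration of recommendations (1) and (3), the sketch below runs a weighted least squares scan at marker locations and derives a trait-specific genome-wide threshold by permutation. It is not the authors' software. The function names, the 0/1 genotype coding, and the weight formula n / (1 + (n - 1) r), which is the inverse of the relative variance of a strain mean with n replicates and repeatability r, are assumptions chosen for the example. All markers are assumed to be polymorphic.

```python
import numpy as np

def strain_weights(n_reps, repeatability):
    """Weight of each strain mean (assumed form): n / (1 + (n - 1) * r)."""
    n = np.asarray(n_reps, dtype=float)
    return n / (1.0 + (n - 1.0) * repeatability)

def wls_f_scan(y, genotypes, w):
    """F statistic from a weighted regression of y on each marker.

    y: (n_strains,) strain means of one transcript
    genotypes: (n_strains, n_markers) array coded 0/1 (hypothetical coding)
    w: (n_strains,) weights, e.g. from strain_weights
    """
    sw = w.sum()
    yc = y - (w @ y) / sw                  # weighted centring of the trait
    gc = genotypes - (w @ genotypes) / sw  # weighted centring of each marker
    sxy = (w * yc) @ gc                    # weighted cross-products per marker
    sxx = w @ (gc ** 2)
    syy = w @ (yc ** 2)
    ss_reg = sxy ** 2 / sxx                # regression sum of squares
    ss_res = syy - ss_reg                  # residual sum of squares
    return ss_reg / (ss_res / (len(y) - 2))

def permutation_threshold(y, genotypes, w, n_perm=1000, alpha=0.05, seed=0):
    """Empirical genome-wide threshold for one trait.

    Trait values (with their weights) are shuffled across strains, and the
    maximum F over all markers is recorded for each permutation.
    """
    rng = np.random.default_rng(seed)
    max_f = np.empty(n_perm)
    for i in range(n_perm):
        idx = rng.permutation(len(y))
        max_f[i] = wls_f_scan(y[idx], genotypes, w[idx]).max()
    return np.quantile(max_f, 1.0 - alpha)

if __name__ == "__main__":
    # Simulated data only: 30 strains, 50 markers, one QTL at marker 10.
    rng = np.random.default_rng(1)
    G = rng.integers(0, 2, size=(30, 50)).astype(float)
    n_reps = rng.integers(1, 6, size=30)
    w = strain_weights(n_reps, repeatability=0.3)
    y = 3.0 * G[:, 10] + rng.normal(0.0, 0.5, size=30)
    f = wls_f_scan(y, G, w)
    thr = permutation_threshold(y, G, w, n_perm=200)
    print("best marker:", int(f.argmax()), "exceeds threshold:", bool(f.max() > thr))
```

Because the threshold is recomputed for every transcript, the permutation loop dominates the run time. It is also the part that is easiest to distribute across processors, since each transcript can be handled independently, in line with recommendation (6).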
