Gene expression FastMap : Fast eQTL mapping in homozygous populations

Motivation: Gene expression Quantitative Trait Locus (eQTL) mapping measures the association between transcript expression and genotype in order to find genomic locations likely to regulate transcript expression. The availability of both gene expression and high-density genotype data has improved our ability to perform eQTL mapping in inbred mouse and other homozygous populations. However, existing eQTL mapping software does not scale well when the number of transcripts and markers are on the order of 105 and 105–106, respectively. Results: We propose a new method, FastMap, for fast and efficient eQTL mapping in homozygous inbred populations with binary allele calls. FastMap exploits the discrete nature and structure of the measured single nucleotide polymorphisms (SNPs). In particular, SNPs are organized into a Hamming distance-based tree that minimizes the number of arithmetic operations required to calculate the association of a SNP by making use of the association of its parent SNP in the tree. FastMap’s tree can be used to perform both single marker mapping and haplotype association mapping over an m-SNP window. These performance enhancements also permit permutation-based significance testing. Availability: The FastMap program and source code are available at the website: http://cebc.unc.edu/fastmap86.html Contact: iir@unc.edu; nobel@email.unc.edu Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  D. Heckerman,et al.  Efficient Control of Population Structure in Model Organism Association Mapping , 2008, Genetics.

[2]  Gary A. Churchill,et al.  An imputed genotype resource for the laboratory mouse , 2008, Mammalian Genome.

[3]  Jean L. Chang,et al.  Initial sequence and comparative analysis of the cat genome. , 2007, Genome research.

[4]  Zhaohui S. Qin,et al.  A second generation human haplotype map of over 3.1 million SNPs , 2007, Nature.

[5]  Hyuna Yang,et al.  On the subspecific origin of the laboratory mouse , 2007, Nature Genetics.

[6]  Eleazar Eskin,et al.  A sequence-based variation map of 8.27 million SNPs in inbred mouse strains , 2007, Nature.

[7]  Wei Wang,et al.  The polymorphism architecture of mouse genetic resources elucidated using genome-wide resequencing data: implications for QTL discovery and systems genetics , 2007, Mammalian Genome.

[8]  Ivan Rusyn,et al.  Genome‐level analysis of genetic regulation of liver gene expression networks , 2007, Hepatology.

[9]  Joel Parker,et al.  Inferring missing genotypes in large SNP panels using fast nearest-neighbor searches over sliding windows , 2007, ISMB/ECCB.

[10]  Serge Batalov,et al.  Genomewide Association Analysis in Diverse Inbred Mice: Power and Population Structure , 2007, Genetics.

[11]  C. Kendziorski,et al.  Statistical Methods for Expression Quantitative Trait Loci (eQTL) Mapping , 2006, Biometrics.

[12]  Thomas R. Sutter,et al.  How replicable are mRNA expression QTL? , 2006, Mammalian Genome.

[13]  P. von Rohr,et al.  From genetical genomics to systems genetics: potential applications in quantitative genomics and animal breeding , 2006, Mammalian Genome.

[14]  Andrew I. Su,et al.  Comparative analysis of haplotype association mapping algorithms , 2006, BMC Bioinformatics.

[15]  Eric E Schadt,et al.  Integrating QTL and high-density SNP analyses in mice to identify Insig2 as a susceptibility gene for plasma cholesterol levels. , 2005, Genomics.

[16]  E E Schadt,et al.  Integrating genotypic and expression data in a segregating mouse population to identify 5-lipoxygenase as a susceptibility gene for obesity and bone traits , 2005, Nature Genetics.

[17]  Jörg Köhl,et al.  Complement factor 5 is a quantitative trait gene that modifies liver fibrogenesis in mice and humans , 2005, Nature Genetics.

[18]  Robert W. Williams,et al.  Methodological aspects of the genetic dissection of gene expression , 2005, Bioinform..

[19]  Andrew I Su,et al.  Uncovering regulatory pathways that affect hematopoietic stem cell function using 'genetical genomics' , 2005, Nature Genetics.

[20]  Robert W. Williams,et al.  Complex trait analysis of gene expression uncovers polygenic and pleiotropic networks that modulate nervous system function , 2005, Nature Genetics.

[21]  Serge Batalov,et al.  Use of a Dense Single Nucleotide Polymorphism Map for In Silico Mapping in the Mouse , 2004, PLoS biology.

[22]  Nengjun Yi,et al.  The Collaborative Cross, a community resource for the genetic analysis of complex traits , 2004, Nature Genetics.

[23]  David Higgins,et al.  Haplotype analysis in multiple crosses to identify a QTL gene. , 2004, Genome research.

[24]  John D. Storey,et al.  Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[25]  R. Stoughton,et al.  Genetics of gene expression surveyed in maize, mouse and man , 2003, Nature.

[26]  Robert W. Williams,et al.  WebQTL - Web-based complex trait analysis , 2003, Neuroinformatics.

[27]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[28]  K. Manly,et al.  Map Manager QTX, cross-platform software for genetic mapping , 2001, Mammalian Genome.

[29]  C H Kao,et al.  On the differences between maximum likelihood and regression interval mapping in the analysis of quantitative trait loci. , 2000, Genetics.

[30]  P. Donnelly,et al.  Inference of population structure using multilocus genotype data. , 2000, Genetics.

[31]  Janan T. Eppig,et al.  Genealogies of mouse inbred strains , 2000, Nature Genetics.

[32]  D Siegmund,et al.  Statistical methods for mapping quantitative trait loci from a dense set of markers. , 1999, Genetics.

[33]  R. Doerge,et al.  Permutation tests for multiple loci affecting a quantitative character. , 1996, Genetics.

[34]  R. Doerge,et al.  Empirical threshold values for quantitative trait mapping. , 1994, Genetics.

[35]  A Kong,et al.  Asymptotic theory for gene mapping. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[36]  C. Haley,et al.  A simple regression method for mapping quantitative trait loci in line crosses using flanking markers , 1992, Heredity.

[37]  E. Lander,et al.  Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. , 1989, Genetics.