Genome-wide screening for cis-regulatory variation using a classical diallel crossing scheme

Large-scale screens carried out to date for genetic variants that affect gene regulation are generally limited to describing differences in allele-specific expression (ASE) detected in vivo. Allele-specific differences in gene expression provide evidence for a model in which cis-acting genetic variation results in differential expression between alleles. Such surveys for regulatory variation are a first step toward identifying the specific nucleotide changes that govern gene expression differences, but they leave the underlying mechanisms unexplored. Here, we propose a quantitative genetics approach for genome-wide analysis of ASE differences (GASED). The GASED approach is based on a diallel design, which is often used in plant breeding programs to estimate the general combining abilities (GCAs) of specific inbred lines and to identify high-yielding hybrid combinations of parents based on their specific combining abilities (SCAs). In the context of gene expression, the values of the GCA and SCA parameters allow cis- and trans-regulatory changes to be distinguished and imbalances in gene expression to be ascribed to cis-regulatory variation. With this approach, we identified 715 genes that are likely to carry allelic polymorphisms responsible for at least a 1.5-fold allelic expression difference in a total of 10 diploid Arabidopsis thaliana hybrids. The major strength of the GASED approach, compared with other ASE detection methods, is that it is not restricted to genes with allelic transcript variants. Although a false-positive rate of 9/41 was observed, the GASED approach is a valuable pre-screening method that can accelerate systematic surveys of naturally occurring cis-regulatory variation among inbred lines of laboratory species, such as Arabidopsis, mouse, rat and fruit fly, and of economically important crop species, such as corn.
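To make the GCA/SCA decomposition concrete, the following minimal sketch estimates both parameters for a single gene from a half diallel (F1 hybrids only, no reciprocals or parental lines), using the textbook least-squares estimators for the model x_ij = mu + g_i + g_j + s_ij. It is an illustration under stated assumptions, not the authors' pipeline: the function name `diallel_effects`, the parent labels, and the expression values are hypothetical, and the choice of five parents is an assumption (five inbred lines would give the 10 pairwise hybrids mentioned above). The abstract does not detail how the estimated parameters are mapped onto cis versus trans effects, so the sketch stops at the decomposition.

```python
from itertools import combinations


def diallel_effects(parents, hybrid_values):
    """Estimate the grand mean, GCA per parent, and SCA per hybrid.

    parents       : list of unique parent labels (at least three)
    hybrid_values : dict mapping a (parent_a, parent_b) pair to the
                    expression value measured in that F1 hybrid;
                    the order of the two labels in the key is ignored
    Assumes a complete half diallel with one value per hybrid.
    """
    p = len(parents)
    if p < 3:
        raise ValueError("GCA estimates need at least three parents")

    # Store values under unordered keys so (a, b) and (b, a) are equivalent.
    values = {frozenset(pair): v for pair, v in hybrid_values.items()}
    pairs = list(combinations(parents, 2))
    for pair in pairs:
        if frozenset(pair) not in values:
            raise ValueError(f"missing hybrid {pair}")

    grand_total = sum(values[frozenset(pair)] for pair in pairs)
    mu = 2.0 * grand_total / (p * (p - 1))

    # Total over all hybrids that share a given parent.
    parent_total = {
        a: sum(values[frozenset((a, b))] for b in parents if b != a)
        for a in parents
    }

    # GCA: deviation attributable to a parent across all of its hybrids.
    gca = {
        a: (p * parent_total[a] - 2.0 * grand_total) / (p * (p - 2))
        for a in parents
    }

    # SCA: what remains in a hybrid after the mean and both parental GCAs.
    sca = {
        (a, b): values[frozenset((a, b))] - mu - gca[a] - gca[b]
        for a, b in pairs
    }
    return mu, gca, sca


if __name__ == "__main__":
    # Hypothetical log-scale expression values for one gene in 10 hybrids.
    lines = ["P1", "P2", "P3", "P4", "P5"]
    example = {
        ("P1", "P2"): 8.3, ("P1", "P3"): 8.6, ("P1", "P4"): 8.2,
        ("P1", "P5"): 8.4, ("P2", "P3"): 7.9, ("P2", "P4"): 7.5,
        ("P2", "P5"): 7.7, ("P3", "P4"): 7.8, ("P3", "P5"): 8.0,
        ("P4", "P5"): 7.6,
    }
    mu, gca, sca = diallel_effects(lines, example)
    print("grand mean:", round(mu, 3))
    for line in lines:
        print(line, "GCA:", round(gca[line], 3))
```

By construction the GCA estimates sum to zero over parents, and the SCA estimates sum to zero over the hybrids that share any one parent; a gene with large GCA values and near-zero SCA values behaves additively across hybrids.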
