iFORM/eQTL: an ultrahigh-dimensional platform for inferring the global genetic architecture of gene transcripts

Knowledge about how changes in gene expression are encoded by expression quantitative trait loci (eQTLs) is a key to construct the genotype-phenotype map for complex traits or diseases. Traditional eQTL mapping is to associate one transcript with a single marker at a time, thereby limiting our inference about a complete picture of the genetic architecture of gene expression. Here, we implemented an ultrahigh-dimensional variable selection model to build a computing platform that can systematically scan main effects and interaction effects among all possible loci and identify a set of significant eQTLs modulating differentiation and function of gene expression. This platform, named iFORM/eQTL, was assembled by forward-selection-based procedures to tackle complex covariance structures of gene-gene interactions. iFORM/eQTL can particularly discern the role of cis-QTLs, trans-QTLs and their epistatic interactions in gene expression. Results from the reanalysis of a published genetic and genomic data set through iFORM/eQTL gain new discoveries on the genetic origin of gene expression differentiation in Caenorhabditis elegans, which could not be detected by a traditional one-locus/one-transcript analysis approach.

[1]  Hans-Peter Piepho,et al.  Power to Detect Higher-Order Epistatic Interactions in a Metabolic Pathway Using a New Mapping Strategy , 2007, Genetics.

[2]  Hansheng Wang Forward Regression for Ultra-High Dimensional Variable Screening , 2009 .

[3]  C. D. dos Remedios,et al.  Genome-Wide Identification of Expression Quantitative Trait Loci (eQTLs) in Human Heart , 2014, PLoS ONE.

[4]  R. Fisher XV.—The Correlation between Relatives on the Supposition of Mendelian Inheritance. , 1919, Transactions of the Royal Society of Edinburgh.

[5]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[6]  Terence Tao,et al.  The Dantzig selector: Statistical estimation when P is much larger than n , 2005, math/0506081.

[7]  Chris S. Haley,et al.  Epistasis: too often neglected in complex trait studies? , 2004, Nature Reviews Genetics.

[8]  E. Dermitzakis,et al.  Expression quantitative trait loci: present and future , 2013, Philosophical Transactions of the Royal Society B: Biological Sciences.

[9]  Vladimir I. Vladimirov,et al.  A meta-analysis of gene expression quantitative trait loci in brain , 2014, Translational Psychiatry.

[10]  L. Liang,et al.  Using eQTL weights to improve power for genome-wide association studies: a genetic study of childhood asthma , 2013, Front. Genet..

[11]  Ning Hao,et al.  Interaction Screening for Ultrahigh-Dimensional Data , 2014, Journal of the American Statistical Association.

[12]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[13]  G. Casella,et al.  Functional mapping of quantitative trait loci underlying the character process: a theoretical framework. , 2002, Genetics.

[14]  H. Stefánsson,et al.  Genetics of gene expression and its effect on disease , 2008, Nature.

[15]  Peng Zhao,et al.  On Model Selection Consistency of Lasso , 2006, J. Mach. Learn. Res..

[16]  J. Cheverud,et al.  Epistasis and its contribution to genetic variance components. , 1995, Genetics.

[17]  R. Wu,et al.  Functional Clustering of Periodic Transcriptional Profiles through ARMA(p,q) , 2010, PloS one.

[18]  F. V. van Eeuwijk,et al.  Detection and use of QTL for complex traits in multiple environments. , 2010, Current opinion in plant biology.

[19]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[20]  Calin Belta,et al.  Exploiting the pathway structure of metabolism to reveal high-order epistasis , 2008, BMC Systems Biology.

[21]  Joseph K. Pickrell,et al.  Understanding mechanisms underlying human gene expression variation with RNA sequencing , 2010, Nature.

[22]  T. Mackay Epistasis and quantitative traits: using model organisms to study gene–gene interactions , 2013, Nature Reviews Genetics.

[23]  L. Liang,et al.  Mapping complex disease traits with global gene expression , 2009, Nature Reviews Genetics.

[24]  R. Andrews,et al.  Innate Immune Activity Conditions the Effect of Regulatory Variants upon Monocyte Gene Expression , 2014, Science.

[25]  Jiahua Chen,et al.  Extended Bayesian information criteria for model selection with large model spaces , 2008 .

[26]  Rongling Wu,et al.  Statistical Genetics of Quantitative Traits: Linkage, Maps and QTL , 2007 .

[27]  Marit Ackermann,et al.  Impact of Natural Genetic Variation on Gene Expression Dynamics , 2013, PLoS genetics.

[28]  Rongling Wu,et al.  A statistical procedure to map high-order epistasis for complex traits , 2013, Briefings Bioinform..

[29]  H. Cordell Detecting gene–gene interactions that underlie human diseases , 2009, Nature Reviews Genetics.

[30]  Cun-Hui Zhang Nearly unbiased variable selection under minimax concave penalty , 2010, 1002.4734.

[31]  L. Penrose,et al.  THE CORRELATION BETWEEN RELATIVES ON THE SUPPOSITION OF MENDELIAN INHERITANCE , 2022 .

[32]  M. Sillanpää,et al.  Dynamic Quantitative Trait Locus Analysis of Plant Phenomic Data. , 2015, Trends in plant science.

[33]  Hyonho Chun,et al.  Expression Quantitative Trait Loci Mapping With Multivariate Sparse Partial Least Squares Regression , 2009, Genetics.

[34]  Jianqing Fan,et al.  Sure independence screening for ultrahigh dimensional feature space , 2006, math/0612857.

[35]  R. Wu,et al.  A General Model for Multilocus Epistatic Interactions in Case-Control Studies , 2010, PloS one.

[36]  W Y Zhang,et al.  Discussion on `Sure independence screening for ultra-high dimensional feature space' by Fan, J and Lv, J. , 2008 .

[37]  Ben Lehner,et al.  The effects of genetic variation on gene expression dynamics during development , 2013, Nature.

[38]  L. B. Snoek,et al.  Genome-wide gene expression regulation as a function of genotype and age in C. elegans. , 2010, Genome research.

[39]  Wei Sun,et al.  A Statistical Framework for eQTL Mapping Using RNA-seq Data , 2012 .

[40]  M. Stephens,et al.  A Statistical Framework for Joint eQTL Analysis in Multiple Tissues , 2012, PLoS genetics.

[41]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[42]  R. Wu,et al.  Functional mapping — how to map and study the genetic architecture of dynamic complex traits , 2006, Nature Reviews Genetics.

[43]  B. R. Wiseman,et al.  Quantitative trait loci and metabolic pathways. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[44]  L. Kruglyak,et al.  Selection at Linked Sites Shapes Heritable Phenotypic Variation in C. elegans , 2010, Science.

[45]  Manolis Kellis,et al.  Common Genetic Variants Modulate Pathogen-Sensing Responses in Human Dendritic Cells , 2014, Science.

[46]  Wei Zhao,et al.  Wavelet-Based Functional Clustering for Patterns of High-Dimensional Dynamic Gene Expression , 2010, J. Comput. Biol..

[47]  Z. Zeng,et al.  Modeling epistasis of quantitative trait loci using Cockerham's model. , 2002, Genetics.

[48]  Jason H. Moore,et al.  The Ubiquitous Nature of Epistasis in Determining Susceptibility to Common Human Diseases , 2003, Human Heredity.