An Ultrahigh-Dimensional Mapping Model of High-order Epistatic Networks for Complex Traits

Background: Genetic interactions involving more than two loci have been thought to affect quantitatively inherited traits and diseases more pervasively than previously appreciated. However, the detection of such high-order interactions to chart a complete portrait of genetic architecture has not been well explored. Methods: We present an ultrahigh-dimensional model to systematically characterize genetic main effects and interaction effects of various orders among all possible markers in a genetic mapping or association study. The model was built on the extension of a variable selection procedure, called iFORM, derived from forward selection. The model shows its unique power to estimate the magnitudes and signs of high-order epistatic effects, in addition to those of main effects and pairwise epistatic effects. Results: The statistical properties of the model were tested and validated through simulation studies. By analyzing a real data for shoot growth in a mapping population of woody plant, mei (Prunus mume), we demonstrated the usefulness and utility of the model in practical genetic studies. The model has identified important high-order interactions that contribute to shoot growth for mei. Conclusion: The model provides a tool to precisely construct genotype-phenotype maps for quantitative traits by identifying any possible high-order epistasis which is often ignored in the current genetic literature.

[1]  Kristel Van Steen,et al.  Travelling the world of gene-gene interactions , 2012, Briefings Bioinform..

[2]  Noah Simon,et al.  Convex Modeling of Interactions With Strong Heredity , 2014, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[3]  Ronald M. Nelson,et al.  A century after Fisher: time for a new paradigm in quantitative genetics. , 2013, Trends in genetics : TIG.

[4]  Rongling Wu,et al.  Genetic control of juvenile growth and botanical architecture in an ornamental woody plant, Prunus mume Sieb. et Zucc. as revealed by a high-density linkage map , 2014, BMC Genetics.

[5]  Runze Li,et al.  A FAST ALGORITHM FOR DETECTING GENE-GENE INTERACTIONS IN GENOME-WIDE ASSOCIATION STUDIES. , 2014, The annals of applied statistics.

[6]  G. Casella,et al.  Functional mapping of quantitative trait loci underlying the character process: a theoretical framework. , 2002, Genetics.

[7]  G. Wagner,et al.  Epistasis and the mutation load: a measurement-theoretical approach. , 2001, Genetics.

[8]  T. Hastie,et al.  Learning Interactions via Hierarchical Group-Lasso Regularization , 2015, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[9]  Matthew B. Taylor,et al.  Genetic Interactions Involving Five or More Genes Contribute to a Complex Trait in Yeast , 2014, PLoS genetics.

[10]  R. Tibshirani,et al.  A LASSO FOR HIERARCHICAL INTERACTIONS. , 2012, Annals of statistics.

[11]  T. Mackay Epistasis and quantitative traits: using model organisms to study gene–gene interactions , 2013, Nature Reviews Genetics.

[12]  R. Wu,et al.  Functional mapping — how to map and study the genetic architecture of dynamic complex traits , 2006, Nature Reviews Genetics.

[13]  Rongling Wu,et al.  Asymptotic distribution for epistatic tests in case-control studies. , 2011, Genomics.

[14]  Xun Xu,et al.  The genome of Prunus mume , 2012, Nature Communications.

[15]  Jing Zhang,et al.  A Bayesian model for detection of high-order interactions among genetic variants in genome-wide association studies , 2015, BMC Genomics.

[16]  Matthew B. Taylor,et al.  Higher-order genetic interactions and their contribution to complex traits. , 2015, Trends in genetics : TIG.

[17]  Cun-Hui Zhang,et al.  The sparsity and bias of the Lasso selection in high-dimensional linear regression , 2008, 0808.0967.

[18]  Chris S. Haley,et al.  Detecting epistasis in human complex traits , 2014, Nature Reviews Genetics.

[19]  Rongling Wu,et al.  A statistical procedure to map high-order epistasis for complex traits , 2013, Briefings Bioinform..

[20]  H. Cordell Detecting gene–gene interactions that underlie human diseases , 2009, Nature Reviews Genetics.

[21]  Jie Zhang,et al.  Genome-Wide Characterization and Linkage Mapping of Simple Sequence Repeats in Mei (Prunus mume Sieb. et Zucc.) , 2013, PloS one.

[22]  Bo Zhang,et al.  3FunMap: full-sib family functional mapping of dynamic traits , 2011, Bioinform..

[23]  Lior Pachter,et al.  Analysis of epistatic interactions and fitness landscapes using a new geometric approach , 2007, BMC Evolutionary Biology.

[24]  M. Sillanpää,et al.  Dynamic Quantitative Trait Locus Analysis of Plant Phenomic Data. , 2015, Trends in plant science.

[25]  Mats E. Pettersson,et al.  Replication and Explorations of High-Order Epistasis Using a Large Advanced Intercross Line Pedigree , 2011, PLoS genetics.

[26]  Hao Zhu,et al.  A network of heterochronic genes including Imp1 regulates temporal changes in stem cell properties , 2013, eLife.

[27]  Jianqing Fan,et al.  Sure independence screening for ultrahigh dimensional feature space , 2006, math/0612857.

[28]  Robin D Dowell,et al.  Genotype to Phenotype: A Complex Problem , 2010, Science.

[29]  R. Wu,et al.  A General Model for Multilocus Epistatic Interactions in Case-Control Studies , 2010, PloS one.

[30]  Robert B. Heckendorn,et al.  Should evolutionary geneticists worry about higher-order epistasis? , 2013, Current opinion in genetics & development.

[31]  Achim Tresch,et al.  Heterochrony underpins natural variation in Cardamine hirsuta leaf form , 2015, Proceedings of the National Academy of Sciences.

[32]  Calin Belta,et al.  Exploiting the pathway structure of metabolism to reveal high-order epistasis , 2008, BMC Systems Biology.

[33]  S. Sim,et al.  Rapid evolution of flowering time by an annual plant in response to a climate fluctuation , 2007, Proceedings of the National Academy of Sciences.

[34]  L. Kruglyak,et al.  Finding the sources of missing heritability in a yeast cross , 2012, Nature.

[35]  Rongling Wu,et al.  iFORM/eQTL: an ultrahigh-dimensional platform for inferring the global genetic architecture of gene transcripts , 2016, Briefings Bioinform..