Quantitative gene-gene and gene-environment mapping for leaf shape variation using tree-based models.

Leaf shape traits have long been a focus of many disciplines, but the complex genetic and environmental interactive mechanisms regulating leaf shape variation have not yet been investigated in detail. The question of the respective roles of genes and environment and how they interact to modulate leaf shape is a thorny evolutionary problem, and sophisticated methodology is needed to address it. In this study, we investigated a framework-level approach that inputs shape image photographs and genetic and environmental data, and then outputs the relative importance ranks of all variables after integrating shape feature extraction, dimension reduction, and tree-based statistical models. The power of the proposed framework was confirmed by simulation and a Populus szechuanica var. tibetica data set. This new methodology resulted in the detection of novel shape characteristics, and also confirmed some previous findings. The quantitative modeling of a combination of polygenetic, plastic, epistatic, and gene-environment interactive effects, as investigated in this study, will improve the discernment of quantitative leaf shape characteristics, and the methods are ready to be applied to other leaf morphology data sets. Unlike the majority of approaches in the quantitative leaf shape literature, this framework-level approach is data-driven, without assuming any pre-known shape attributes, landmarks, or model structures.

[1]  Michael F. Covington,et al.  A Modern Ampelography: A Genetic Basis for Leaf Shape and Venation Patterning in Grape1[C][W][OPEN] , 2013, Plant Physiology.

[2]  Seiji Matsuura,et al.  Evaluation of variation of root shape of Japanese radish (Raphanus sativus L.) based on image analysis using elliptic Fourier descriptors , 1998, Euphytica.

[3]  G. Evanno,et al.  Detecting the number of clusters of individuals using the software structure: a simulation study , 2005, Molecular ecology.

[4]  A. D. Bradshaw,et al.  Evolutionary Significance of Phenotypic Plasticity in Plants , 1965 .

[5]  D. Royer,et al.  Sensitivity of leaf size and shape to climate within Acer rubrum and Quercus kelloggii. , 2008, The New phytologist.

[6]  Jason H. Moore,et al.  A call for biological data mining approaches in epidemiology , 2016, BioData Mining.

[7]  Thomas E. Juenger,et al.  Genotype-by-Environment Interaction and Plasticity: Exploring Genomic Responses of Plants to the Abiotic Environment , 2013 .

[8]  N. Cherry,et al.  Prediction of a District's Grape-Ripening Capacity Using a Latitude-Temperature Index (LTI) , 1988, American Journal of Enology and Viticulture.

[9]  F. Pociot,et al.  Novel analytical methods applied to type 1 diabetes genome-scan data. , 2004, American journal of human genetics.

[10]  Jie Peng,et al.  Resolving Distinct Genetic Regulators of Tomato Leaf Shape within a Heteroblastic and Ontogenetic Context[W][OPEN] , 2014, Plant Cell.

[11]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[12]  Michael F. Covington,et al.  Modeling development and quantitative trait mapping reveal independent genetic modules for leaf size and shape. , 2015, The New phytologist.

[13]  Peter J. Bradbury,et al.  Genome-wide association study of leaf architecture in the maize nested association mapping population , 2011, Nature Genetics.

[14]  P. Donnelly,et al.  Genome-wide strategies for detecting multiple loci that influence complex diseases , 2005, Nature Genetics.

[15]  Yao Sun,et al.  Temporal and spatial relationship of gene expression in the infarcted rat heart , 2012, BMC Bioinformatics.

[16]  K. Robertson,et al.  Phenotypic Plasticity of Leaf Shape along a Temperature Gradient in Acer rubrum , 2009, PloS one.

[17]  Lutgarde M. C. Buydens,et al.  Self- and Super-organizing Maps in R: The kohonen Package , 2007 .

[18]  F. Bookstein,et al.  The Relation Between Geometric Morphometrics and Functional Morphology, as Explored by Procrustes Interpretation of Individual Shape Measures Pertinent to Function , 2015, Anatomical record.

[19]  Nathan J B Kraft,et al.  Sensitivity of leaf size and shape to climate: global patterns and paleoclimatic applications. , 2011, The New phytologist.

[20]  Karl J. Niklas,et al.  The evolution and functional significance of leaf shape in the angiosperms. , 2011, Functional plant biology : FPB.

[21]  Christopher N Topp,et al.  Revealing plant cryptotypes: defining meaningful phenotypes among infinite traits. , 2015, Current opinion in plant biology.

[22]  D. Greenwood,et al.  Paleotemperature Estimation Using Leaf-Margin Analysis: Is Australia Different? , 2004 .

[23]  Richard Kennaway,et al.  Evolution of Allometry in Antirrhinum[C][W] , 2009, The Plant Cell Online.

[24]  Christian Peter Klingenberg,et al.  Evolution and development of shape: integrating quantitative approaches , 2010, Nature Reviews Genetics.

[25]  Allison J. Miller,et al.  Latent developmental and evolutionary shapes embedded within the grapevine leaf , 2015, bioRxiv.

[26]  Aashish Ranjan,et al.  Transcriptomic analysis suggests a key role for SQUAMOSA PROMOTER BINDING PROTEIN LIKE, NAC and YUCCA genes in the heteroblastic development of the temperate rainforest tree Gevuina avellana (Proteaceae). , 2016, The New phytologist.

[27]  Julin N. Maloof,et al.  The Developmental Trajectory of Leaflet Morphology in Wild Tomato Species[C][W][OA] , 2012, Plant Physiology.

[28]  José Luis Micol,et al.  Mutational spaces for leaf shape and size , 2008, HFSP journal.

[29]  Light Environment Has Little Effect on Heteroblastic Development of the Temperate Rain Forest Tree Gevuina avellana Mol. (Proteaceae) , 2015, International Journal of Plant Sciences.

[30]  E. W. Sinnott,et al.  A BOTANICAL INDEX OF CRETACEOUS AND TERTIARY CLIMATES. , 1915, Science.

[31]  Hadley Wickham,et al.  ggplot2 - Elegant Graphics for Data Analysis (2nd Edition) , 2017 .

[32]  M. Stephens,et al.  Inferring weak population structure with the assistance of sample group information , 2009, Molecular ecology resources.

[33]  P. Wilf,et al.  Digital Future for Paleoclimate Estimation from Fossil Leaves? Preliminary Results , 2003 .

[34]  Bjarne Gram Hansen,et al.  Biochemical Networks and Epistasis Shape the Arabidopsis thaliana Metabolome[W] , 2008, The Plant Cell Online.

[35]  R. Wu,et al.  Mapping shape quantitative trait loci using a radius-centroid-contour model , 2013, Heredity.

[36]  Christian Peter Klingenberg,et al.  The pace of morphological change: historical transformation of skull shape in St Bernard dogs , 2008, Proceedings of the Royal Society B: Biological Sciences.

[37]  J. Stommel,et al.  Inheritance of Fruit, Foliar, and Plant Habit Attributes in Capsicum , 2008 .

[38]  Xin Wang,et al.  SNP interaction detection with Random Forests in high-dimensional genetic data , 2012, BMC Bioinformatics.

[39]  D. Royer,et al.  Why Do Toothed Leaves Correlate with Cold Climates? Gas Exchange at Leaf Margins Provides New Insights into a Classic Paleotemperature Proxy , 2006, International Journal of Plant Sciences.

[40]  Jason H. Moore,et al.  A global view of epistasis , 2005, Nature Genetics.

[41]  T. Delmonte,et al.  QTL analysis of leaf morphology in tetraploid Gossypium (cotton) , 2000, Theoretical and Applied Genetics.

[42]  Brad T. Townsley,et al.  An Intracellular Transcriptomic Atlas of the Giant Coenocyte Caulerpa taxifolia , 2015, PLoS genetics.

[43]  Michael F. Covington,et al.  A Quantitative Genetic Basis for Leaf Morphology in a Set of Precisely Defined Tomato Introgression Lines[C][W][OPEN] , 2013, Plant Cell.

[44]  K. Lunetta,et al.  Screening large-scale association study data: exploiting interactions using random forests , 2004, BMC Genetics.

[45]  George C. Runger,et al.  Bias of Importance Measures for Multi-valued Attributes and Solutions , 2011, ICANN.

[46]  Heping Zhang,et al.  A forest-based approach to identifying gene and gene–gene interactions , 2007, Proceedings of the National Academy of Sciences.

[47]  Sterling Sawaya,et al.  THE GENETICS OF ADAPTIVE SHAPE SHIFT IN STICKLEBACK: PLEIOTROPY AND EFFECT SIZE , 2007, Evolution; international journal of organic evolution.

[48]  T. Mackay Epistasis and quantitative traits: using model organisms to study gene–gene interactions , 2013, Nature Reviews Genetics.

[49]  Cris Kuhlemeier,et al.  Leaf Asymmetry as a Developmental Constraint Imposed by Auxin-Dependent Phyllotactic Patterning[OA] , 2012, Plant Cell.

[50]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[51]  H. Tsukaya Leaf shape: genetic controls and environmental factors. , 2005, The International journal of developmental biology.

[52]  Xianzhong Feng,et al.  Evolution through genetically controlled allometry space. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[53]  L. Kruglyak,et al.  Finding the sources of missing heritability in a yeast cross , 2012, Nature.

[54]  D. Clayton,et al.  Genome-wide association studies: theoretical and practical concerns , 2005, Nature Reviews Genetics.

[55]  Sabrina Renaud,et al.  Epigenetic effects on the mouse mandible: common features and discrepancies in remodeling due to muscular dystrophy and response to food consistency , 2010, BMC Evolutionary Biology.

[56]  G. Wang,et al.  A forest-based feature screening approach for large-scale genome data with complex structures , 2015, BMC Genetics.

[57]  Jonathan L Haines,et al.  Genetics, statistics and human disease: analytical retooling for complexity. , 2004, Trends in genetics : TIG.

[58]  D. Adams,et al.  QUANTITATIVE GENETICS OF PLASTRON SHAPE IN SLIDER TURTLES (TRACHEMYS SCRIPTA) , 2006, Evolution; international journal of organic evolution.

[59]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[60]  D. Royer,et al.  Correlations of climate and plant ecology to leaf size and shape: potential proxies for the fossil record. , 2005, American journal of botany.

[61]  Jing Li,et al.  Detecting gene-gene interactions using a permutation-based random forest method , 2016, BioData Mining.

[62]  Adele Cutler,et al.  An application of Random Forests to a genome-wide association dataset: Methodological considerations & new findings , 2010, BMC Genetics.

[63]  P. Prusinkiewicz,et al.  The genetics of geometry. , 2004, Proceedings of the National Academy of Sciences of the United States of America.