GWAS identifies candidate regulators of in planta regeneration in Populus trichocarpa

Plant regeneration is an important dimension of plant propagation, and a key step in the production of transgenic plants. However, regeneration capacity varies widely among genotypes and species, the molecular basis of which is largely unknown. While association mapping methods such as genome-wide association studies (GWAS) have long demonstrated abilities to help uncover the genetic basis of trait variation in plants, the power of these methods relies on the accuracy and scale of phenotypic data used. To enable a largescale GWAS of in planta regeneration in model tree Populus, we implemented a workflow involving semantic segmentation to quantify regenerating plant tissues (callus and shoot) over time. We found the resulting statistics are of highly non-normal distributions, which necessitated transformations or permutations to avoid violating assumptions of linear models used in GWAS. While transformations can lead to a loss of statistical power, we demonstrate that this can be mitigated by the application of the Augmented Rank Truncation method, or avoided altogether using the Multi-Threaded Monte Carlo SNP-set (Sequence) Kernel Association Test to compute empirical p-values in GWAS. We report over 200 statistically supported candidate genes, with top candidates including regulators of cell adhesion, stress signaling, and hormone signaling pathways, as well as other diverse functions. We demonstrate that sensitive genetic discovery for complex developmental traits can be enabled by a workflow based on computer vision and adaptation of several statistical approaches necessitated by to the complexity of regeneration trait expression and distribution.

[1]  Daniel Moukouanga,et al.  Physiological and genetic control of transpiration efficiency in African rice, Oryza glaberrima Steud. , 2022, Journal of experimental botany.

[2]  S. Strauss,et al.  Robust High-Throughput Phenotyping with Deep Segmentation Enabled by a Web-Based Annotator , 2022, bioRxiv.

[3]  B. Mercatoris,et al.  Wheat Ear Segmentation Based on a Multisensor System and Superpixel Classification , 2022, Plant phenomics.

[4]  Andreu Paytuví Gallart,et al.  GreeNC 2.0: a comprehensive database of plant long non-coding RNAs , 2021, Nucleic Acids Res..

[5]  J. Schmutz,et al.  The ancient Salicoid genome duplication event: A platform for reconstruction of de Novo gene evolution in Populus trichocarpa. , 2021, Genome biology and evolution.

[6]  K. Shirasu,et al.  WIND transcription factors orchestrate wound‐induced callus formation, vascular reconnection and defense response in Arabidopsis , 2021, The New phytologist.

[7]  Yong He,et al.  Advanced high-throughput plant phenotyping techniques for genome-wide association studies: A review , 2021, Journal of advanced research.

[8]  R. Sederoff,et al.  MYB-Mediated Regulation of Anthocyanin Biosynthesis , 2021, International journal of molecular sciences.

[9]  C. Tong,et al.  Multivariate linear mixed model enhanced the power of identifying genome-wide association to poplar tree heights in a randomized complete block design , 2021, G3.

[10]  T. Sakamoto,et al.  Haplotype analysis from unmanned aerial vehicle imagery of rice MAGIC population for the trait dissection of biomass and plant architecture , 2020, Journal of experimental botany.

[11]  Shubhra Kanti Karmaker Santu,et al.  AutoML to Date and Beyond: Challenges and Opportunities , 2020, ACM Comput. Surv..

[12]  D. Geelen,et al.  Natural Variation in Plant Pluripotency and Regeneration , 2020, Plants.

[13]  Daniel A. Jacobson,et al.  Genome-Wide Association Study of Wood Anatomical and Morphological Traits in Populus trichocarpa , 2020, Frontiers in Plant Science.

[14]  R. Wu,et al.  Genome-wide association studies of callus differentiation for the desert tree, Populus euphratica. , 2020, Tree physiology.

[15]  T. Winkelmann,et al.  Genetic analysis of callus formation in a diversity panel of 96 rose genotypes , 2020, Plant Cell, Tissue and Organ Culture (PCTOC).

[16]  Avjinder S. Kaler,et al.  Comparing Different Statistical Models and Multiple Testing Corrections for Association Mapping in Soybean and Maize , 2020, Frontiers in Plant Science.

[17]  Renato Umeton,et al.  Automated machine learning: Review of the state-of-the-art and opportunities for healthcare , 2020, Artif. Intell. Medicine.

[18]  James C. Schnable,et al.  Plant segmentation by supervised machine learning methods , 2020, The Plant Phenome Journal.

[19]  Pei Sun,et al.  Deciphering Genetic Architecture of Adventitious Root and Related Shoot Traits in Populus Using QTL Mapping and RNA-Seq Data , 2019, International journal of molecular sciences.

[20]  D. Voytas,et al.  Plant gene editing through de novo induction of meristems , 2019, Nature Biotechnology.

[21]  Gerald A. Tuskan,et al.  Data Integration in Poplar: ‘Omics Layers and Integration Strategies , 2019, Front. Genet..

[22]  S. Coimbra,et al.  In silico and expression analyses of fasciclin-like arabinogalactan proteins reveal functional conservation during embryo and seed development , 2019, Plant Reproduction.

[23]  Tony P. Pridmore,et al.  RootNav 2.0: Deep learning for automatic navigation of complex plant root architectures , 2019, bioRxiv.

[24]  Malachy T. Campbell,et al.  Leveraging genome-enabled growth models to study shoot growth responses to water deficit in rice , 2019, bioRxiv.

[25]  A. Caballero,et al.  Inferring the Nature of Missing Heritability in Human Traits Using Data from the GWAS Catalog , 2019, Genetics.

[26]  Abhishek Dutta,et al.  The VIA Annotation Software for Images, Audio and Video , 2019, ACM Multimedia.

[27]  Tina Gui,et al.  Lawn plant identification and segmentation based on least squares support vector machine and multifeature fusion , 2019, J. Electronic Imaging.

[28]  C. Brien,et al.  Genome-wide association of barley plant growth under drought stress using a nested association mapping population , 2019, BMC Plant Biology.

[29]  Haichao Guo,et al.  RhizoVision Crown: An Integrated Hardware and Software Platform for Root Crown Phenotyping , 2019, bioRxiv.

[30]  G. Tuskan,et al.  Genome-wide association studies of bark texture in Populus trichocarpa , 2019, Tree Genetics & Genomes.

[31]  K. Lowe,et al.  Using Morphogenic Genes to Improve Recovery and Regeneration of Transgenic Plants , 2019, Plants.

[32]  Bing Wang,et al.  InterMineR: an R package for InterMine databases , 2019, Bioinform..

[33]  Jeff H. Chang,et al.  Association mapping, transcriptomics, and transient expression identify candidate genes mediating plant–pathogen interactions in a tree , 2018, Proceedings of the National Academy of Sciences.

[34]  S. Strauss,et al.  Opportunities for Innovation in Genetic Transformation of Forest Trees , 2018, Front. Plant Sci..

[35]  Y. Noh,et al.  Salicylic acid-induced transcriptional reprogramming by the HAC–NPR1–TGA histone acetyltransferase complex in Arabidopsis , 2018, Nucleic acids research.

[36]  Gerald A Tuskan,et al.  Defining the genetic components of callus formation: A GWAS approach , 2018, PloS one.

[37]  A. Caballero,et al.  Inferring the nature of missing heritability in human traits , 2018, bioRxiv.

[38]  Jie Zhao,et al.  N&agr;-Acetyltransferases 10 and 15 are Required for the Correct Initiation of Endosperm Cellularization in Arabidopsis , 2018, Plant & cell physiology.

[39]  G. Venkataraman,et al.  CRISPR for Crop Improvement: An Update Review , 2018, Front. Plant Sci..

[40]  Gerald A Tuskan,et al.  Genome-wide association studies and expression-based quantitative trait loci analyses reveal roles of HCT2 in caffeoylquinic acid biosynthesis and its regulation by defense-responsive transcription factors in Populus. , 2018, The New phytologist.

[41]  Lijun Luo,et al.  Genome-Wide Association Studies of Image Traits Reveal Genetic Architecture of Drought Resistance in Rice. , 2018, Molecular plant.

[42]  J. Ecker,et al.  A B-ARR-mediated cytokinin transcriptional network directs hormone cross-regulation and shoot development , 2018, Nature Communications.

[43]  Aaron Kusmec,et al.  FarmCPUpp: Efficient large‐scale genomewide association studies , 2017, bioRxiv.

[44]  Mateusz Majda,et al.  The Role of Auxin in Cell Wall Expansion , 2018, International journal of molecular sciences.

[45]  Joseph L. Gage,et al.  Selection Signatures Underlying Dramatic Male Inflorescence Transformation During Modern Hybrid Maize Breeding , 2018, Genetics.

[46]  Dmitri V. Zaykin,et al.  Detecting Weak Signals by Combining Small P-Values in Genetic Association Studies , 2018, bioRxiv.

[47]  Sotirios A. Tsaftaris,et al.  Leveraging multiple datasets for deep leaf counting , 2017, bioRxiv.

[48]  Shubhra Aich,et al.  Leaf Counting with Deep Convolutional and Deconvolutional Networks , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[49]  Division on Earth,et al.  Genetically Engineered Crops: Experiences and Prospects , 2017 .

[50]  Shangwei Zhong,et al.  Activation of HLS1 by Mechanical Stress via Ethylene-Stabilized EIN3 Is Crucial for Seedling Soil Emergence , 2016, Front. Plant Sci..

[51]  Sandra M. Schmöckel,et al.  Salinity tolerance loci revealed in rice using high-throughput non-invasive phenotyping , 2016, Nature Communications.

[52]  Peggy G. Lemaux,et al.  Advancing Crop Transformation in the Era of Genome Editing[OPEN] , 2016, Plant Cell.

[53]  S. Redline,et al.  Control for Population Structure and Relatedness for Binary Traits in Genetic Association Studies via Logistic Mixed Models. , 2016, American journal of human genetics.

[54]  A. Ding,et al.  Expansins: roles in plant growth and potential applications in crop improvement , 2016, Plant Cell Reports.

[55]  A. Ding,et al.  Expansins: roles in plant growth and potential applications in crop improvement , 2016, Plant Cell Reports.

[56]  Zhiwu Zhang,et al.  Iterative Usage of Fixed and Random Effect Models for Powerful and Efficient Genome-Wide Association Studies , 2016, PLoS genetics.

[57]  J. Weitz,et al.  Digital imaging of root traits (DIRT): a high-throughput computing and collaboration platform for field-based root phenomics , 2015, Plant Methods.

[58]  Emily M. Strait,et al.  The arabidopsis information resource: Making and mining the “gold standard” annotated reference plant genome , 2015, Genesis.

[59]  Xin Li,et al.  Two N-Terminal Acetyltransferases Antagonistically Regulate the Stability of a Nod-Like Receptor in Arabidopsis , 2015, Plant Cell.

[60]  田原 康玄,et al.  生活習慣病とgenome-wide association study , 2015 .

[61]  Hui Feng,et al.  Genome-wide association study of rice (Oryza sativa L.) leaf traits with a high-throughput leaf scorer , 2015, Journal of experimental botany.

[62]  Edwin Cuppen,et al.  Sambamba: fast processing of NGS alignment formats , 2015, Bioinform..

[63]  Nancy Wilkins-Diehr,et al.  XSEDE: Accelerating Scientific Discovery , 2014, Computing in Science & Engineering.

[64]  J. Uhrig,et al.  The Arabidopsis thaliana mitogen-activated protein kinases MPK3 and MPK6 target a subclass of 'VQ-motif'-containing proteins to regulate immune responses. , 2014, The New phytologist.

[65]  M. Stephens,et al.  fastSTRUCTURE: Variational Inference of Population Structure in Large SNP Data Sets , 2014, Genetics.

[66]  Tae-Ho Lee,et al.  SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data , 2014, BMC Genomics.

[67]  Iuliana Ionita-Laza,et al.  Sequence kernel association tests for the combined effect of rare and common variants. , 2013, American journal of human genetics.

[68]  Ziv Bar-Joseph,et al.  Temporal transcriptional response to ethylene gas drives growth hormone cross-regulation in Arabidopsis , 2013, eLife.

[69]  M. Stephens,et al.  Genome-wide Efficient Mixed Model Analysis for Association Studies , 2012, Nature Genetics.

[70]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration , 2012, Briefings Bioinform..

[71]  Liam J. Revell,et al.  phytools: an R package for phylogenetic comparative biology (and other things) , 2012 .

[72]  A. Bacic,et al.  A Fasciclin-Like Arabinogalactan-Protein (FLA) Mutant of Arabidopsis thaliana, fla1, Shows Defects in Shoot Regeneration , 2011, PloS one.

[73]  Xihong Lin,et al.  Rare-variant association testing for sequencing data with the sequence kernel association test. , 2011, American journal of human genetics.

[74]  M. Seki,et al.  The AP2/ERF Transcription Factor WIND1 Controls Cell Dedifferentiation in Arabidopsis , 2011, Current Biology.

[75]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer , 2011, Nature Biotechnology.

[76]  Vanessa Wahl,et al.  The FANTASTIC FOUR proteins influence shoot meristem size in Arabidopsis thaliana , 2010, BMC Plant Biology.

[77]  J. Ecker,et al.  Ethylene-Induced Stabilization of ETHYLENE INSENSITIVE3 and EIN3-LIKE1 Is Mediated by Proteasomal Degradation of EIN3 Binding F-Box 1 and 2 That Requires EIN2 in Arabidopsis[C][W] , 2010, Plant Cell.

[78]  Patrick J. Boyle,et al.  The BTB/POZ Domain of the Arabidopsis Disease Resistance Protein NPR1 Interacts with the Repression Domain of TGA2 to Negate Its Function[W] , 2009, The Plant Cell Online.

[79]  W. Deng,et al.  A novel method for induction of plant regeneration via somatic embryogenesis , 2009 .

[80]  T. Mizuno,et al.  A Small Subfamily of Arabidopsis RADIALIS-LIKE SANT/MYB Genes: A Link to HOOKLESS1-Mediated Signal Transduction during Early Morphogenesis , 2008, Bioscience, biotechnology, and biochemistry.

[81]  Antonio Torralba,et al.  LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.

[82]  M. Raizada,et al.  Ethylene and shoot regeneration: hookless1 modulates de novo shoot organogenesis in Arabidopsis thaliana , 2008, Plant Cell Reports.

[83]  Yan Xiong,et al.  Dual control of nuclear EIN3 by bifurcate MAPK cascades in C2H4 signalling , 2008, Nature.

[84]  K. Taylor,et al.  Genome-Wide Association , 2007, Diabetes.

[85]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[86]  Xin Li,et al.  Regulation of plant innate immunity by three proteins in a complex conserved across the plant and animal kingdoms. , 2007, Genes & development.

[87]  M. Gribskov,et al.  The Genome of Black Cottonwood, Populus trichocarpa (Torr. & Gray) , 2006, Science.

[88]  Kazuo Shinozaki,et al.  Loss of NECROTIC SPOTTED LESIONS 1 associates with cell death and defense responses in Arabidopsis thaliana , 2006, Plant Molecular Biology.

[89]  Shuhua Yang,et al.  A Haplotype-Specific Resistance Gene Regulated by BONZAI1 Mediates Temperature-Dependent Growth Control in Arabidopsis , 2004, The Plant Cell Online.

[90]  Shuichi Yanagisawa,et al.  EIN3-Dependent Regulation of Plant Ethylene Hormone Signaling by Two Arabidopsis F Box Proteins EBF1 and EBF2 , 2003, Cell.

[91]  A. Bacic,et al.  The Fasciclin-Like Arabinogalactan Proteins of Arabidopsis. A Multigene Family of Putative Cell Adhesion Molecules1 , 2003, Plant Physiology.

[92]  Xinnian Dong,et al.  In Vivo Interaction between NPR1 and Transcription Factor TGA2 Leads to Salicylic Acid–Mediated Gene Activation in Arabidopsis Article, publication date, and citation information can be found at www.plantcell.org/cgi/doi/10.1105/tpc.001628. , 2002, The Plant Cell Online.

[93]  Stephen W. Michnick,et al.  Direct visualization of protein interactions in plant cells , 2001, Nature Biotechnology.

[94]  A. Murphy,et al.  Flavonoids act as negative regulators of auxin transport in vivo in arabidopsis. , 2001, Plant physiology.

[95]  Jane Glazebrook,et al.  The Arabidopsis NPR1 Gene That Controls Systemic Acquired Resistance Encodes a Novel Protein Containing Ankyrin Repeats , 1997, Cell.

[96]  義記 岡 Late Pleistocene の海面変化に関する諸問題 , 1970 .

[97]  J. Noble,et al.  Late Pleistocene Stratigraphy and Chronology in Southwestern British Columbia and Northwestern Washington , 1965 .

[98]  J. Franco-Zorrilla,et al.  The cytokinin response factors modulate root and shoot growth and promote leaf senescence in Arabidopsis. , 2016, The Plant journal : for cell and molecular biology.

[99]  M. Raizada,et al.  The shoot regeneration capacity of excised Arabidopsis cotyledons is established during the initial hours after injury and is modulated by a complex genetic network of light signalling. , 2013, Plant, cell & environment.

[100]  Watson Richard,et al.  Opportunities for Innovation , 2007 .