High-throughput phenotyping with deep learning gives insight into the genetic architecture of flowering time in wheat

Background: Measuring plant traits precisely and quickly on large populations has emerged as a critical bottleneck in connecting genotype to phenotype in genetics and breeding. This bottleneck limits advances in understanding plant genomes and in developing improved, high-yielding crop varieties.

Results: Here we demonstrate the application of deep learning to proximal imaging from a mobile field vehicle to directly score plant morphology and developmental stages in wheat under field conditions. We developed a convolutional neural network, trained it on image datasets labeled with expert visual scores, and used this ‘breeder-trained’ network to score wheat morphology and developmental stages directly. For both a morphological trait (awned) and a phenological trait (flowering time), we demonstrate high heritability and extremely high accuracy against the ‘ground-truth’ values from visual scoring. Using the traits scored by the network, we tested genotype-to-phenotype association and uncovered novel epistatic interactions for flowering time. Enabled by time-series high-throughput phenotyping, we describe a new phenotype, the rate of flowering, and show that it is under heritable genetic control.

Conclusions: We demonstrated a field-based high-throughput phenotyping approach using deep learning that can directly score morphological and developmental phenotypes in genetic populations. Most powerfully, the deep learning approach presented here represents a conceptual advance in high-throughput plant phenotyping, as it can potentially score any trait in any plant species by leveraging expert knowledge from breeders, geneticists, pathologists and physiologists.
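The abstract reports high heritability for the network-scored traits without saying how heritability was estimated. As a rough illustration only, the sketch below estimates entry-mean broad-sense heritability from replicated plot scores using a balanced one-way ANOVA. It is a simplified stand-in, not the authors' analysis; the function name `entry_mean_heritability`, the input layout, and the toy data are assumptions made for this example.

```python
from statistics import mean


def entry_mean_heritability(scores):
    """Entry-mean broad-sense heritability from a balanced one-way layout.

    `scores` (hypothetical layout) maps each genotype to a list of
    plot-level trait values, one value per replicate.
    """
    rep_counts = {len(values) for values in scores.values()}
    if len(rep_counts) != 1:
        raise ValueError("balanced design required: equal replicates per genotype")
    r = rep_counts.pop()   # replicates per genotype
    g = len(scores)        # number of genotypes
    if g < 2 or r < 2:
        raise ValueError("need at least 2 genotypes and 2 replicates")

    grand_mean = mean(x for values in scores.values() for x in values)

    # Mean square among genotypes and mean square within genotypes (error).
    ms_g = r * sum((mean(v) - grand_mean) ** 2 for v in scores.values()) / (g - 1)
    ms_e = sum((x - mean(v)) ** 2 for v in scores.values() for x in v) / (g * (r - 1))

    # Method-of-moments variance components; negative estimates are set to zero.
    var_g = max((ms_g - ms_e) / r, 0.0)
    var_e = ms_e

    denom = var_g + var_e / r
    return var_g / denom if denom > 0 else 0.0


# Toy flowering-time scores (days), two replicate plots per genotype.
toy_scores = {"A": [10, 12], "B": [20, 22], "C": [30, 32]}
h2 = entry_mean_heritability(toy_scores)
```

With the toy data, differences among genotypes dominate the plot-to-plot noise, so the estimate comes out close to 1. A real field trial would normally use a mixed model that also accounts for blocks and spatial effects.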
