Automated discretization of ‘transpiration restriction to increasing VPD’ features from outdoors high-throughput phenotyping data

Background Restricting transpiration under high vapor pressure deficit (VPD) is a promising water-saving trait for drought adaptation. However, it is often measured under controlled conditions and at very low throughput, unsuitable for breeding. A few high-throughput phenotyping (HTP) studies exist, and have considered only maximum transpiration rate in analyzing genotypic differences in this trait. Further, no study has precisely identified the VPD breakpoints where genotypes restrict transpiration under natural conditions. Therefore, outdoors HTP data (15 min frequency) of a chickpea population were used to automate the generation of smooth transpiration profiles, extract informative features of the transpiration response to VPD for optimal genotypic discretization, identify VPD breakpoints, and compare genotypes. Results Fifteen biologically relevant features were extracted from the transpiration rate profiles derived from load cells data. Genotypes were clustered (C1, C2, C3) and 6 most important features (with heritability > 0.5) were selected using unsupervised Random Forest. All the wild relatives were found in C1, while C2 and C3 mostly comprised high TE and low TE lines, respectively. Assessment of the distinct p-value groups within each selected feature revealed highest genotypic variation for the feature representing transpiration response to high VPD condition. Sensitivity analysis on a multi-output neural network model (with R of 0.931, 0.944, 0.953 for C1, C2, C3, respectively) found C1 with the highest water saving ability, that restricted transpiration at relatively low VPD levels, 56% (i.e. 3.52 kPa) or 62% (i.e. 3.90 kPa), depending whether the influence of other environmental variables was minimum or maximum. Also, VPD appeared to have the most striking influence on the transpiration response independently of other environment variable, whereas light, temperature, and relative humidity alone had little/no effect. Conclusion Through this study, we present a novel approach to identifying genotypes with drought-tolerance potential, which overcomes the challenges in HTP of the water-saving trait. The six selected features served as proxy phenotypes for reliable genotypic discretization. The wild chickpeas were found to limit water-loss faster than the water-profligate cultivated ones. Such an analytic approach can be directly used for prescriptive breeding applications, applied to other traits, and help expedite maximized information extraction from HTP data.

[1]  Bernard W. Silverman,et al.  A Fast and Efficient Cross-Validation Method for Smoothing Parameter Choice in Spline Regression , 1984 .

[2]  M. Hutchinson,et al.  Smoothing noisy data with spline functions , 1985 .

[3]  Ronald Rousseau,et al.  Similarity measures in scientometric research: The Jaccard index versus Salton's cosine formula , 1989, Inf. Process. Manag..

[4]  Steven K. Rogers,et al.  An Introduction to Biological and Artificial Neural Networks for Pattern Recognition , 1991 .

[5]  Lutz Prechelt,et al.  Automatic early stopping using cross validation: quantifying the criteria , 1998, Neural Networks.

[6]  Lee-Ing Tong,et al.  A NOVEL MEANS OF APPLYING NEURAL NETWORKS TO OPTIMIZE THE MULTIRESPONSE PROBLEM , 2001 .

[7]  Xiucai Guo,et al.  PID neural networks in multivariable systems , 2002, Proceedings of the IEEE Internatinal Symposium on Intelligent Control.

[8]  Julian D. Olden,et al.  Illuminating the “black box”: a randomization approach for understanding variable contributions in artificial neural networks , 2002 .

[9]  Xianhong Xie,et al.  Optimal spline smoothing of fMRI time series by generalized cross-validation , 2003, NeuroImage.

[10]  Hilde van der Togt,et al.  Publisher's Note , 2003, J. Netw. Comput. Appl..

[11]  B. Kozłowski Time series denoising with wavelet transform , 2005 .

[12]  G. Hammer,et al.  Potential yield and water-use efficiency benefits in sorghum from limited maximum transpiration rate. , 2005, Functional plant biology : FPB.

[13]  T. Cox,et al.  Local Minima in Nonmetric Multidimensional Scaling , 2005 .

[14]  S. Horvath,et al.  Unsupervised Learning With Random Forest Predictors , 2006 .

[15]  M. Vannucci,et al.  Biophysical modelling and NDVI time series to project near‐term forage supply: spectral analysis aided by wavelet denoising and ARIMA modelling , 2007 .

[16]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[17]  I. S. Bozchalooi,et al.  A smoothness index-guided approach to wavelet parameter selection in signal de-noising and fault detection , 2007 .

[18]  Michael Grabner,et al.  Time-varying-response smoothing , 2007 .

[19]  Chandra Erdman,et al.  bcp: An R Package for Performing a Bayesian Analysis of Change Point Problems , 2007 .

[20]  Tony R. Martinez,et al.  Decision Tree Ensemble: Small Heterogeneous Is Better Than Large Homogeneous , 2008, 2008 Seventh International Conference on Machine Learning and Applications.

[21]  Bjoern H. Menze,et al.  A comparison of random forest and its Gini importance with standard chemometric methods for the feature selection and classification of spectral data , 2009, BMC Bioinformatics.

[22]  T. Sinclair,et al.  Genotypic variation in peanut for transpiration response to vapor pressure deficit , 2010 .

[23]  T. Sinclair,et al.  Genetic variability of transpiration response to vapor pressure deficit among sorghum genotypes , 2010 .

[24]  V. Vadez,et al.  Terminal drought-tolerant pearl millet [Pennisetum glaucum (L.) R. Br.] have high leaf ABA and limit transpiration at high vapour pressure deficit , 2010, Journal of experimental botany.

[25]  Sunil Kumar Sinha,et al.  Intelligent Hybrid Wavelet Models for Short-Term Load Forecasting , 2010, IEEE Transactions on Power Systems.

[26]  R. Varshney,et al.  Genomics and Physiological Approaches for Root Trait Breeding to Improve Drought Tolerance in Chickpea (Cicer arietinum L.) , 2011 .

[27]  T. Sinclair Is transpiration efficiency a viable plant trait in breeding for crop improvement? , 2012, Functional plant biology : FPB.

[28]  Jun'ichi Tsujii,et al.  Feature engineering combined with machine learning and rule-based methods for structured information extraction from narrative clinical discharge summaries , 2012, J. Am. Medical Informatics Assoc..

[29]  W. Sadok,et al.  Differential sensitivities of transpiration to evaporative demand and soil water deficit among wheat elite cultivars indicate different strategies for drought tolerance , 2012 .

[30]  T. Sinclair,et al.  Temperature interactions with transpiration response to vapor pressure deficit among cultivated and wild soybean genotypes. , 2013, Physiologia plantarum.

[31]  Mark E. Cooper,et al.  Transpiration Response of Maize Hybrids to Atmospheric Vapour Pressure Deficit , 2013 .

[32]  Adam P. Piotrowski,et al.  A comparison of methods to avoid overfitting in neural networks training in the case of catchment runoff modelling , 2013 .

[33]  David S. Matteson,et al.  A Nonparametric Approach for Multiple Change Point Analysis of Multivariate Data , 2013, 1306.4933.

[34]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[35]  T. C. Hennessey,et al.  Increased vapor pressure deficit due to higher temperature leads to greater transpiration and faster mortality during drought for tree seedlings common to the forest-grassland ecotone. , 2013, The New phytologist.

[36]  M. Zaman-Allah,et al.  Water: the most important 'molecular' component of water stress tolerance research. , 2013, Functional plant biology : FPB.

[37]  Shokoufe Tayyebi,et al.  Applying Neural Network to Dynamic Modeling of Biosurfactant Production Using Soybean Oil Refinery Wastes , 2013 .

[38]  V. Vadez,et al.  Transpiration efficiency: new insights into an old story. , 2014, Journal of experimental botany.

[39]  Malika Charrad,et al.  NbClust: An R Package for Determining the Relevant Number of Clusters in a Data Set , 2014 .

[40]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[41]  Li Zhu,et al.  MODWT-ARMA model for time series prediction , 2014 .

[42]  Girisha Garg,et al.  A signal invariant wavelet function selection algorithm , 2015, Medical & Biological Engineering & Computing.

[43]  Mariana Recamonde Mendoza,et al.  What variables are important in predicting bovine viral diarrhea virus? A random forest approach , 2015, Veterinary Research.

[44]  Michael D. Dukes,et al.  Step by Step Calculation of the Penman-Monteith Evapotranspiration (FAO-56 Method) , 2024, EDIS.

[45]  Grégoire M. Hummel,et al.  LeasyScan: a novel concept combining 3D imaging and lysimetry for high-throughput phenotyping of traits controlling plant water budget , 2015, Journal of experimental botany.

[46]  Hanno Scharr,et al.  Machine Learning for Plant Phenotyping Needs Image Processing. , 2016, Trends in plant science.

[47]  Xavier Draye,et al.  Gravimetric phenotyping of whole plant transpiration responses to atmospheric vapour pressure deficit identifies genotypic variation in water use efficiency. , 2016, Plant science : an international journal of experimental plant biology.

[48]  Yufeng Ge,et al.  Temporal dynamics of maize plant growth, water use, and leaf water content using automated high throughput RGB and hyperspectral imaging , 2016, Comput. Electron. Agric..

[49]  Thomas E. Carter,et al.  Limited-Transpiration Trait for Increased Yield for Water-Limited Soybean: From Model to Phenotype to Genotype to Cultivars , 2016 .

[50]  Juan M. Corchado,et al.  Fitting for smoothing: A methodology for continuous-time target track estimation , 2016, 2016 International Conference on Indoor Positioning and Indoor Navigation (IPIN).

[51]  T. Pridmore,et al.  Plant Phenomics, From Sensors to Knowledge , 2017, Current Biology.

[52]  R. Wallach,et al.  High‐throughput physiological phenotyping and screening system for the characterization of plant–environment interactions , 2017, The Plant journal : for cell and molecular biology.

[53]  T. Sinclair,et al.  Relevance of limited-transpiration trait for lentil (Lens culinaris Medik.) in South Asia , 2017 .

[54]  Ashutosh Kumar Singh,et al.  Deep Learning for Plant Stress Phenotyping: Trends and Future Perspectives. , 2018, Trends in plant science.

[55]  Samuel H. Taylor,et al.  Whole plant chamber to examine sensitivity of cereal gas exchange to changes in evaporative demand , 2018, Plant Methods.

[56]  J. Rose,et al.  Isolation and manipulation of protoplasts from the unicellular green alga Penium margaritaceum , 2018, Plant Methods.

[57]  G McLean,et al.  Integrating modelling and phenotyping approaches to identify and screen complex traits: transpiration efficiency in cereals , 2018, Journal of experimental botany.

[58]  Kelly R. Thorp,et al.  High-Throughput Phenotyping of Crop Water Use Efficiency via Multispectral Drone Imagery and a Daily Soil Water Balance Model , 2018, Remote. Sens..

[59]  Mario Cantú-Sifuentes,et al.  Multivariate statistical inference in a radial basis function neural network , 2018, Expert Syst. Appl..

[60]  K. Kozlov,et al.  Non-linear regression models for time to flowering in wild chickpea combine genetic and climatic factors , 2019, BMC Plant Biology.

[61]  V. Vadez,et al.  Measurement of transpiration restriction under high vapor pressure deficit for sorghum mapping population parents , 2019, Plant Physiology Reports.

[62]  G. Hammer,et al.  Genotypic variation in whole-plant transpiration efficiency in sorghum only partly aligns with variation in stomatal conductance. , 2019, Functional plant biology : FPB.

[63]  Asheesh K. Singh,et al.  Machine Learning Approach for Prescriptive Plant Breeding , 2019, Scientific Reports.

[64]  R. Snowdon,et al.  High-resolution digital phenotyping of water uptake and transpiration efficiency. , 2020, Trends in plant science.