Applying machine learning to investigate long‐term insect–plant interactions preserved on digitized herbarium specimens

Premise of the study Despite the economic importance of insect damage to plants, long-term data documenting changes in insect damage (‘herbivory’) and diversity are limited. Millions of pressed plant specimens are now available online for collecting big data on plant-insect interactions during the Anthropocene. Methods We initiated development of machine learning methods to automate extraction of herbivory data from herbarium specimens. We trained an insect damage detector and a damage type classifier on two distantly related plant species. We experimented with 1) classifying six types of herbivory and two control categories of undamaged leaf, and 2) detecting two of these damage categories for which several hundred annotations were available. Results Classification models identified the correct type of herbivory 81.5% of the time. The damage classifier was accurate for categories with at least one hundred test samples. We show anecdotally that the detector works well when asked to detect two types of damage. Discussion The classifier and detector together are a promising first step for the automation of herbivory data collection. We describe ongoing efforts to increase the accuracy of these models to allow other researchers to extract similar data and apply them to address a variety of biological hypotheses.

[1]  Noel H. Holmgren,et al.  Index Herbariorum: A global directory of public herbaria and associated staff , 1998 .

[2]  Emily K. Meineke,et al.  Herbarium specimens reveal increasing herbivory over the past century , 2018, Journal of Ecology.

[3]  Peter Wilf,et al.  Impact of the terminal Cretaceous event on plant–insect associations , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[4]  S. J. Tonsor,et al.  The Changing Uses of Herbarium Data in an Era of Global Change: An Overview Using Automated Content Analysis , 2019, BioScience.

[5]  P. Raven,et al.  BUTTERFLIES AND PLANTS: A STUDY IN COEVOLUTION , 1964 .

[6]  M. Willig,et al.  Populations are not declining and food webs are not collapsing at the Luquillo Experimental Forest , 2019, Proceedings of the National Academy of Sciences.

[7]  Rebecca B. Dikow,et al.  Generating segmentation masks of herbarium specimens and a data set for training segmentation models using deep learning , 2020, Applications in plant sciences.

[8]  D. Futuyma,et al.  Macroevolution and the biological diversity of plants and herbivores , 2009, Proceedings of the National Academy of Sciences.

[9]  C. Linnen,et al.  Oviposition traits generate extrinsic postzygotic isolation between two pine sawfly species , 2017, BMC Evolutionary Biology.

[10]  R. Primack,et al.  Herbarium specimens, photographs, and field observations show Philadelphia area plants are responding to climate change. , 2012, American journal of botany.

[11]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  R. Primack,et al.  Herbarium specimens demonstrate earlier flowering times in response to warming in Boston. , 2004, American journal of botany.

[13]  C. Labandeira,et al.  Response of plant-insect associations to paleocene-eocene warming , 1999, Science.

[14]  J. M. Heberling,et al.  Herbarium specimens as exaptations: New uses for old collections. , 2017, American journal of botany.

[15]  Lawrence M. Page,et al.  Digitization of Biodiversity Collections Reveals Biggest Data on Biodiversity , 2015 .

[16]  Hong Liu,et al.  Testing the enemy release hypothesis: a review and meta-analysis , 2006, Biological Invasions.

[17]  Hernán A Burbano,et al.  Using herbaria to study global environmental change , 2018, The New phytologist.

[18]  Shaogang Gong,et al.  Class Rectification Hard Mining for Imbalanced Deep Learning , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[19]  Dumitru Erhan,et al.  Scalable Object Detection Using Deep Neural Networks , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Martin M Turcotte,et al.  Percentage leaf herbivory across vascular plant species , 2014 .

[21]  Emily K. Meineke,et al.  Biological collections for understanding biodiversity in the Anthropocene , 2018, Philosophical Transactions of the Royal Society B.

[22]  Emily K. Meineke,et al.  Museum specimens provide novel insights into changing plant–herbivore interactions , 2018, Philosophical Transactions of the Royal Society B.

[23]  Elizabeth R. Ellwood,et al.  Toward a large‐scale and deep phenological stage annotation of herbarium specimens: Case studies from temperate, tropical, and equatorial floras , 2019, Applications in plant sciences.

[24]  Aaron M Ellison,et al.  Herbarium records are reliable sources of phenological change driven by climate and provide novel insights into species' phenological cueing mechanisms. , 2015, American journal of botany.

[25]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  R. Guralnick,et al.  Tracking trends in monarch abundance over the 20th century is currently impossible using museum records , 2019, Proceedings of the National Academy of Sciences.

[27]  Butterfly abundance declines over 20 years of systematic monitoring in Ohio, USA , 2019 .

[28]  P. Bonnet,et al.  Going deeper in the automated identification of Herbarium specimens , 2017, BMC Evolutionary Biology.

[29]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  H. de Kroon,et al.  More than 75 percent decline over 27 years in total flying insect biomass in protected areas , 2017, PloS one.

[31]  G. Cooper-Driver INSECT‐FERN ASSOCIATIONS , 1978 .

[32]  Xiang Zhang,et al.  OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.

[33]  R. Proulx,et al.  Bookkeeping of insect herbivory trends in herbarium specimens of purple loosestrife (Lythrum salicaria) , 2018, Philosophical Transactions of the Royal Society B.

[34]  Emily K. Meineke,et al.  The unrealized potential of herbaria for global change biology , 2017, bioRxiv.

[35]  Bradford C. Lister,et al.  Climate-driven declines in arthropod abundance restructure a rainforest food web , 2018, Proceedings of the National Academy of Sciences.

[36]  Abhishek Dutta,et al.  The VIA Annotation Software for Images, Audio and Video , 2019, ACM Multimedia.

[37]  M. Berenbaum,et al.  Increase in toxicity of an invasive weed after reassociation with its coevolved herbivore , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[38]  P. Coley,et al.  Insect herbivory, plant defense, and early Cenozoic climate change , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[39]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[40]  Martin M Turcotte,et al.  Precision and accuracy in quantifying herbivory , 2016 .

[41]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[42]  F. Stuart Chapin,et al.  Resource Availability and Plant Antiherbivore Defense , 1985, Science.

[43]  William J. Foley,et al.  Assessing the evidence for latitudinal gradients in plant defence and herbivory , 2011 .

[44]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[45]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[46]  H. Cyr,et al.  Magnitude and patterns of herbivory in aquatic and terrestrial ecosystems , 1993, Nature.

[47]  Just Cebrian,et al.  PATTERNS OF HERBIVORY AND DECOMPOSITION IN AQUATIC AND TERRESTRIAL ECOSYSTEMS , 2004 .

[48]  P. Soltis Digitization of herbaria enables novel research. , 2017, American journal of botany.

[49]  Pamela S Soltis,et al.  Old Plants, New Tricks: Phenological Research Using Herbarium Specimens. , 2017, Trends in ecology & evolution.

[50]  P. Feeny,et al.  Plant apparency and chemical defense , 1976 .

[51]  D. Soltis,et al.  Mobilizing and integrating big data in studies of spatial and phylogenetic patterns of biodiversity , 2016, Plant diversity.

[52]  Jitendra Malik,et al.  Hypercolumns for object segmentation and fine-grained localization , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[53]  Jeffrey R. Adrion,et al.  Butterfly abundance declines over 20 years of systematic monitoring in Ohio, USA , 2019, bioRxiv.

[54]  T. Davies,et al.  Macroecological and macroevolutionary patterns of leaf herbivory across vascular plants , 2014, Proceedings of the Royal Society B: Biological Sciences.

[55]  C. Labandeira,et al.  Decoupled Plant and Insect Diversity After the End-Cretaceous Extinction , 2006, Science.

[56]  Vicki A. Funk,et al.  Applications of deep convolutional neural networks to digitized natural history collections , 2017, Biodiversity data journal.

[57]  S. Frank,et al.  Shared and unique responses of insects to the interaction of urbanization and background climate. , 2015, Current opinion in insect science.

[58]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[59]  Jing Huang,et al.  Image indexing using color correlograms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[60]  V. Zverev,et al.  Decreased losses of woody plant foliage to insects in large urban areas are explained by bird predation , 2017, Global change biology.

[61]  J. Puzey,et al.  Monarch butterfly and milkweed declines substantially predate the use of genetically modified crops , 2018, Proceedings of the National Academy of Sciences.

[62]  C. Labandeira,et al.  Fossil insect folivory tracks paleotemperature for six million years , 2010 .

[63]  Tyson Wepprich Monarch butterfly trends are sensitive to unexamined changes in museum collections over time , 2019, Proceedings of the National Academy of Sciences.