TasselNet: counting maize tassels in the wild via local counts regression network

Background: Accurately counting maize tassels is important for monitoring the growth status of maize plants. This tedious task, however, is still done mainly by hand. In the context of modern plant phenotyping, automating it is required to meet the needs of large-scale genotype and phenotype analysis. In recent years, computer vision has made significant breakthroughs thanks to the emergence of large-scale datasets and increased computational resources, and image-based approaches have naturally received much attention in plant-related studies. Most image-based systems for plant phenotyping, however, are deployed in controlled laboratory environments. When the application scenario moves to unconstrained in-field conditions, intrinsic and extrinsic variations in the wild pose great challenges for accurate counting of maize tassels, challenges that go beyond the ability of conventional image processing techniques. This calls for more robust computer vision approaches that can handle in-field variations.

Results: This paper studies the in-field counting problem of maize tassels. To our knowledge, this is the first time that a plant-related counting problem has been considered using computer vision technologies in an unconstrained field environment. With 361 field images collected in four experimental fields across China between 2010 and 2015, together with the corresponding manually labelled dotted annotations, a novel Maize Tassels Counting (MTC) dataset is created and will be released with this paper. To address the in-field challenges, a deep convolutional neural network-based approach termed TasselNet is proposed. TasselNet adapts well to in-field variations by modelling the local visual characteristics of field images and regressing the local counts of maize tassels. Extensive results on the MTC dataset demonstrate that TasselNet outperforms other state-of-the-art approaches by large margins and achieves the best overall counting performance, with a mean absolute error of 6.6 and a mean squared error of 9.6 averaged over 8 test sequences.

Conclusions: TasselNet can achieve robust in-field counting of maize tassels with a relatively high degree of accuracy. Our experimental evaluations also suggest several good practices for practitioners working on maize-tassel-like counting problems. It is worth noting that, although TasselNet greatly reduces the counting errors, in-field counting of maize tassels remains an open and unsolved problem.
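The two error measures reported in the Results can be illustrated with a minimal Python sketch. The function names and the per-image counts below are hypothetical and are not taken from the MTC dataset or the TasselNet code. Note also that counting papers often report the square root of the mean squared error under the name "MSE"; the abstract does not state which convention is used, so both variants are shown.

```python
import math
from typing import Sequence


def _errors(predicted: Sequence[float], annotated: Sequence[float]) -> list:
    """Per-image counting errors (predicted count minus annotated count)."""
    if len(predicted) != len(annotated) or not predicted:
        raise ValueError("need two non-empty sequences of equal length")
    return [p - a for p, a in zip(predicted, annotated)]


def mean_absolute_error(predicted, annotated) -> float:
    """Average of the absolute per-image counting errors."""
    errs = _errors(predicted, annotated)
    return sum(abs(e) for e in errs) / len(errs)


def mean_squared_error(predicted, annotated) -> float:
    """Average of the squared per-image counting errors."""
    errs = _errors(predicted, annotated)
    return sum(e * e for e in errs) / len(errs)


def root_mean_squared_error(predicted, annotated) -> float:
    """Square root of the mean squared error (often also labelled 'MSE')."""
    return math.sqrt(mean_squared_error(predicted, annotated))


# Hypothetical counts for four images (illustration only, not real data).
predicted_counts = [12.0, 30.0, 7.0, 21.0]
annotated_counts = [10, 33, 7, 25]

mae = mean_absolute_error(predicted_counts, annotated_counts)
mse = mean_squared_error(predicted_counts, annotated_counts)
rmse = root_mean_squared_error(predicted_counts, annotated_counts)
print(f"MAE={mae:.2f}  MSE={mse:.2f}  RMSE={rmse:.2f}")
```

The mean absolute error reflects the typical size of the miscount per image, while the squared variants penalise occasional large miscounts more heavily, which is why the two are usually reported together.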
