论文信息 - Automated insect identification through concatenated histograms of local appearance features: feature vector generation and region detection for deformable objects

Automated insect identification through concatenated histograms of local appearance features: feature vector generation and region detection for deformable objects

This paper describes a computer vision approach to automated rapid-throughput taxonomic identification of stonefly larvae. The long-term objective of this research is to develop a cost-effective method for environmental monitoring based on automated identification of indicator species. Recognition of stonefly larvae is challenging because they are highly articulated, they exhibit a high degree of intraspecies variation in size and color, and some species are difficult to distinguish visually, despite prominent dorsal patterning. The stoneflies are imaged via an apparatus that manipulates the specimens into the field of view of a microscope so that images are obtained under highly repeatable conditions. The images are then classified through a process that involves (a) identification of regions of interest, (b) representation of those regions as SIFT vectors (Lowe, in Int J Comput Vis 60(2):91–110, 2004) (c) classification of the SIFT vectors into learned “features” to form a histogram of detected features, and (d) classification of the feature histogram via state-of-the-art ensemble classification algorithms. The steps (a) to (c) compose the concatenated feature histogram (CFH) method. We apply three region detectors for part (a) above, including a newly developed principal curvature-based region (PCBR) detector. This detector finds stable regions of high curvature via a watershed segmentation algorithm. We compute a separate dictionary of learned features for each region detector, and then concatenate the histograms prior to the final classification step. We evaluate this classification methodology on a task of discriminating among four stonefly taxa, two of which, Calineuria and Doroneuria, are difficult even for experts to discriminate. The results show that the combination of all three detectors gives four-class accuracy of 82% and three-class accuracy (pooling Calineuria and Doro-neuria) of 95%. Each region detector makes a valuable contribution. In particular, our new PCBR detector is able to discriminate Calineuria and Doroneuria much better than the other detectors.

[1] R. Greenberg. Biometry , 1969, The Yale Journal of Biology and Medicine.

[2] Christopher G. Harris,et al. A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[3] W. Hilsenhoff. Rapid Field Assessment of Organic Pollution with a Family-Level Biotic Index , 1988, Journal of the North American Benthological Society.

[4] Luc Vincent,et al. Watersheds in Digital Spaces: An Efficient Algorithm Based on Immersion Simulations , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[5] J. Ross Quinlan,et al. C4.5: Programs for Machine Learning , 1992 .

[6] Yoav Freund,et al. Experiments with a New Boosting Algorithm , 1996, ICML.

[7] G. Lamberti,et al. Methods in stream ecology , 1997 .

[8] Simon M. Lucas,et al. Face recognition with the continuous n-tuple classifier , 1998, BMVC.

[9] Tomaso A. Poggio,et al. Example-Based Learning for View-Based Human Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[10] Pietro Perona,et al. A Probabilistic Approach to Object Recognition Using Local Photometry and Global Geometry , 1998, ECCV.

[11] Carsten Steger,et al. An Unbiased Detector of Curvilinear Structures , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[12] Keith C. Norris,et al. A test of a pattern recognition system for identification of spiders , 1999 .

[13] J. Friedman. Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[14] Luc Van Gool,et al. Wide Baseline Stereo Matching based on Local, Affinely Invariant Regions , 2000, BMVC.

[15] Stefan Schröder,et al. Biodiversity Informatics in Action: Identification and Monitoring of Bee Species using ABIS , 2001 .

[16] R. Freckleton,et al. Declines in the numbers of amateur and professional taxonomists: implications for conservation , 2002 .

[17] Cordelia Schmid,et al. An Affine Invariant Interest Point Detector , 2002, ECCV.

[18] Jiri Matas,et al. Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..

[19] Pietro Perona,et al. Object class recognition by unsupervised scale-invariant learning , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[20] Peter Auer,et al. Weak Hypotheses and Boosting for Generic Object Detection and Recognition , 2004, ECCV.

[21] T. Tuytelaars,et al. Matching Widely Separated Views Based on Affine Invariant Regions , 2004, International Journal of Computer Vision.

[22] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[23] C. Schmid,et al. Scale-invariant shape features for recognition of object categories , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[24] Andrew Zisserman,et al. An Affine Invariant Salient Region Detector , 2004, ECCV.

[25] Trevor Darrell,et al. Conditional Random Fields for Object Recognition , 2004, NIPS.

[26] Cordelia Schmid,et al. Scale & Affine Invariant Interest Point Detectors , 2004, International Journal of Computer Vision.

[27] Gabriela Csurka,et al. Visual categorization with bags of keypoints , 2002, eccv 2004.

[28] M. O'Neill,et al. Automated species identification: why not? , 2004, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[29] Leo Breiman,et al. Bagging Predictors , 1996, Machine Learning.

[30] Tomaso A. Poggio,et al. A Trainable System for Object Detection , 2000, International Journal of Computer Vision.

[31] Bernt Schiele,et al. Pedestrian detection in crowded scenes , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[32] Guillaume Bouchard,et al. Hierarchical part-based visual object categorization , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[33] Eibe Frank,et al. Logistic Model Trees , 2003, Machine Learning.

[34] Cordelia Schmid,et al. A Comparison of Affine Region Detectors , 2005, International Journal of Computer Vision.

[35] Frédéric Jurie,et al. Creating efficient codebooks for visual recognition , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[36] Andrew Blake,et al. Contour-based learning for object detection , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[37] Martial Hebert,et al. Exploiting Inference for Approximate Parameter Learning in Discriminative Fields: An Empirical Study , 2005, EMMCVPR.

[38] C. Schmid,et al. Object Class Recognition Using Discriminative Local Features , 2005 .

[39] Peter Auer,et al. Generic object recognition with boosting , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40] Thomas G. Dietterich,et al. A Hierarchical Object Recognition System Based on Multi-scale Principal Curvature Regions , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[41] Thomas G. Dietterich,et al. Automated Insect Identification through Concatenated Histograms of Local Appearance Features , 2007, WACV.