A Fine-Grained Image Categorization System by Cellet-Encoded Spatial Pyramid Modeling

In this paper, a new fine-grained image categorization system that improves spatial pyramid matching is developed. In this method, multiple cells are combined into cellets in the proposed categorization model, which are highly responsive to an object's fine categories. The object components are represented by cellets that can connect spatially adjacent cells within the same pyramid level. Here, image categorization can be formulated as the matching between the cellets of corresponding images. Toward an effective matching process, an active learning algorithm that can effectively select a few representative cells for constructing the cellets is adopted. A linear-discriminant-analysis-like scheme is employed to select discriminative cellets. Then, fine-grained image categorization can be conducted with a trained linear support vector machine. Experimental results on three real-world data sets demonstrate that our proposed system outperforms the state of the art.

[1]  Mislav Grgic,et al.  Performance analysis of image compression using wavelets , 2001, IEEE Trans. Ind. Electron..

[2]  Aljoscha Smolic,et al.  Evaluation and FPGA Implementation of Sparse Linear Solvers for Video Processing Applications , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Chuanhou Gao,et al.  Modeling of the Thermal State Change of Blast Furnace Hearth With Support Vector Machines , 2012, IEEE Transactions on Industrial Electronics.

[4]  Larry S. Davis,et al.  Birdlets: Subordinate categorization using volumetric primitives and pose-normalized appearance , 2011, 2011 International Conference on Computer Vision.

[5]  Xuelong Li,et al.  Image categorization: Graph edit distance+edge direction histogram , 2008, Pattern Recognit..

[6]  Trevor Darrell,et al.  Pose pooling kernels for sub-category recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Yue Gao,et al.  3-D Object Retrieval and Recognition With Hypergraph Analysis , 2012, IEEE Transactions on Image Processing.

[8]  Ling Shao,et al.  Feature Learning for Image Classification Via Multiobjective Genetic Programming , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[9]  Yue Gao,et al.  Camera Constraint-Free View-Based 3-D Object Retrieval , 2012, IEEE Transactions on Image Processing.

[10]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[11]  Gary R. Bradski,et al.  A codebook-free and annotation-free approach for fine-grained image categorization , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Chia-Feng Juang,et al.  A TS Fuzzy System Learned Through a Support Vector Machine in Principal Component Space for Real-Time Object Detection , 2012, IEEE Transactions on Industrial Electronics.

[13]  Jieping Ye,et al.  Least squares linear discriminant analysis , 2007, ICML '07.

[14]  C. Schmid,et al.  Scale-invariant shape features for recognition of object categories , 2004, CVPR 2004.

[15]  Hao Su,et al.  Object Bank: A High-Level Image Representation for Scene Classification & Semantic Feature Sparsification , 2010, NIPS.

[16]  Adam L. Kaczmarek,et al.  Interactive Query Expansion With the Use of Clustering-by-Directions Algorithm , 2011, IEEE Transactions on Industrial Electronics.

[17]  Wei-Yun Yau,et al.  Multichannel Pulse-Coupled-Neural-Network-Based Color Image Segmentation for Object Detection , 2012, IEEE Transactions on Industrial Electronics.

[18]  Jaime Gómez Gil,et al.  A New Method for Sensorless Estimation of the Speed and Position in Brushed DC Motors Using Support Vector Machines , 2012, IEEE Transactions on Industrial Electronics.

[19]  Yihong Gong,et al.  Linear spatial pyramid matching using sparse coding for image classification , 2009, CVPR.

[20]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[21]  Jianwei Zhang,et al.  Intelligent Lighting Control for Vision-Based Robotic Manipulation , 2012, IEEE Transactions on Industrial Electronics.

[22]  Xuelong Li,et al.  Fusion of Multichannel Local and Global Structural Cues for Photo Aesthetics Evaluation , 2014, IEEE Transactions on Image Processing.

[23]  Fei-Fei Li,et al.  Object-Centric Spatial Pooling for Image Classification , 2012, ECCV.

[24]  Markos Papadonikolakis,et al.  A novel FPGA-based SVM classifier , 2010, 2010 International Conference on Field-Programmable Technology.

[25]  Cheng-Yuan Chang,et al.  Real-Time Visual Tracking and Measurement to Control Fast Dynamics of Overhead Cranes , 2012, IEEE Transactions on Industrial Electronics.

[26]  C.-C. Jay Kuo,et al.  Novel Rate-Quantization Model-Based Rate Control With Adaptive Initialization for Spatial Scalable Video Coding , 2012, IEEE Transactions on Industrial Electronics.

[27]  Bing-Fei Wu,et al.  A Real-Time Vision System for Nighttime Vehicle Detection and Traffic Surveillance , 2011, IEEE Transactions on Industrial Electronics.

[28]  Yi Yang,et al.  Discriminative Cellets Discovery for Fine-Grained Image Categories Retrieval , 2014, ICMR.

[29]  Jonathan Krause,et al.  Fine-Grained Crowdsourcing for Fine-Grained Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[30]  Trevor Darrell,et al.  Beyond spatial pyramids: Receptive field learning for pooled image features , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Rajat Raina,et al.  Efficient sparse coding algorithms , 2006, NIPS.

[32]  Yihong Gong,et al.  Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[33]  Yi Yang,et al.  Discovering Discriminative Graphlets for Aerial Image Categories Recognition , 2013, IEEE Transactions on Image Processing.

[34]  Peter N. Belhumeur,et al.  POOF: Part-Based One-vs.-One Features for Fine-Grained Categorization, Face Verification, and Attribute Estimation , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[35]  Pietro Perona,et al.  Caltech-UCSD Birds 200 , 2010 .

[36]  Xiao Liu,et al.  Probabilistic Graphlet Transfer for Photo Cropping , 2013, IEEE Transactions on Image Processing.

[37]  Tong Heng Lee,et al.  A Robust Real-Time Embedded Vision System on an Unmanned Rotorcraft for Ground Target Following , 2012, IEEE Transactions on Industrial Electronics.

[38]  Mario Ignacio Chacon Murguia,et al.  An Adaptive Neural-Fuzzy Approach for Object Detection in Dynamic Backgrounds for Surveillance Systems , 2012, IEEE Transactions on Industrial Electronics.

[39]  Xiaoli Li,et al.  The Specular Exponent as a Criterion for Appearance Quality Assessment of Pearllike Objects by Artificial Vision , 2012, IEEE Transactions on Industrial Electronics.

[40]  Li-Chen Fu,et al.  Multilayered Image Processing for Multiscale Harris Corner Detection in Digital Realization , 2010, IEEE Transactions on Industrial Electronics.

[41]  Xuelong Li,et al.  Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search , 2013, IEEE Transactions on Image Processing.

[42]  Donald Geman,et al.  Vantage Feature Frames for Fine-Grained Categorization , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[43]  Kun Duan,et al.  Discovering localized attributes for fine-grained recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[44]  Xiao Liu,et al.  Probabilistic Graphlet Cut: Exploiting Spatial Structure Cue for Weakly Supervised Image Segmentation , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[45]  Xuelong Li,et al.  Beyond Spatial Pyramids: A New Feature Extraction Framework with Dense Spatial Sampling for Image Classification , 2012, ECCV.

[46]  Shenghuo Zhu,et al.  Efficient Object Detection and Segmentation for Fine-Grained Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.