Rotation-covariant visual concept detection using steerable Riesz wavelets and bags of visual words

Distinct texture classes are often sharing several visual concepts. Texture instances from different classes are sharing regions in the feature hyperspace, which results in ill-defined classification configurations. In this work, we detect rotation-covariant visual concepts using steerable Riesz wavelets and bags of visual words. In a first step, K-means clustering is used to detect visual concepts in the hyperspace of the energies of steerable Riesz wavelets. The coordinates of the clusters are used to construct templates from linear combinations of the Riesz components that are corresponding to visual concepts. The visualization of these templates allows verifying the relevance of the concepts modeled. Then, the local orientations of each template are optimized to maximize their response, which is carried out analytically and can still be expressed as a linear combination of the initial steerable Riesz templates. The texture classes are learned in the feature space composed of the concatenation of the maximum responses of each visual concept using support vector machines. An experimental evaluation using the Outex TC 00010 test suite allowed a classification accuracy of 97.5%, which demonstrates the feasibility of the proposed approach. An optimal number K = 20 of clusters is required to model the visual concepts, which was found to be fewer than the number of classes. This shows that higher-level classes are sharing low-level visual concepts. The importance of rotation-covariant visual concept modeling is highlighted by allowing an absolute gain of more than 30% in accuracy. The visual concepts are modeling the local organization of directions at various scales, which is in accordance with the bottom{up visual information processing sequence of the primal sketch in Marr's theory on vision.

[1]  William T. Freeman,et al.  Presented at: 2nd Annual IEEE International Conference on Image , 1995 .

[2]  Paul F. Whelan,et al.  Local binary patterns versus signal processing texture analysis: a study from a performance evaluati , 2012 .

[3]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[4]  Zhenhua Guo,et al.  A Completed Modeling of Local Binary Pattern Operator for Texture Classification , 2010, IEEE Transactions on Image Processing.

[5]  Nong Sang,et al.  Local Binary Pattern histogram based Texton learning for texture classification , 2011, 2011 18th IEEE International Conference on Image Processing.

[6]  Nicholas Ayache,et al.  Endomicroscopic video retrieval using mosaicing and visualwords , 2010, 2010 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[7]  Edward H. Adelson,et al.  The Design and Use of Steerable Filters , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  D Marr,et al.  Theory of edge detection , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[9]  P. Fua,et al.  Learning rotational features for filament detection , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Kenneth I. Laws,et al.  Rapid Texture Identification , 1980, Optics & Photonics.

[11]  Dimitri Van De Ville,et al.  Lung Texture Classification Using Locally-Oriented Riesz Components , 2011, MICCAI.

[12]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Fei-Fei Li,et al.  Attribute Learning in Large-Scale Datasets , 2010, ECCV Workshops.

[14]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Xueming Qian,et al.  PLBP: An effective local binary patterns texture descriptor with pyramid representation , 2011, Pattern Recognit..

[16]  Tony Lindeberg,et al.  Edge Detection and Ridge Detection with Automatic Scale Selection , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[17]  Eero P. Simoncelli,et al.  Steerable wedge filters for local orientation analysis , 1996, IEEE Trans. Image Process..

[18]  Donald A. Adjeroh,et al.  Comparison of Texture Analysis Schemes Under Nonideal Conditions , 2011, IEEE Transactions on Image Processing.

[19]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[20]  Salem Nasri,et al.  Rotation invariant texture classification using Support Vector Machines , 2011, 2011 International Conference on Communications, Computing and Control Applications (CCCA).

[21]  HighWire Press Philosophical Transactions of the Royal Society of London , 1781, The London Medical Journal.

[22]  Nuno Vasconcelos,et al.  Holistic context modeling using semantic co-occurrences , 2009, CVPR.

[23]  Hayit Greenspan,et al.  X-ray Categorization and Retrieval on the Organ and Pathology Level, Using Patch-Based Visual Words , 2011, IEEE Transactions on Medical Imaging.

[24]  Matti Pietikäinen,et al.  Outex - new framework for empirical evaluation of texture analysis algorithms , 2002, Object recognition supported by user interaction for service robots.

[25]  Song-Chun Zhu,et al.  Towards a mathematical theory of primal sketch and sketchability , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[26]  Changxin Gao,et al.  Pyramid-Based Multi-structure Local Binary Pattern for Texture Classification , 2010, ACCV.

[27]  Pietro Perona,et al.  Overcomplete steerable pyramid filters and rotation invariance , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[28]  Michael Unser,et al.  Detection of symmetric junctions in biological images using 2-D steerable wavelet transforms , 2013, 2013 IEEE 10th International Symposium on Biomedical Imaging.

[29]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Rainer Herpers,et al.  The SVD approach for steerable filter design , 1998, ISCAS '98. Proceedings of the 1998 IEEE International Symposium on Circuits and Systems (Cat. No.98CH36187).

[31]  Thomas Deselaers,et al.  Visual and semantic similarity in ImageNet , 2011, CVPR 2011.

[32]  Dimitri Van De Ville,et al.  Rotation–Covariant Texture Learning Using Steerable Riesz Wavelets , 2014, IEEE Transactions on Image Processing.

[33]  Franz Kainberger,et al.  Texture Bags: Anomaly Retrieval in Medical Images Based on Local 3D-Texture Similarity , 2011, MCBR-CDS.

[34]  Zhenhua Guo,et al.  Local directional derivative pattern for rotation invariant texture classification , 2011, Neural Computing and Applications.

[35]  Antonio Torralba,et al.  Sharing Visual Features for Multiclass and Multiview Object Detection , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Michael Unser,et al.  Design of steerable filters for the detection of micro-particles , 2013, 2013 IEEE 10th International Symposium on Biomedical Imaging.

[37]  R. Porter,et al.  Robust rotation-invariant texture classification: wavelet, Gabor filter and GMRF based schemes , 1997 .

[38]  Stefanie Nowak,et al.  Overview of the CLEF 2009 Large Scale - Visual Concept Detection and Annotation Task , 2009, CLEF.

[39]  D Marr,et al.  Early processing of visual information. , 1976, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[40]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Dimitri Van De Ville,et al.  Multiscale Lung Texture Signature Learning Using the Riesz Transform , 2012, MICCAI.

[42]  M.,et al.  Statistical and Structural Approaches to Texture , 2022 .

[43]  Dimitri Van De Ville,et al.  Wavelet Steerability and the Higher-Order Riesz Transform , 2010, IEEE Transactions on Image Processing.

[44]  Henning Müller,et al.  Using Multiscale Visual Words for Lung Texture Classification and Retrieval , 2011, MCBR-CDS.

[45]  Hamid Soltanian-Zadeh,et al.  Radon transform orientation estimation for rotation invariant texture analysis , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  B. Julesz Textons, the elements of texture perception, and their interactions , 1981, Nature.

[47]  Paul Southam,et al.  Towards Texture Classification in Real Scenes , 2005, BMVC.

[48]  Yong Wang,et al.  Combining global, regional and contextual features for automatic image annotation , 2009, Pattern Recognit..

[49]  Azriel Rosenfeld,et al.  Gray-level corner detection , 1982, Pattern Recognit. Lett..

[50]  Gerald Schaefer,et al.  A comparative analysis of local binary pattern texture classification , 2012, 2012 Visual Communications and Image Processing.

[51]  Pranam Janney,et al.  Invariant Features of Local Textures—a rotation invariant local texture , 2008 .

[52]  Paul W. Fieguth,et al.  Extended local binary patterns for texture classification , 2012, Image Vis. Comput..