Classifying Images from Athletics Based on Spatial Relations

Spatial relations between image regions are used in this paper for image classification in a rule-based fashion. In the particular case where image regions correspond to semantically interpretable objects the rules provide the means for justifying classification in a human-familiar manner. In the work presented here instances of particular object classes are detected combining bottom-up (learnable models based on simple features) and top-down information (object models consisting of primitive geometric shapes such as lines). The rule-based system acts as a model for the spatial configuration of objects. Experimental results in the athletic domain show that despite inaccuracy in object detection, spatial relations allow for efficient discrimination between visually similar images classes.

[1]  Arnold W. M. Smeulders,et al.  PicToSeek: combining color and shape invariant features for image retrieval , 2000, IEEE Trans. Image Process..

[2]  Lior Wolf,et al.  A Critical View of Context , 2006, International Journal of Computer Vision.

[3]  B. S. Manjunath,et al.  NeTra: A toolbox for navigating large image databases , 1997, Multimedia Systems.

[4]  C.-C. Jay Kuo,et al.  Texture analysis and classification with tree-structured wavelet transform , 1993, IEEE Trans. Image Process..

[5]  Yali Amit,et al.  A coarse-to-fine strategy for multiclass shape detection , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Yixin Chen,et al.  CLUE: cluster-based retrieval of images by unsupervised learning , 2005, IEEE Transactions on Image Processing.

[7]  Anat Levin,et al.  Learning to Combine Bottom-Up and Top-Down Segmentation , 2006, ECCV.

[8]  Shimon Ullman,et al.  Combining Top-Down and Bottom-Up Segmentation , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[9]  Christos Faloutsos,et al.  QBIC project: querying images by content, using color, texture, and shape , 1993, Electronic Imaging.

[10]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[11]  Volker Haarslev,et al.  Racer: A Core Inference Engine for the Semantic Web , 2003, EON.

[12]  Walter F. Bischof,et al.  Learning spatio-temporal relational structures , 2001, Appl. Artif. Intell..

[13]  Yannis Avrithis,et al.  Facial Image Indexing in Multimedia Databases , 2001, Pattern Analysis & Applications.

[14]  K. Selçuk Candan,et al.  Hierarchical Image Modeling for Object-Based Media Retrieval , 1998, Data Knowl. Eng..

[15]  Ashish Kapoor,et al.  Located Hidden Random Fields: Learning Discriminative Parts for Object Detection , 2006, ECCV.

[16]  Xiaodong Fan Contextual disambiguation for multi-class object detection , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[17]  Paul L. Rosin,et al.  Shape measures for image retrieval , 2003, Pattern Recognit. Lett..

[18]  Shih-Fu Chang,et al.  Visually Searching the Web for Content , 1997, IEEE Multim..

[19]  Anthony G. Constantinides,et al.  Visual attention based region of interest coding for video-telephony applications , 2006 .

[20]  Jiebo Luo,et al.  Probabilistic spatial context models for scene content understanding , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[21]  John R. Smith,et al.  MPEG-7 multimedia description schemes , 2001, IEEE Trans. Circuits Syst. Video Technol..

[22]  Robert C. Williges,et al.  Criteria For Evaluating Usability Evaluation Methods , 2001, Int. J. Hum. Comput. Interact..

[23]  Donald Geman,et al.  Coarse-to-Fine Face Detection , 2004, International Journal of Computer Vision.

[24]  B. S. Manjunath,et al.  Unsupervised Segmentation of Color-Texture Regions in Images and Video , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  Chrisa Tsinaraki,et al.  An Ontology-Driven Framework for the Management of Semantic Metadata Describing Audiovisual Information , 2003, CAiSE.