Ontology Based Object Learning and Recognition

Cette these se place dans le cadre du probleme de la reconnaissance d'objets et plus generalement dans celui de la vision cognitive. L'approche proposee se decompose en trois phases principales: une phase d'acquisition de connaissances qui consiste a acquerir la connaissance d'un domaine d'application sous la forme d'une hierarchie de classes d'objets et de sous parties. Il s'agit egalement de decrire ces classes du domaine en termes de concepts visuels (forme, texture, couleur, relations spatiales) fournis par une ontologie. Chaque concept visuel de cette ontologie etant associe a des descripteurs bas niveau, le fosse semantique est reduit de maniere conviviale pour un expert. La phase d'apprentissage consiste, a partir d'images d'exemples segmentees et labellisees, a obtenir un ensemble de detecteurs de concepts visuels. Ces detecteurs sont obtenus par l'entrainement de Support Vector Machines avec les descripteurs numeriques extraits dans les images d'exemples segmentees et labellisees par des concepts visuels. La phase de categorisation utilise la connaissance acquise ainsi que les detecteurs de concepts visuels obtenus lors de la phase d'apprentissage. La connaissance sert a generer des hypotheses qui doivent etre verifiees dans l'image a interpreter. Cette verification consiste a detecter des concepts visuels dans l'image segmentee automatiquement. Le resultat de la categorisation est exprime en termes de classes du domaine mais aussi en termes de concepts visuels. L'approche proposee a notamment ete utilisee ete utilisee pour l'indexation et la recherche semantique d'images.

[1]  Takashi Matsuyama,et al.  SIGMA: A Knowledge-Based Aerial Image Understanding System , 1990 .

[2]  Emanuele Trucco,et al.  Geometric Invariance in Computer Vision , 1995 .

[3]  Martin D. Levine,et al.  3-D shape approximation using parametric geons , 1997, Image Vis. Comput..

[4]  Frans Coenen,et al.  A Generic Ontology for Spatial Reasoning , 1999 .

[5]  B. Schiele,et al.  Interleaved Object Categorization and Segmentation , 2003, BMVC.

[6]  Ian Horrocks The FaCT System , 1998, TABLEAUX.

[7]  James Ze Wang,et al.  Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Cordelia Schmid,et al.  Scale-invariant shape features for recognition of object categories , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[9]  Michael Unser,et al.  B-spline snakes and a JAVA interface: an intuitive tool for general contour outlining , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[10]  Anthony G. Cohn,et al.  Qualitative Spatial Representation and Reasoning: An Overview , 2001, Fundam. Informaticae.

[11]  W. Eric L. Grimson,et al.  On the Sensitivity of the Hough Transform for Object Recognition , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Ozy Sjahputera,et al.  The use of force histograms for affine-invariant relative position description , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Sabine Moisan,et al.  Experience in Integrating Image Processing Programs , 1999, ICVS.

[14]  William A. Barrett,et al.  Toboggan-based intelligent scissors with a four-parameter edge model , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[15]  Stamos Metzidakis Barthes' image , 1987 .

[16]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[17]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[18]  Olivier D. Faugeras,et al.  HYPER: A New Approach for the Recognition and Positioning of Two-Dimensional Objects , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  B. Schiele,et al.  Combined Object Categorization and Segmentation With an Implicit Shape Model , 2004 .

[20]  Robert M. MacGregor,et al.  Inside the LOOM description classifier , 1991, SGAR.

[21]  Fabien L. Gandon,et al.  Ontology Engineering: a Survey and a Return on Experience , 2002 .

[22]  Rodney A. Brooks,et al.  Model-Based Three-Dimensional Interpretations of Two-Dimensional Images , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Alexander G. Hauptmann,et al.  The Use and Utility of High-Level Semantic Features in Video Retrieval , 2005, CIVR.

[24]  Marvin Minsky,et al.  A framework for representing knowledge" in the psychology of computer vision , 1975 .

[25]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[26]  Asunción Gómez-Pérez,et al.  Building Ontologies at the Knowledge Level using the Ontology Design Environment , 1998 .

[27]  Alexander G. Hauptmann Lessons for the Future from a Decade of Informedia Video Analysis Research , 2005, CIVR.

[28]  Andrew Blake,et al.  "GrabCut" , 2004, ACM Trans. Graph..

[29]  Sabine Moisan,et al.  Blocks, a component framework with checking facilities for knowledge-based systems , 2001, Informatica.

[30]  M.,et al.  Statistical and Structural Approaches to Texture , 2022 .

[31]  Anthony G. Cohn,et al.  Towards an Architecture for Cognitive Vision Using Qualitative Spatio-temporal Representations and Abduction , 2003, Spatial Cognition.

[32]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[33]  Nozha Boujemaa,et al.  What's beyond query by example? , 2003 .

[34]  John F. Sowa,et al.  Conceptual Structures: Information Processing in Mind and Machine , 1983 .

[35]  Jake K. Aggarwal,et al.  Model-based object recognition in dense-range images—a review , 1993, CSUR.

[36]  Alberto Del Bimbo,et al.  An Invariant Representation for Matching Trajectories Across Uncalibrated Video Streams , 2005, CIVR.

[37]  S. Edelman,et al.  Computational Theories of Object Recognition Edelman -computation and Object Recognition Ii Box 1. Structural Descriptions ~ 7~ Recognition by Components Varieties of Alignment Multidimensional Histograms Approximation in Feature Space , 2022 .

[38]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[39]  Bernd Neumann,et al.  Navigating through Logic-Based Scene Models for High-Level Scene Interpretations , 2003, ICVS.

[40]  Aleksandra Mojsilovic,et al.  Adaptive perceptual color-texture image segmentation , 2005, IEEE Transactions on Image Processing.

[41]  Cordelia Schmid,et al.  Local Grayvalue Invariants for Image Retrieval , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[42]  Deborah L. McGuinness,et al.  OWL Web ontology language overview , 2004 .

[43]  Harry Wechsler,et al.  Segmentation of Textured Images and Gestalt Organization Using Spatial/Spatial-Frequency Representations , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[44]  Deborah L. McGuinness,et al.  CLASSIC: a structural data model for objects , 1989, SIGMOD '89.

[45]  Alexander G. Hauptmann,et al.  Towards a Large Scale Concept Ontology for Broadcast Video , 2004, CIVR.

[46]  B. S. Manjunath,et al.  Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[47]  Eliseo Clementini,et al.  A Small Set of Formal Topological Relationships Suitable for End-User Interaction , 1993, SSD.

[48]  Pietro Perona,et al.  Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[49]  Gérard G. Medioni,et al.  3D structures for generic object recognition , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[50]  Rama Chellappa,et al.  Knowledge-based control of vision systems , 1999, Image Vis. Comput..

[51]  Diego Calvanese,et al.  The Description Logic Handbook: Theory, Implementation, and Applications , 2003, Description Logic Handbook.

[52]  Monique Thonnat,et al.  Image formation model of a 3D translucent object observed in light microscopy , 2002, Proceedings. International Conference on Image Processing.

[53]  Cordelia Schmid,et al.  Scale & Affine Invariant Interest Point Detectors , 2004, International Journal of Computer Vision.

[54]  Harry Shum,et al.  Lazy snapping , 2004, ACM Trans. Graph..

[55]  Anthony Widjaja,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2003, IEEE Transactions on Neural Networks.

[56]  William A. Barrett,et al.  Interactive Segmentation with Intelligent Scissors , 1998, Graph. Model. Image Process..

[57]  Asunción Gómez-Pérez,et al.  A Roadmap to Ontology Specification Languages , 2000, EKAW.

[58]  Sven J. Dickinson,et al.  A Research Roadmap of Cognitive Vision , 2005 .

[59]  Monique Thonnat,et al.  Ontology based object learning and recognition: application to image retrieval , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.

[60]  Robert B. Fisher,et al.  Class-based recognition of 3D objects represented by volumetric primitives , 1997, Image Vis. Comput..

[61]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[62]  R. Moller,et al.  Towards computer vision with description logics: some recent progress , 1999, Proceedings Integration of Speech and Image Understanding.

[63]  Nozha Boujemaa,et al.  New image retrieval paradigm: logical composition of region categories , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[64]  Atilla Baskurt,et al.  Image understanding and scene models: a generic framework integrating domain knowledge and Gestalt theory , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[65]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[66]  R GruberThomas Toward principles for the design of ontologies used for knowledge sharing , 1995 .

[67]  Sven J. Dickinson,et al.  Panel report: the potential of geons for generic 3-D object recognition , 1997, Image Vis. Comput..

[68]  Richard Campbell,et al.  Object recognition for an intelligent room , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[69]  Rachid Deriche,et al.  Fast algorithms for low-level vision , 1988, [1988 Proceedings] 9th International Conference on Pattern Recognition.

[70]  M R Quillian,et al.  Word concepts: a theory and simulation of some basic semantic capabilities. , 1967, Behavioral science.

[71]  Sara Shatford,et al.  Analyzing the Subject of a Picture: A Theoretical Approach , 1986 .

[72]  Enver Sangineto,et al.  An abstract representation of geometric knowledge for object classification , 2003, Pattern Recognit. Lett..

[73]  H. Barlow Vision: A computational investigation into the human representation and processing of visual information: David Marr. San Francisco: W. H. Freeman, 1982. pp. xvi + 397 , 1983 .

[74]  François Brémond,et al.  Automatic Video Interpretation: A Novel Algorithm for Temporal Scenario Recognition , 2003, IJCAI.

[75]  A. Ravishankar Rao,et al.  Towards a texture naming system: Identifying relevant dimensions of texture , 1993, Vision Research.

[76]  Céline Hudelot Towards a Cognitive Vision Platform for Semantic Image Interpretation; Application to the Recognition of Biological Organisms , 2005 .

[77]  Volker Haarslev,et al.  Description of the RACER System and its Applications , 2001, Description Logics.

[78]  Francesco M. Donini,et al.  Structured Knowledge Representation for Image Retrieval , 2011, J. Artif. Intell. Res..

[79]  A. Ravishankar Rao,et al.  The Texture Lexicon: Understanding the Categorization of Visual Texture Terms and Their Relationship to Texture Images , 1997, Cogn. Sci..

[80]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[81]  Gérard G. Medioni,et al.  Perceptual grouping for generic recognition , 2004, International Journal of Computer Vision.

[82]  Philippe Mulhem,et al.  Integrating Perceptual Signal Features within a Multi-facetted Conceptual Model for Automatic Image Retrieval , 2004, ECIR.

[83]  Tieniu Tan,et al.  Brief review of invariant texture analysis methods , 2002, Pattern Recognit..

[84]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[85]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[86]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[87]  Ching-chih Chen,et al.  Automated semantic annotation and retrieval based on sharable ontology and case-based learning techniques , 2003, 2003 Joint Conference on Digital Libraries, 2003. Proceedings..

[88]  Pietro Perona,et al.  Object class recognition by unsupervised scale-invariant learning , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[89]  Bruce A. Draper,et al.  The schema system , 1988, International Journal of Computer Vision.

[90]  Bernt Schiele,et al.  Recognition without Correspondence using Multidimensional Receptive Field Histograms , 2004, International Journal of Computer Vision.

[91]  Bernard Dubuisson,et al.  A statistical decision rule with incomplete knowledge about classes , 1993, Pattern Recognit..

[92]  Ronen Basri,et al.  Recognition by prototypes , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[93]  Richard Lepage,et al.  Knowledge-Based Image Understanding Systems: A Survey , 1997, Comput. Vis. Image Underst..

[94]  Ramin Zabih,et al.  Comparing images using color coherence vectors , 1997, MULTIMEDIA '96.

[95]  David Sinclair,et al.  Language-based querying of image collections on the basis of an extensible ontology , 2004, Image Vis. Comput..

[96]  Rose Dieng,et al.  Knowledge Engineering and Knowledge Management Methods, Models, and Tools , 2002, Lecture Notes in Computer Science.

[97]  M. Berthod,et al.  Automatic classification of planktonic foraminifera by a knowledge-based system , 1994, Proceedings of the Tenth Conference on Artificial Intelligence for Applications.

[98]  Shih-Fu Chang,et al.  Semantic visual templates: linking visual features to semantics , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[99]  Martin D. Levine,et al.  Low Level Image Segmentation: An Expert System , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[100]  François Brémond,et al.  Design and Assessment of an Intelligent Activity Monitoring Platform , 2005, EURASIP J. Adv. Signal Process..

[101]  G. Miller,et al.  Language and Perception , 1976 .

[102]  Nicola Guarino,et al.  Ontologies and Knowledge Bases. Towards a Terminological Clarification , 1995 .

[103]  Jitendra Malik,et al.  Matching Shapes , 2001, ICCV.

[104]  Marinette Revenu,et al.  Borg: A Knowledge-Based System for Automatic Generation of Image Processing Programs , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[105]  Heinrich Niemann,et al.  ERNEST: A Semantic Network System for Pattern Understanding , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[106]  Monique Thonnat,et al.  A knowledge-based approach to integration of image processing procedures , 1993 .

[107]  Bruce A. Draper,et al.  Knowledge-directed vision: control, learning, and integration , 1996, Proc. IEEE.

[108]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[109]  Rosalind W. Picard Toward a Visual Thesaurus , 1995, MIRO.

[110]  Pietro Perona,et al.  Towards automatic discovery of object categories , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[111]  Shimon Ullman,et al.  A Pictorial Approach to Object Classification , 1991, IJCAI.

[112]  Frédéric Precioso,et al.  Robust real-time segmentation of images and videos using a smooth-spline snake-based algorithm , 2005, IEEE Transactions on Image Processing.

[113]  Jitendra Malik,et al.  Blobworld: A System for Region-Based Image Indexing and Retrieval , 1999, VISUAL.

[114]  Allen R. Hanson,et al.  Computer Vision Systems , 1978 .

[115]  Chi-Ren Shyu,et al.  Modeling Multi-object Spatial Relationships for Satellite Image Database Indexing and Retrieval , 2005, CIVR.