Learning Logic Rules for the Tower of Knowledge Using Markov Logic Networks

In this paper, we propose a novel logic-rule learning approach for the Tower of Knowledge (ToK) architecture, based on Markov logic networks, for scene interpretation. This approach is in the spirit of the recently proposed Markov logic networks for machine learning. Its purpose is to learn the soft-constraint logic rules for labeling the components of a scene. In our approach, FOIL (First Order Inductive Learner) is applied to learn the logic rules for MLN and then gradient ascent search is utilized to compute weights attached to each rule for softening the rules. This approach also benefits from the architecture of ToK, in reasoning whether a component in a scene has the right characteristics in order to fulfil the functions a label implies, from the logic point of view. One significant advantage of the proposed approach, rather than the previous versions of ToK, is its automatic logic learning capability such that the manual insertion of logic rules is not necessary. Experiments of labeling the identified components in buildings, for building scene interpretation, illustrate the promise of this approach.

[1]  Maria Petrou,et al.  Recursive Tower of Knowledge for Learning to Interpret Scenes , 2008, BMVC.

[2]  Bernd Neumann,et al.  On scene interpretation with description logics , 2006, Image Vis. Comput..

[3]  Pushmeet Kohli,et al.  Simultaneous Segmentation and Pose Estimation of Humans Using Dynamic Graph Cuts , 2008, International Journal of Computer Vision.

[4]  M. Thonnat,et al.  Symbol Grounding for Semantic Image Interpretation: From Image Data to Semantics , 2005, Tenth IEEE International Conference on Computer Vision Workshops (ICCVW'05).

[5]  Ramakant Nevatia,et al.  Expandable Bayesian Networks for 3D Object Description from Multiple Views and Multiple Mode Inputs , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Matthew Richardson,et al.  Markov logic networks , 2006, Machine Learning.

[7]  Pedro M. Domingos,et al.  Discriminative Training of Markov Logic Networks , 2005, AAAI.

[8]  Jerome A. Feldman,et al.  A Semantics-Based Decision Theory Region Analyser , 1973, IJCAI.

[9]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[10]  Joachim M. Buhmann,et al.  Learning the Compositional Nature of Visual Objects , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Luc Van Gool,et al.  Towards Multi-View Object Class Detection , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[12]  Y. Ohta Knowledge-based interpretation of outdoor natural color scenes , 1998 .

[13]  Laurent Itti,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence 1 Rapid Biologically-inspired Scene Classification Using Features Shared with Visual Attention , 2022 .

[14]  Maria Petrou,et al.  Learning in Computer Vision: Some Thoughts , 2007, CIARP.

[15]  Feng Han,et al.  Bottom-Up/Top-Down Image Parsing with Attribute Grammar , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  L. Stark,et al.  Dissertation Abstract , 1994, Journal of Cognitive Education and Psychology.

[17]  Christoph von der Malsburg,et al.  A Neural System for the Recognition of Partially Occluded Objects in Cluttered Scenes: A Pilot Study , 1993, Int. J. Pattern Recognit. Artif. Intell..

[18]  Antonio Torralba,et al.  Describing Visual Scenes Using Transformed Objects and Parts , 2008, International Journal of Computer Vision.