A Logical Framework for Depiction and Image Interpretation

We propose a logical framework for depiction and interpretation that formalizes image domain knowledge, scene domain knowledge and the depiction mapping between the image and scene domains. This framework requires three sets of axioms: image axioms, scene axioms and depiction axioms. An interpretation of an image is defined to be a logical model of these axioms. \n The approach is illustrated by a case study, a reconstruction in first order logic of a simplified map understanding program, Mapsee. The reconstruction starts with a description of the map and a specification of general knowledge of maps, geographic objects and their depiction relationships. For the simple map world we show how the task level specification may be refined to a provably correct implementation by applying model-preserving transformations to the initial logical representation to produce a set of propositional formulas. The implementation may use known constraint satisfaction techniques to find the set of models of these propositional formulas. In addition, we sketch preliminary logical treatments for image queries, contingent scene knowledge, ambiguity in image description, occlusion, complex objects, preferred interpretations and image synthesis. \n This approach provides a formal framework for analyzing and going beyond existing systems such as Mapsee, and for understanding the use of constraint satisfaction techniques. It can be used as a foundation for the specification, design and implementation of vision and graphics systems that are correct with respect to the task and algorithm levels.

[1]  Paul W. Purdom,et al.  Solving Satisfiability with Less Searching , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Raymond Reiter,et al.  The Logic of Depiction , 1987 .

[3]  William S. Havens,et al.  Knowledge Structuring and Constraint Satisfaction: The Mapsee Approach , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Raymond Reiter On Closed World Data Bases , 1977, Logic and Data Bases.

[5]  Alan K. Mackworth,et al.  Analysis by synthesis in computational vision with application to remote sensing , 1985 .

[6]  Alan K. Mackworth Constraint Satisfaction , 1985 .

[7]  Wolfgang Bibel,et al.  On Matrices with Connections , 1981, JACM.

[8]  William F. Clocksin,et al.  Programming in Prolog , 1981, Springer Berlin Heidelberg.

[9]  Rodney A. Brooks,et al.  Symbolic Reasoning Among 3-D Models and 2-D Images , 1981, Artif. Intell..

[10]  John K. Tsotsos Knowledge organization and its role in representation and interpretation for time‐varying data: the ALVEN system , 1985, Comput. Intell..

[11]  Jan Andreas Mulder Using discrimination graphs to represent visual knowledge , 1985 .

[12]  Adolfo Guzman,et al.  Decomposition of a visual scene into three-dimensional bodies , 1968 .

[13]  Alan K. Mackworth On Reading Sketch Maps , 1977, IJCAI.

[14]  Alan K. Mackworth Adequacy Criteria for Visual Knowledge Representation , 1987 .

[15]  Elliott Mendelson,et al.  Introduction to Mathematical Logic , 1979 .

[16]  M. B. Clowes,et al.  On Seeing Things , 1971, Artif. Intell..

[17]  William S. Havens,et al.  Hierarchical arc consistency: exploiting structured domains in constraint satisfaction problems , 1985 .

[18]  Robert A. Kowalski,et al.  Logic for problem solving , 1982, The computer science library : Artificial intelligence series.

[19]  Larry Wos,et al.  Efficiency and Completeness of the Set of Support Strategy in Theorem Proving , 1965, JACM.

[20]  John McCarthy,et al.  SOME PHILOSOPHICAL PROBLEMS FROM THE STANDPOINT OF ARTI CIAL INTELLIGENCE , 1987 .

[21]  David Maier,et al.  The Theory of Relational Databases , 1983 .

[22]  Wolfgang Bibel,et al.  Constraint Satisfaction from a Deductive Viewpoint , 1988, Artif. Intell..

[23]  William S. Havens,et al.  Representing Knowledge of the Visual World , 1983, Computer.

[24]  Gladys Magali Wong Depiction and domains in visual knowledge representation , 1986 .

[25]  Raymond Reiter,et al.  Equality and Domain Closure in First-Order Databases , 1980, JACM.