Integration of different computational models in a computer vision framework

A general (application independent) computer vision framework is proposed. It follows the methodology of knowledge-base systems - dividing a system into knowledge base and control. We choose procedural semantic networks for object-oriented modelling of the world. It is basically a non-monotonic logical system. Several inference rules are proposed that allow to create instances of model concepts. In order to activate an inference rule a model-to-image data matching process need to be performed. We view this matching as a solution to constraint satisfaction problem (CSP), supported by Bayesian net-based evaluation of partial variable assignments. A modified incremental search for CSP is designed that allows partial solutions and calls for stochastic inference in order to provide judgments of partial states. Hence the detection of partial occlusion of objects is handled consistently with Bayesian inference over evidence and hidden variables.

[1]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[2]  Stuart J. Russell,et al.  Dynamic bayesian networks: representation, inference and learning , 2002 .

[3]  Robert M. Haralick,et al.  The Consistent Labeling Problem: Part I , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Gerhard Sagerer,et al.  Neuronal, Statistisch, Wissensbasiert: Ein Beitrag zur Paradigmendiskussion für die Mustererkennung , 1993, DAGM-Symposium.

[5]  Taein Lee,et al.  Active contour models , 2005 .

[6]  Rodney A. Brooks,et al.  Symbolic Reasoning Among 3-D Models and 2-D Images , 1981, Artif. Intell..

[7]  Włodzimierz Kasprzak,et al.  Adaptive computation methods in digital image sequence analysis , 2000 .

[8]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[9]  A. Wilkowski,et al.  Hand Gesture Recognition in Image Sequences Using Active Contours and HMMs , 2009 .

[10]  Hans-Hellmut Nagel,et al.  From image sequences towards conceptual descriptions , 1988, Image Vis. Comput..

[11]  Demetri Terzopoulos Deformable Models: Classic, Topology-Adaptive and Generalized Formulations , 2003 .

[12]  H. Barlow Vision: A computational investigation into the human representation and processing of visual information: David Marr. San Francisco: W. H. Freeman, 1982. pp. xvi + 397 , 1983 .

[13]  R M Haralick,et al.  The consistent labeling problem: part I. , 1979, IEEE transactions on pattern analysis and machine intelligence.

[14]  King-Sun Fu,et al.  Subgraph error-correcting isomorphisms for syntactic pattern recognition , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[15]  Takeo Kanade,et al.  Neural Network-Based Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Heinrich Niemann,et al.  ERNEST: A Semantic Network System for Pattern Understanding , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Włodzimierz Kasprzak,et al.  Constrained contour matching in hand posture recognition , 2009 .

[18]  Peter E. Hart,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[19]  Włodzimierz Kasprzak,et al.  A linguistic approach to 3-D object recognition , 1987, Comput. Graph..

[20]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[21]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Wojciech Szynkiewicz,et al.  Rubik's cube reconstruction from single view for Service robots , 2006 .

[23]  W. Pedrycz,et al.  Medical Image Understanding Technology , 2005, IEEE Transactions on Neural Networks.

[24]  Paul Y. S. Cheung,et al.  Fuzzy-attribute graph with application to Chinese character recognition , 1992, IEEE Trans. Syst. Man Cybern..

[25]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .