The Tower of Knowledge Scheme for Learning in Computer Vision

A scheme, named tower of knowledge (ToK), is proposed for interpreting 3D scenes. The ToK encapsulates causal dependencies between object appearance and functionality. We demonstrate it by labelling the components of the 3D model of a building, reconstructed from images of multiple views, by using utility theory.

[1]  Maria Petrou,et al.  Non-Gibbsian Markov Random Field Models for Contextual Labelling of Structured Scenes , 2007, BMVC.

[2]  Lucas Paletta,et al.  Urban Object Recognition from Informative Local Features , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[3]  Steven W. Zucker,et al.  On the Foundations of Relaxation Labeling Processes , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Ramesh C. Jain,et al.  Three-dimensional object recognition , 1985, CSUR.

[5]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[6]  William J. Christmas,et al.  Structural Matching in Computer Vision Using Probabilistic Relaxation , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Maria Petrou,et al.  Learning in Computer Vision: Some Thoughts , 2007, CIARP.

[8]  Roberto Cipolla,et al.  Modelling and Interpretation of Architecture from Several Images , 2004, International Journal of Computer Vision.

[9]  Allen R. Hanson,et al.  Control in a 3D reconstruction system using selective perception , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[10]  大西 仁,et al.  Pearl, J. (1988, second printing 1991). Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan-Kaufmann. , 1994 .

[11]  Allen R. Hanson,et al.  Decision Making and Uncertainty Management in a 3D Reconstruction System , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Patrick Henry Winston,et al.  Learning structural descriptions from examples , 1970 .

[13]  Allen R. Hanson,et al.  Cost and Information-Driven Algorithm Selection for Vision Systems , 2004, ICIAR.

[14]  Vadim Bulitko,et al.  Machine Learning for Adaptive Image Interpretation , 2004, AAAI.

[15]  Ramakant Nevatia,et al.  Expandable Bayesian Networks for 3D Object Description from Multiple Views and Multiple Mode Inputs , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[17]  Heiko Wersing,et al.  Evolutionary optimization of a hierarchical object recognition model , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[18]  S. Ekvall,et al.  Integrating object and grasp recognition for dynamic scene interpretation , 2005, ICAR '05. Proceedings., 12th International Conference on Advanced Robotics, 2005..

[19]  Bernd Neumann,et al.  On scene interpretation with description logics , 2006, Image Vis. Comput..

[20]  Paul A. Viola,et al.  Learning from one example through shared densities on transforms , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[21]  Pietro Perona,et al.  Unsupervised Learning of Models for Recognition , 2000, ECCV.