Function from visual analysis and physical interaction: a methodology for recognition of generic classes of objects

The GRUFF-I (Generic Recognition Using Form, Function and Interaction) system reasons about and generates plans for interaction with 3-D shapes for the purpose of generic object recognition. A researcher selects an object and places it in an observation area. An initial intensity and range image are acquired and provided as input to a three-stage recognition system. The first stage builds a 3-D model. The second stage considers the shape-suggested functionality of this model by applying concepts of physics and causation (e.g., to infer stability) to label the object's potential functionality. The third stage uses this labeling to instantiate a plan for interaction to confirm the object's functional use in a task by incorporating feedback from both visual and robotic sensors. Results of this work are presented for eighteen chair-like and cup-like objects. Major conclusions from this work include: (1) metrically accurate representations of the world can be built and used for higher level reasoning, (2) shape-based reasoning prior to interaction-based reasoning provides an efficient methodology for object recognition, in terms of the judicious use of system resources, and (3) interaction-based reasoning helps to confirm the functionality of a categorized object without explicitly determining the object's material composition.

[1]  Kevin W. Bowyer,et al.  'We do dishes, but we don't do windows': function-based modeling and recognition of rigid objects , 1992, Other Conferences.

[2]  Jake K. Aggarwal,et al.  Model-based object recognition in dense-range images—a review , 1993, CSUR.

[3]  G. Reeke Marvin Minsky, The Society of Mind , 1991, Artif. Intell..

[4]  Joe Jackson,et al.  Knowledge-based prehension: capturing human dexterity , 1988, Proceedings. 1988 IEEE International Conference on Robotics and Automation.

[5]  Tony Lindeberg,et al.  Scale-Space Theory in Computer Vision , 1993, Lecture Notes in Computer Science.

[6]  Ales Leonardis,et al.  Grasping arbitrarily shaped 3-D objects from a pile , 1995, Proceedings of 1995 IEEE International Conference on Robotics and Automation.

[7]  Sharon A. Stansfield,et al.  Haptic Perception with an Articulated, Sensate Robot Hand , 1992, Robotica.

[8]  Andrew W. Fitzgibbon,et al.  An Experimental Comparison of Range Image Segmentation Algorithms , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Ashok K. Goel Integrating Case-Based and Model-Based Reasoning: A Computational Model of Design Problem Solving , 1992, AI Mag..

[10]  Ruzena Bajcsy,et al.  Occlusions as a Guide for Planning the Next View , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Ruzena Bajcsy,et al.  An Active Approach to Characterization and Recognition of Functionality and Functional Properties , 1993 .

[12]  Jack Hodges,et al.  Naive mechanics: a computational model of device use and function in design improvisation , 1992, IEEE Expert.

[13]  S. Kosslyn,et al.  Components of high-level vision: A cognitive neuroscience analysis and accounts of neurological syndromes , 1990, Cognition.

[14]  Michael R. Lowry,et al.  Learning Physical Descriptions From Functional Definitions, Examples, and Precedents , 1983, AAAI.

[15]  Lawrence O. Hall,et al.  Investigation of methods of combining functional evidence for 3-D object recognition , 1991, Other Conferences.

[16]  Rodney A. Brooks,et al.  Model-Based Computer Vision , 1984 .

[17]  Gregory D. Hager,et al.  Real-time feature tracking and projective invariance as a basis for hand-eye coordination , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Dmitry B. Goldgof,et al.  Extracting known and inferred shape information from a single view , 1992, Other Conferences.

[19]  Martti Mäntylä,et al.  Introduction to Solid Modeling , 1988 .

[20]  Matthew Brand,et al.  Eye for design: why, where, and how to look for causal structure in visual scenes , 1992, Other Conferences.

[21]  Sharon A. Stansfield,et al.  Representing generic objects for exploration and recognition , 1988, Proceedings. 1988 IEEE International Conference on Robotics and Automation.

[22]  L. Uhr,et al.  Representing and using functional definitions for visual recognition , 1987 .

[23]  Rodney A. Brooks,et al.  Model-Based Three-Dimensional Interpretations of Two-Dimensional Images , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Dmitry B. Goldgof,et al.  The Space Envelope: A Representation for 3D Scenes , 1998, Comput. Vis. Image Underst..

[25]  B. Tversky,et al.  Categories of environmental scenes , 1983, Cognitive Psychology.

[26]  S. Pinker,et al.  Visual cognition : An introduction * , 1989 .

[27]  Oscar Firschein,et al.  Readings in computer vision: issues, problems, principles, and paradigms , 1987 .

[28]  ARISTIDES A. G. REQUICHA,et al.  Representations for Rigid Solids: Theory, Methods, and Systems , 1980, CSUR.

[29]  Fausto Giunchiglia,et al.  FUR: Understanding functional reasoning , 1989, Int. J. Intell. Syst..

[30]  Dmitry B. Goldgof,et al.  Building a B-rep from a segmented range image , 1994, Proceedings of 1994 IEEE 2nd CAD-Based Vision Workshop.

[31]  Ramakant Nevatia,et al.  A method for recognition and localization of generic objects for indoor navigation , 1994, Proceedings of 1994 IEEE Workshop on Applications of Computer Vision.

[32]  Kevin W. Bowyer,et al.  Generic Recognition of Articulated Objects through Reasoning about Potential Function , 1995, Comput. Vis. Image Underst..

[33]  Louise Stark,et al.  Towards a domain-independent function-based recognition system , 1995 .

[34]  Wayne D. Gray,et al.  Basic objects in natural categories , 1976, Cognitive Psychology.

[35]  Michael Brady,et al.  Generating and Generalizing Models of Visual Objects , 1987, Artif. Intell..

[36]  Kevin W. Bowyer,et al.  Representing and Reasoning About Object Functionality: Towards an Integrated Approach , 1996, Intelligent Robots.

[37]  Louise Stark Recognizing object function through reasoning about 3-D shape and dynamic physical properties , 1994, CVPR.

[38]  Jack Hodges Functional and Physical Object Characteristics and Object Recognition in Improvisation , 1995, Comput. Vis. Image Underst..

[39]  Dmitry B. Goldgof,et al.  Function-based recognition from incomplete knowledge of shape , 1993 .

[40]  Gérard G. Medioni,et al.  Surface description of complex objects from multiple range images , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[41]  Ruzena Bajcsy,et al.  Identification of functional features through observations and interactions , 1995 .

[42]  Lawrence G. Roberts,et al.  Machine Perception of Three-Dimensional Solids , 1963, Outstanding Dissertations in the Computer Sciences.

[43]  Gregory D. Hager,et al.  Robot feedback control based on stereo vision: towards calibration-free hand-eye coordination , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[44]  Kevin W. Bowyer,et al.  Generic recognition of articulated objects by reasoning about functionality , 1994, ICPR.

[45]  Azriel Rosenfeld,et al.  Navigational Functionalities , 1995, Comput. Vis. Image Underst..

[46]  Ramesh C. Jain,et al.  Three-dimensional object recognition , 1985, CSUR.

[47]  Peter K. Allen,et al.  Forming complex dextrous manipulations from task primitives , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[48]  Azriel Rosenfeld,et al.  Recognition by Functional Parts , 1995, Comput. Vis. Image Underst..

[49]  Marie-Christine Jaulent,et al.  Object structure and action requirements: A compatibility model for functional recognition , 1991, Int. J. Intell. Syst..

[50]  節雄 大須賀,et al.  Advances in information modelling and knowledge bases , 1991 .

[51]  Ehud Rivlin,et al.  Recognizing functionality , 1995, Proceedings of International Symposium on Computer Vision - ISCV.

[52]  Rodney A. Brooks,et al.  Symbolic Reasoning Among 3-D Models and 2-D Images , 1981, Artif. Intell..

[53]  Kevin W. Bowyer,et al.  Function-based generic recognition for multiple object categories , 1994 .

[54]  Kazuhiko Kawamura,et al.  Humanoids: Future Robots for Home and Factory , 1998 .

[55]  Ehud Rivlin,et al.  Function From Motion , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[56]  Charles R. Dyer,et al.  Model-based recognition in robot vision , 1986, CSUR.

[57]  Dmitry B. Goldgof,et al.  Recovery of global nonrigid motion-a model based approach without point correspondences , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[58]  Ruzena Bajcsy,et al.  Interactive Recognition and Representation of Functionality , 1995, Comput. Vis. Image Underst..

[59]  Thea Iberall,et al.  The nature of human prehension: Three dextrous hands in one , 1987, Proceedings. 1987 IEEE International Conference on Robotics and Automation.

[60]  Peter K. Allen,et al.  Using tactile and visual sensing with a robotic hand , 1997, Proceedings of International Conference on Robotics and Automation.

[61]  R. Bajcsy,et al.  Exploratory procedures for material properties: the temperature perception , 1991, Fifth International Conference on Advanced Robotics 'Robots in Unstructured Environments.

[62]  Ruzena Bajcsy,et al.  Shape and Function , 1987, Other Conferences.

[63]  Kevin W. Bowyer,et al.  Generic recognition through qualitative reasoning about 3-D shape and object function , 1991, CVPR.

[64]  Lawrence Birnbaum,et al.  Causal Scene Understanding , 1995, Comput. Vis. Image Underst..

[65]  R. Brooks Model-Based Three-Dimensional Interpretations of Two-Dimensional Images , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[66]  L. Stark,et al.  Dissertation Abstract , 1994, Journal of Cognitive Education and Psychology.

[67]  Zen Chen,et al.  Incremental model building of polyhedral objects using structured light , 1993, Pattern Recognit..

[68]  Gregory D. Hager,et al.  A vision-based grasping system for unfamiliar planar objects , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[69]  Kevin W. Bowyer,et al.  GRUFF-3: Generalizing the domain of a function-based recognition system , 1994, Pattern Recognit..

[70]  B. Tversky,et al.  Journal of Experimental Psychology : General VOL . 113 , No . 2 JUNE 1984 Objects , Parts , and Categories , 2005 .

[71]  Matthew Brand,et al.  Physics-Based Visual Understanding , 1997, Comput. Vis. Image Underst..

[72]  Dmitry B. Goldgof,et al.  Extracting a Valid Boundary Representation from a Segmented Range Image , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[73]  Of references. , 1966, JAMA.

[74]  N. Warren Studies in cross-cultural psychology , 1977 .

[75]  Claude Laurgeau,et al.  A Road Scene Understanding System based on a Blackboard Architecture , 1993 .

[76]  Piero P. Bonissone,et al.  Selecting Uncertainty Calculi and Granularity: An Experiment in Trading-off Precision and Complexity , 1985, UAI.

[77]  Frank Ade,et al.  Grasping Unknown Objects , 1994, Modelling and Planning for Sensor Based Intelligent Robot Systems.

[78]  Eric Krotkov,et al.  Analysis and synthesis of the sounds of impact based on shape-invariant properties of materials , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[79]  Lipika Dey Recognition and Learning of Unknown Objects in a Hierarchical Knowledge-Base , 1993 .

[80]  Kazuhiko Kawamura,et al.  A low-cost, DSP-based, intelligent vision system for robotic applications , 1996, Proceedings of IEEE International Conference on Robotics and Automation.