Object Tracking and Object Change Detection in Desktop Manipulation for Video-Based Interactive Manuals

This paper introduces a novel method for object tracking and recognition in assembly work. The goal is to index instructional videos and to provide appropriate instructions to a user during actual assembly work. Object tracking in this setting cannot rely on prior knowledge such as an object's shape or color, since objects are often moved, assembled, or even crushed. Clutter in the environment and environmental changes must also be handled. To this end, we use two or more pairs of image sensors. With this method, an object held by a hand is reliably detected, and its 3D region, that is, its volume and location, is obtained in real time using shape-from-silhouette. Observing this volume over time allows changes in the object's state to be estimated, and these changes can serve as good indices for the steps of the assembly work.
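The shape-from-silhouette step can be illustrated with a minimal voxel-carving sketch. It is not the authors' implementation: it assumes idealized orthographic views along the three coordinate axes instead of calibrated camera pairs, and the function names (`carve`, `volume_and_centroid`, `volume_changed`), the grid size, and the 20% change threshold are hypothetical choices made only for illustration.

```python
from itertools import product

def carve(silhouettes, n):
    """Keep the voxels of an n*n*n grid that project inside every silhouette.

    silhouettes maps a viewing axis (0, 1 or 2) to a set of 2D pixel tuples.
    Assumption: each view is orthographic along its axis, so projecting a
    voxel simply drops that coordinate.
    """
    kept = []
    for voxel in product(range(n), repeat=3):
        inside_all = all(
            tuple(c for i, c in enumerate(voxel) if i != axis) in sil
            for axis, sil in silhouettes.items()
        )
        if inside_all:
            kept.append(voxel)
    return kept

def volume_and_centroid(voxels, voxel_size=1.0):
    """Return (volume, centroid) of the carved region; centroid is None if empty."""
    count = len(voxels)
    if count == 0:
        return 0.0, None
    volume = count * voxel_size ** 3
    # Voxel centers sit half a voxel past the integer grid index.
    centroid = tuple(
        voxel_size * (sum(v[i] for v in voxels) / count + 0.5) for i in range(3)
    )
    return volume, centroid

def volume_changed(prev_volume, curr_volume, rel_tol=0.2):
    """Flag a state change when the volume differs by more than rel_tol (hypothetical threshold)."""
    if prev_volume == 0:
        return curr_volume > 0
    return abs(curr_volume - prev_volume) / prev_volume > rel_tol

# Frame 1: a 3x3 square silhouette seen from all three axes.
square3 = {(a, b) for a in range(2, 5) for b in range(2, 5)}
frame1 = carve({0: square3, 1: square3, 2: square3}, n=8)
vol1, center1 = volume_and_centroid(frame1)

# Frame 2: the object appears smaller in every view (e.g. after being crushed).
square2 = {(a, b) for a in range(2, 4) for b in range(2, 4)}
frame2 = carve({0: square2, 1: square2, 2: square2}, n=8)
vol2, center2 = volume_and_centroid(frame2)

print(vol1, center1, vol2, volume_changed(vol1, vol2))
```

The carved region is the intersection of the silhouette cones (the visual hull), so it can only overestimate the true object; a real system would replace the coordinate-dropping projection with calibrated perspective projections and would tune the change threshold to the sensor noise.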
