Scene Segmentation Based On Object Model Using Multisensory Information

We propose a method for model- based scene segmentation which separates object instances from the background in the scene. An object model is composed of parameterized geo- metric primitives constrained to each other accord- ing to its shape and size variations, and the rela- tionships to other object classes. As sensory data, we use a pair of color stereo images which provides color and disparity information. Using these infor- mation and the constraints in model descriptions, we approach to the problems of scene segmenta- tion and object recognition. First, we construct a height map, which shows a top view of the input scene, transforming the disparity information into 3-D space based on the height and tilt of the camera system. Using the size constraints of the object model, we separate candidate areas for object instances from the back- ground. Next, parts of the model are extracted from the scene descriptions inside each candidate area, and free parameters in the object model are determined so that the instance can be best fitted to the model. Finally, uncertainty measure for pla- nar patches is defined and applied to the extracted parts. We show the preliminary results using car models.

[1]  W. Eric L. Grimson,et al.  On the sensitivity of geometric hashing , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[2]  Minoru Asada,et al.  A multistage stereo method giving priority to reliable matching with self calibration by stereo , 1991, Proceedings IROS '91:IEEE/RSJ International Workshop on Intelligent Robots and Systems '91.

[3]  M. Asada Building A 3-D World Model For A Mobile Robot From Sensory Data , 1990 .

[4]  Takeo Kanade,et al.  Automatic generation of object recognition programs , 1988, Proc. IEEE.

[5]  Anil K. Jain,et al.  BONSAI: 3D Object Recognition Using Constrained Search , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Minoru Asada,et al.  Geometric reasoning for world model representation based on planar patch with uncertainty from video range images , 1991, Proceedings. 1991 IEEE International Conference on Robotics and Automation.

[7]  Thomas O. Binford,et al.  Survey of Model-Based Image Analysis Systems , 1982 .

[8]  Robert C. Bolles,et al.  3DPO: A Three- Dimensional Part Orientation System , 1986, IJCAI.

[9]  Rodney A. Brooks,et al.  Symbolic Reasoning Among 3-D Models and 2-D Images , 1981, Artif. Intell..

[10]  Keiichiro Kadono,et al.  Context-constrained matching of hierarchical CAD-based models for outdoor scene interpretation , 1991, [1991 Proceedings] Workshop on Directions in Automated CAD-Based Vision.

[11]  W. Eric L. Grimson,et al.  The effect of indexing on the complexity of object recognition , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[12]  Takashi Matsuyama Expert systems for image processing: Knowledge-based composition of image analysis processes , 1989, Comput. Vis. Graph. Image Process..

[13]  大田 友一,et al.  Knowledge-based interpretation of outdoor natural color scenes , 1985 .

[14]  Avinash C. Kak,et al.  A robot vision system for recognizing 3D objects in low-order polynomial time , 1989, IEEE Trans. Syst. Man Cybern..