论文信息 - Approximate Representation of Unknown Objects with a Single-line Scanning Lidar and a Video Camera

Approximate Representation of Unknown Objects with a Single-line Scanning Lidar and a Video Camera

Models are useful for many computer vision tasks, such as object detection, recognition, and tracking. Computer vision tasks must handle situations where unknown objects appear and must detect and track some object which is not in the trained database. In such cases, the system must learn or, otherwise derive, descriptions of new objects. In this dissertation, we investigate creating a representation of previously unknown objects that newly appear in the scene. The representation is to create a viewpoint-invariant and scale-normalized model approximately describing an unknown object. Those properties of the representation facilitate 3D tracking of the object using 2D-to-2D image matching. The representation has both benefits of an implicit model (referred to as a view-based model) and an explicit model (referred to as a shape-based model). The object representation is created using multi-modal sensors. We illustrate the benefits of the object representation with two applications: object detection and 3D tracking. We extend the object representation to an explicit model by imposing a shape prior and combining two existing approaches.

Takeo Kanade | Ki Ho Kwak | K. Kwak

[1] Jean-Yves Bouguet,et al. Camera calibration toolbox for matlab , 2001 .

[2] Andrew Zisserman,et al. Multiple View Geometry in Computer Vision (2nd ed) , 2003 .

[3] Dorin Comaniciu,et al. Kernel-Based Object Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[4] Yunhui Liu,et al. An algorithm for extrinsic parameters calibration of a camera and a laser range finder using line features , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[5] Takeo Kanade,et al. A Theory of Origami World , 1979, Artif. Intell..

[6] Harry Shum,et al. Background Cut , 2006, ECCV.

[7] Cordelia Schmid,et al. 3D object modeling and recognition using affine-invariant patches and multi-view spatial constraints , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[8] Richard Szeliski,et al. Interactive 3D architectural modeling from unordered photo collections , 2008, ACM Trans. Graph..

[9] Pietro Perona,et al. Evaluation of Features Detectors and Descriptors based on 3D Objects , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[10] Richard Szeliski,et al. Computer Vision - Algorithms and Applications , 2011, Texts in Computer Science.

[11] David J. Fleet,et al. Robust Online Appearance Models for Visual Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[12] Hans-Hellmut Nagel,et al. Initialization of Model-Based Vehicle Tracking in Video Sequences of Inner-City Intersections , 2007, International Journal of Computer Vision.

[13] B. Roysam,et al. Automated Cell Lineage Construction: A Rapid Method to Analyze Clonal Development Established with Murine Neural Progenitor Cells , 2006, Cell cycle.

[14] Katsushi Ikeuchi,et al. Object shape and reflectance modeling from observation , 1997, SIGGRAPH.

[15] Stephen P. Boyd,et al. Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[16] Hans-Hellmut Nagel,et al. Combination of Edge Element and Optical Flow Estimates for 3D-Model-Based Vehicle Tracking in Traffic Image Sequences , 1999, International Journal of Computer Vision.

[17] Joseph L. Mundy,et al. Object Recognition in the Geometric Era: A Retrospective , 2006, Toward Category-Level Object Recognition.

[18] Vincent Lepetit,et al. Monocular Model-Based 3D Tracking of Rigid Objects: A Survey , 2005, Found. Trends Comput. Graph. Vis..

[19] Wolfram Burgard,et al. Using Boosted Features for the Detection of People in 2D Range Data , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[20] Tieniu Tan,et al. 3-D model-based vehicle tracking , 2005, IEEE Transactions on Image Processing.

[21] James L. Crowley,et al. Measuring Image Flow By Tracking Edge-lines , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[22] Richard Bowden,et al. Simultaneous modeling and tracking (SMAT) of feature sets , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[23] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.

[24] David G. Lowe,et al. Local feature view clustering for 3D object recognition , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[25] Geovany de Araújo Borges,et al. Line Extraction in 2D Range Images for Mobile Robotics , 2004, J. Intell. Robotic Syst..

[26] FuaPascal,et al. Monocular model-based 3D tracking of rigid objects , 2005 .

[27] Marie-Pierre Jolly,et al. Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[28] Tomaso Poggio,et al. Models of object recognition , 2000, Nature Neuroscience.

[29] Takeo Kanade,et al. Boundary detection based on supervised learning , 2010, 2010 IEEE International Conference on Robotics and Automation.

[30] Alex Pentland,et al. Modal Representations , 1994, Object Representation in Computer Vision.

[31] Vladimir N. Vapnik,et al. The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[32] Mubarak Shah,et al. A noniterative greedy algorithm for multiframe point correspondence , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33] Sven J. Dickinson,et al. Object Representation and Recognition , 1999 .

[34] Victor S. Lempitsky,et al. Seamless Mosaicing of Image-Based Texture Maps , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[35] Robert A. MacLachlan,et al. Tracking Moving Objects From a Moving Vehicle Using a Laser Scanner , 2006 .

[36] Paul A. Viola,et al. Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[37] Y. Bar-Shalom. Tracking and data association , 1988 .

[38] Takeo Kanade,et al. An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[39] Roland Siegwart,et al. A comparison of line extraction algorithms using 2D range data for indoor mobile robotics , 2007, Auton. Robots.

[40] Mubarak Shah,et al. 3D Model based Object Class Detection in An Arbitrary View , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[41] Olivier Strauss,et al. Calibration of a multi-sensor system laser rangefinder/camera , 1995, Proceedings of the Intelligent Vehicles '95. Symposium.

[42] Takeo Kanade,et al. Extrinsic calibration of a single line scanning lidar and a camera , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[43] David H. Douglas,et al. ALGORITHMS FOR THE REDUCTION OF THE NUMBER OF POINTS REQUIRED TO REPRESENT A DIGITIZED LINE OR ITS CARICATURE , 1973 .

[44] Zhengyou Zhang,et al. Iterative point matching for registration of free-form curves and surfaces , 1994, International Journal of Computer Vision.

[45] Sebastian Thrun,et al. Model based vehicle detection and tracking for autonomous urban driving , 2009, Auton. Robots.

[46] Andrew E. Johnson,et al. Registration and integration of textured 3-D data , 1997, Proceedings. International Conference on Recent Advances in 3-D Digital Imaging and Modeling (Cat. No.97TB100134).

[47] Berthold K. P. Horn,et al. Determining Optical Flow , 1981, Other Conferences.

[48] Pólo de Coimbra,et al. Segmentation and Geometric Primitives Extraction from 2D Laser Range Data for Mobile Robot Applications , 2005 .

[49] Vincent Lepetit,et al. Gradient Response Maps for Real-Time Detection of Textureless Objects , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[50] David L. Waltz,et al. Generating Semantic Descriptions From Drawings of Scenes With Shadows , 1972 .

[51] Neil A. Thacker,et al. The Bhattacharyya metric as an absolute similarity measure for frequency coded data , 1998, Kybernetika.

[52] Alan K. Mackworth. Interpreting Pictures of Polyhedral Scenes , 1973, IJCAI.

[53] Donald Reid. An algorithm for tracking multiple targets , 1978 .

[54] Richard O. Duda,et al. Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[55] S.S. Blackman,et al. Multiple hypothesis tracking for multiple target tracking , 2004, IEEE Aerospace and Electronic Systems Magazine.

[56] Cristiano Premebida. Segmentation and Geometric Primitives Extraction from 2D Laser Range Data for Mobile Robot Applications , 2005 .

[57] Robert T. Collins,et al. On-the-fly Object Modeling while Tracking , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[58] Takeo Kanade,et al. A robust shape model for multi-view car alignment , 2009, CVPR.

[59] Kenneth Levenberg. A METHOD FOR THE SOLUTION OF CERTAIN NON – LINEAR PROBLEMS IN LEAST SQUARES , 1944 .

[60] Cristiano Premebida,et al. LIDAR and vision‐based pedestrian detection system , 2009, J. Field Robotics.

[61] Takeo Kanade,et al. A robust shape model for multi-view car alignment , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[62] Roland Siegwart,et al. Human detection using multimodal and multidimensional features , 2008, 2008 IEEE International Conference on Robotics and Automation.

[63] Hans-Hellmut Nagel,et al. Model-based object tracking in monocular image sequences of road traffic scenes , 1993, International Journal of Computer 11263on.

[64] Kevin Cannons,et al. A Review of Visual Tracking , 2008 .

[65] Luc Van Gool,et al. Coupled Object Detection and Tracking from Static Cameras and Moving Vehicles , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[66] Takeo Kanade,et al. A statistical method for 3D object detection applied to faces and cars , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[67] Takeo Kanade,et al. Cell population tracking and lineage construction with spatiotemporal context , 2008, Medical Image Anal..

[68] Takeo Kanade,et al. Recovery of the Three-Dimensional Shape of an Object from a Single View , 1981, Artif. Intell..

[69] Bernhard P. Wrobel,et al. Multiple View Geometry in Computer Vision , 2001 .

[70] Ingemar J. Cox,et al. A review of statistical data association techniques for motion correspondence , 1993, International Journal of Computer Vision.

[71] Daniel Scharstein,et al. Matching images by comparing their gradient fields , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[72] Frederick R. Forst,et al. On robust estimation of the location parameter , 1980 .

[73] Cor J. Veenman,et al. Resolving Motion Correspondence for Densely Moving Points , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[74] Thomas K. Peucker,et al. 2. Algorithms for the Reduction of the Number of Points Required to Represent a Digitized Line or its Caricature , 2011 .

[75] Luc Van Gool,et al. Object Detection and Tracking for Autonomous Navigation in Dynamic Environments , 2010, Int. J. Robotics Res..

[76] Radford M. Neal. Pattern Recognition and Machine Learning , 2007, Technometrics.

[77] B. Eckardt. What Is Cognitive Science , 1992 .

[78] Cordelia Schmid,et al. 3D Object Modeling and Recognition Using Local Affine-Invariant Image Descriptors and Multi-View Spatial Constraints , 2006, International Journal of Computer Vision.

[79] Wolfram Burgard,et al. Classifying dynamic objects , 2009, Auton. Robots.

[80] Andrew W. Fitzgibbon,et al. Unwrap mosaics: a new representation for video editing , 2008, ACM Trans. Graph..