Pixels, Stixels, and Objects

Dense stereo vision has evolved into a powerful foundation for the next generation of intelligent vehicles. The high spatial and temporal resolution allows for robust obstacle detection in complex inner city scenarios, including pedestrian recognition and detection of partially hidden moving objects. Aiming at a vision architecture for efficiently solving an increasing number of vision tasks, the medium-level representation named Stixel World has been developed. This paper shows how this representation forms the foundation for a very efficient, robust and comprehensive understanding of traffic scenes. A recently proposed Stixel computation scheme allows the extraction of multiple objects per image column and generates a segmentation of the input data. The motion of the Stixels is obtained by applying the 6D-Vision principle to track Stixels over time. Subsequently, this allows for an optimal Stixel grouping such that all dynamic objects can be detected easily. Pose and motion of moving Stixel groups are used to initialize more specific object trackers. Moreover, appearance-based object recognition highly benefits from the attention control offered by the Stixel World, both in performance and efficiency.

[1]  Uwe Franke,et al.  Stixmentation - Probabilistic Stixel based Traffic Scene Labeling , 2012, BMVC.

[2]  Uwe Franke,et al.  The Stixel World - A Compact Medium Level Representation of the 3D-World , 2009, DAGM-Symposium.

[3]  Viii Supervisor Sonar-Based Real-World Mapping and Navigation , 2001 .

[4]  Uwe Franke,et al.  Efficient representation of traffic scenes by means of dynamic stixels , 2010, 2010 IEEE Intelligent Vehicles Symposium.

[5]  Simon Lacroix,et al.  Digital elevation map building from low altitude stereo imagery , 2002, Robotics Auton. Syst..

[6]  Jan-Michael Frahm,et al.  3D Reconstruction Using an n-Layer Heightmap , 2010, DAGM-Symposium.

[7]  Timo Milbich,et al.  May I enter the roundabout? A time-to-contact computation based on stereo-vision , 2012, 2012 IEEE Intelligent Vehicles Symposium.

[8]  Uwe Franke,et al.  Towards a Global Optimal Multi-Layer Stixel Representation of Dense 3D Data , 2011, BMVC.

[9]  Alexei A. Efros,et al.  Geometric context from a single image , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[10]  Markus Enzweiler,et al.  Efficient Stixel-based object recognition , 2012, 2012 IEEE Intelligent Vehicles Symposium.

[11]  Vladimir Kolmogorov,et al.  Minimizing Nonsubmodular Functions with Graph Cuts-A Review , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Olga Veksler,et al.  Fast approximate energy minimization via graph cuts , 2001, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[13]  Takeo Kanade,et al.  High-Resolution Terrain Map from Multiple Sensor Data , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Allen R. Hanson,et al.  Computer Vision Systems , 1978 .

[15]  Sean R Eddy,et al.  What is dynamic programming? , 2004, Nature Biotechnology.

[16]  Uwe Franke,et al.  6D-Vision: Fusion of Stereo and Motion for Robust Environment Perception , 2005, DAGM-Symposium.

[17]  Uwe Franke,et al.  Vehicle Tracking at Urban Intersections Using Dense Stereo , 2009 .

[18]  Olga Veksler,et al.  Order-Preserving Moves for Graph-Cut-Based Optimization , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Stefan K. Gehrig,et al.  A Real-Time Low-Power Stereo Vision Engine Using Semi-Global Matching , 2009, ICVS.

[20]  John B. Shoven,et al.  I , Edinburgh Medical and Surgical Journal.

[21]  Rudolf Mester,et al.  Free Space Computation Using Stochastic Occupancy Grids and Dynamic Programming , 2008 .

[22]  Sergiu Nedevschi,et al.  Road Surface and Obstacle Detection Based on Elevation Maps from Dense Stereo , 2007, 2007 IEEE Intelligent Transportation Systems Conference.

[23]  Wolfgang Förstner,et al.  Curb reconstruction using Conditional Random Fields , 2010, 2010 IEEE Intelligent Vehicles Symposium.

[24]  Sergiu Nedevschi,et al.  Curb detection for driving assistance systems: A cubic spline-based approach , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[25]  Olga Veksler,et al.  Tiered scene labeling with dynamic programming , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[26]  Luc Van Gool,et al.  Stixels estimation without depth map computation , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[27]  Hans P. Moravec Robot spatial perception by stereoscopic vision and 3D evidence grids , 1996 .