The Role of Machine Vision for Intelligent Vehicles

Humans assimilate information from the traffic environment mainly through visual perception. Obviously, the dominant information required to conduct a vehicle can be acquired with visual sensors. However, in contrast to most other sensor principles, video signals contain relevant information in a highly indirect manner and hence visual sensing requires sophisticated machine vision and image understanding techniques. This paper provides an overview on the state of research in the field of machine vision for intelligent vehicles. The functional spectrum addressed covers the range from advanced driver assistance systems to autonomous driving. The organization of the article adopts the typical order in image processing pipelines that successively condense the rich information and vast amount of data in video sequences. Data-intensive low-level “early vision” techniques first extract features that are later grouped and further processed to obtain information of direct relevance for vehicle guidance. Recognition and classification schemes allow to identify specific objects in a traffic scene. Recently, semantic labeling techniques using convolutional neural networks have achieved impressive results in this field. High-level decisions of intelligent vehicles are often influenced by map data. The emerging role of machine vision in the mapping and localization process is illustrated at the example of autonomous driving. Scene representation methods are discussed that organize the information from all sensors and data sources and thus build the interface between perception and planning. Recently, vision benchmarks have been tailored to various tasks in traffic scene perception that provide a metric for the rich diversity of machine vision methods. Finally, the paper addresses computing architectures suited to real-time implementation. Throughout the paper, numerous specific examples and real world experiments with prototype vehicles are presented.

[1]  Hans-Joachim Wünsche,et al.  Odometry-based online extrinsic sensor calibration , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2]  Tobias Gindele,et al.  Bayesian Occupancy grid Filter for dynamic environments using prior map knowledge , 2009, 2009 IEEE Intelligent Vehicles Symposium.

[3]  Paul Newman,et al.  From dusk till dawn: Localisation at night using artificial light sources , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[4]  Julius Ziegler,et al.  Sparse scene flow segmentation for moving object detection in urban environments , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[5]  Andreas Geiger,et al.  Object scene flow for autonomous vehicles , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Mohan M. Trivedi,et al.  Vision for Looking at Traffic Lights: Issues, Survey, and Perspectives , 2016, IEEE Transactions on Intelligent Transportation Systems.

[7]  Gerd Wanielik,et al.  Vehicle localization in urban environments using feature maps and aerial images , 2011, 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[8]  Oliver Denninger,et al.  A Stream Processing Framework for On-Line Optimization of Performance and Energy Efficiency on Heterogeneous Systems , 2014, 2014 IEEE International Parallel & Distributed Processing Symposium Workshops.

[9]  Amnon Shashua,et al.  Vision-based ACC with a single camera: bounds on range and range rate accuracy , 2003, IEEE IV2003 Intelligent Vehicles Symposium. Proceedings (Cat. No.03TH8683).

[10]  Massimo Bertozzi,et al.  A real-time oriented system for vehicle detection , 1997, J. Syst. Archit..

[11]  F. Xavier Roca,et al.  Toward Real-Time Pedestrian Detection Based on a Deformable Template Model , 2014, IEEE Transactions on Intelligent Transportation Systems.

[12]  Michael J. Black,et al.  A Quantitative Analysis of Current Practices in Optical Flow Estimation and the Principles Behind Them , 2013, International Journal of Computer Vision.

[13]  Bill Triggs,et al.  Scene Segmentation with CRFs Learned from Partially Labeled Images , 2007, NIPS.

[14]  Stefan K. Gehrig,et al.  Exploiting the Power of Stereo Confidences , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Keiichi Uchimura,et al.  Driver Inattention Monitoring System for Intelligent Vehicles: A Review , 2009, IEEE Transactions on Intelligent Transportation Systems.

[16]  Markus Maurer,et al.  Forward Collision Warning and Avoidance , 2012 .

[17]  Vincent Lepetit,et al.  BRIEF: Binary Robust Independent Elementary Features , 2010, ECCV.

[18]  Bernt Schiele,et al.  Ten Years of Pedestrian Detection, What Have We Learned? , 2014, ECCV Workshops.

[19]  Hermann Winner,et al.  Three Decades of Driver Assistance Systems: Review and Future Perspectives , 2014, IEEE Intelligent Transportation Systems Magazine.

[20]  Daniel P. Huttenlocher,et al.  Efficient Belief Propagation for Early Vision , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[21]  Markus Maurer,et al.  A compact vision system for road vehicle guidance , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[22]  Ernst D. Dickmanns,et al.  Recursive 3-D Road and Relative Ego-State Recognition , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Hans-Hellmut Nagel,et al.  FhG-Co-driver: From map-guided automatic driving by machine vision to a cooperative driver support , 1995 .

[24]  Zehang Sun,et al.  On-road vehicle detection using Gabor filters and support vector machines , 2002, 2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628).

[25]  Julius Ziegler,et al.  Making Bertha Drive—An Autonomous Journey on a Historic Route , 2014, IEEE Intelligent Transportation Systems Magazine.

[26]  Julian Eggert,et al.  A multi-block-matching approach for stereo , 2015, 2015 IEEE Intelligent Vehicles Symposium (IV).

[27]  Dizan Vasquez,et al.  A survey on motion prediction and risk assessment for intelligent vehicles , 2014, ROBOMECH Journal.

[28]  Ralf G. Herrtwich,et al.  Making Bertha See , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[29]  William Whittaker,et al.  Autonomous driving in urban environments: Boss and the Urban Challenge , 2008, J. Field Robotics.

[30]  Francisco Angel Moreno,et al.  The Málaga urban dataset: High-rate stereo and LiDAR in a realistic urban scenario , 2014, Int. J. Robotics Res..

[31]  Dariu Gavrila,et al.  UvA-DARE ( Digital Academic Repository ) Pedestrian Path Prediction with Recursive Bayesian Filters : A Comparative Study , 2013 .

[32]  Xi Wang,et al.  High-Resolution Stereo Datasets with Subpixel-Accurate Ground Truth , 2014, GCPR.

[33]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[34]  Stefan Roth,et al.  Stixmantics: A Medium-Level Model for Real-Time Semantic Scene Understanding , 2014, ECCV.

[35]  Luc Van Gool,et al.  Sparse Flow: Sparse Matching for Small to Large Displacement Optical Flow , 2015, 2015 IEEE Winter Conference on Applications of Computer Vision.

[36]  Stefan Roth,et al.  Efficient Multi-cue Scene Segmentation , 2013, GCPR.

[37]  C. Stiller,et al.  Vehicle detection fusing 2D visual features , 2004, IEEE Intelligent Vehicles Symposium, 2004.

[38]  Ronen Lerner,et al.  Recent progress in road and lane detection: a survey , 2012, Machine Vision and Applications.

[39]  Jiri Matas,et al.  A system for real-time detection and tracking of vehicles from a single car-mounted camera , 2012, 2012 15th International IEEE Conference on Intelligent Transportation Systems.

[40]  Christian Hoffmann,et al.  Continuous Stereo Self-Calibration by Camera Parameter Tracking , 2009, IEEE Transactions on Image Processing.

[41]  Tong Boon Tang,et al.  Vehicle Detection Techniques for Collision Avoidance Systems: A Review , 2015, IEEE Transactions on Intelligent Transportation Systems.

[42]  Tomaso A. Poggio,et al.  Example-Based Object Detection in Images by Components , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[43]  Massimo Bertozzi,et al.  GOLD: a parallel real-time stereo vision system for generic obstacle and lane detection , 1998, IEEE Trans. Image Process..

[44]  Uwe Franke,et al.  Efficient representation of traffic scenes by means of dynamic stixels , 2010, 2010 IEEE Intelligent Vehicles Symposium.

[45]  Stefan K. Gehrig,et al.  A Real-Time Low-Power Stereo Vision Engine Using Semi-Global Matching , 2009, ICVS.

[46]  H. Hirschmüller Accurate and Efficient Stereo Processing by Semi-Global Matching and Mutual Information , 2005, CVPR.

[47]  Todd Jochem,et al.  Rapidly Adapting Machine Vision for Automated Vehicle Steering , 1996, IEEE Expert.

[48]  Sebastian Ramos,et al.  The Cityscapes Dataset , 2015, CVPR 2015.

[49]  Sebastian Thrun,et al.  Robust vehicle localization in urban environments using probabilistic maps , 2010, 2010 IEEE International Conference on Robotics and Automation.

[50]  Mohan M. Trivedi,et al.  Looking at Vehicles on the Road: A Survey of Vision-Based Vehicle Detection, Tracking, and Behavior Analysis , 2013, IEEE Transactions on Intelligent Transportation Systems.

[51]  Benjamin Ranft,et al.  Modeling arbitrarily oriented slanted planes for efficient stereo vision based on block matching , 2014, 17th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[52]  Huijing Zhao,et al.  Information Fusion on Oversegmented Images: An Application for Urban Scene Understanding , 2013, MVA.

[53]  Thomas Kalinke,et al.  A TEXTURE-BASED OBJECT DETECTION DETECTION AND AN ADAPTIVE MODEL-BASED CLASSIFICATION , 1998 .

[54]  Julius Ziegler,et al.  StereoScan: Dense 3d reconstruction in real-time , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[55]  Julius Ziegler,et al.  Calibrating multiple cameras with non-overlapping views using coded checkerboard targets , 2014, 17th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[56]  Olga Veksler,et al.  Fast approximate energy minimization via graph cuts , 2001, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[57]  Henning Lategahn,et al.  Trinocular optical flow estimation for intelligent vehicle applications , 2012, 2012 15th International IEEE Conference on Intelligent Transportation Systems.

[58]  Martin Lauer,et al.  Minimizing Odometry Drift by Vanishing Direction References , 2015 .

[59]  Philippe Bonnifait,et al.  iGPS: Global Positioning in Urban Canyons with Road Surface Maps , 2012, IEEE Intelligent Transportation Systems Magazine.

[60]  Sebastian Thrun,et al.  Stanley: The robot that won the DARPA Grand Challenge , 2006, J. Field Robotics.

[61]  Marc Pollefeys,et al.  Combining Monocular and Stereo Cues for Mobile Robot Localization Using Visual Words , 2010, 2010 20th International Conference on Pattern Recognition.

[62]  Daniel Cremers,et al.  B-Spline Modeling of Road Surfaces With an Application to Free-Space Estimation , 2009, IEEE Transactions on Intelligent Transportation Systems.

[63]  Cordelia Schmid,et al.  EpicFlow: Edge-preserving interpolation of correspondences for optical flow , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[64]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[65]  Lawrence Ulrich Top ten tech cars , 2014, IEEE Spectrum.

[66]  Reinhold Behringer,et al.  The seeing passenger car 'VaMoRs-P' , 1994, Proceedings of the Intelligent Vehicles '94 Symposium.

[67]  Andrew Zisserman,et al.  Multiple View Geometry in Computer Vision (2nd ed) , 2003 .

[68]  Scott Krig,et al.  Computer Vision Metrics: Survey, Taxonomy, and Analysis , 2014 .

[69]  Mohan M. Trivedi,et al.  Looking-in and looking-out vision for Urban Intelligent Assistance: Estimation of driver attentive state and dynamic surround for safe merging and braking , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[70]  Dariu Gavrila,et al.  PedCut: an iterative framework for pedestrian segmentation combining shape models and multiple data cues , 2013, BMVC.

[71]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[72]  Henning Lategahn,et al.  DIRD is an illumination robust descriptor , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[73]  Zsombor Paroczi,et al.  Real-time traffic sign recognition system , 2011, 2011 2nd International Conference on Cognitive Infocommunications (CogInfoCom).

[74]  Peter Norvig,et al.  Artificial intelligence - a modern approach, 2nd Edition , 2003, Prentice Hall series in artificial intelligence.

[75]  Véronique Berge-Cherfaoui,et al.  Credibilist occupancy grids for vehicle perception in dynamic environments , 2011, 2011 IEEE International Conference on Robotics and Automation.

[76]  Marina L. Gavrilova,et al.  Combining dense features with interest regions for efficient part-based image matching , 2014, 2014 International Conference on Computer Vision Theory and Applications (VISAPP).

[77]  Jay A. Farrell,et al.  High-precision lane-level road map building for vehicle navigation , 2010, IEEE/ION Position, Location and Navigation Symposium.

[78]  Raquel Urtasun,et al.  Robust Monocular Epipolar Flow Estimation , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[79]  Kurt Konolige,et al.  Towards lifelong visual maps , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[80]  Uwe Franke,et al.  B-spline-based road model for 3d lane recognition , 2010, 13th International IEEE Conference on Intelligent Transportation Systems.

[81]  Pierre Vandergheynst,et al.  FREAK: Fast Retina Keypoint , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[82]  Klaus C. J. Dietmayer,et al.  Fusion of laser and radar sensor data with a sequential Monte Carlo Bayesian occupancy filter , 2015, 2015 IEEE Intelligent Vehicles Symposium (IV).

[83]  C. Stiller,et al.  Situation Assessment and Trajectory Planning for AnnieWAY , 2011 .

[84]  Richard O. Duda,et al.  Use of the Hough transformation to detect lines and curves in pictures , 1972, CACM.

[85]  Reinhard Klette,et al.  Hierarchical Scan-Line Dynamic Programming for Optical Flow Using Semi-Global Matching , 2012, ACCV Workshops.

[86]  David Gerónimo Gómez,et al.  Survey of Pedestrian Detection for Advanced Driver Assistance Systems , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[87]  Andreas Geiger,et al.  Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[88]  Julie M. Harris Early vision and beyond , 1996, Trends in Neurosciences.

[89]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[90]  Julius Ziegler,et al.  Video based localization for Bertha , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[91]  Constantine Papageorgiou,et al.  A trainable system for object detection in images and video sequences , 2000 .

[92]  Andreas Geiger,et al.  Displets: Resolving stereo ambiguities using object knowledge , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[93]  Sergiu Nedevschi,et al.  Direct formulas for stereo-based visual odometry error modeling , 2015, 2015 IEEE International Conference on Intelligent Computer Communication and Processing (ICCP).

[94]  Andreas Geiger,et al.  Efficient Large-Scale Stereo Matching , 2010, ACCV.

[95]  Wolfram Burgard,et al.  A Tutorial on Graph-Based SLAM , 2010, IEEE Intelligent Transportation Systems Magazine.

[96]  Reinhard Klette,et al.  Ground Truth Evaluation of Stereo Algorithms for Real World Applications , 2010, ACCV Workshops.

[97]  Jan-Michael Frahm,et al.  Real-Time Plane-Sweeping Stereo with Multiple Sweeping Directions , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[98]  Ryan M. Eustice,et al.  Ford Campus vision and lidar data set , 2011, Int. J. Robotics Res..

[99]  Avshalom Suissa,et al.  The Daimler-Benz steering assistant: a spin-off from autonomous driving , 1994, Proceedings of the Intelligent Vehicles '94 Symposium.

[100]  Tomaso A. Poggio,et al.  A Trainable System for Object Detection , 2000, International Journal of Computer Vision.

[101]  Raquel Urtasun,et al.  Efficient Joint Segmentation, Occlusion Labeling, Stereo and Flow Estimation , 2014, ECCV.

[102]  Thomas B. Moeslund,et al.  Vision-Based Traffic Sign Detection and Analysis for Intelligent Driver Assistance Systems: Perspectives and Survey , 2012, IEEE Transactions on Intelligent Transportation Systems.

[103]  Horst Bischof,et al.  A Duality Based Approach for Realtime TV-L1 Optical Flow , 2007, DAGM-Symposium.

[104]  J.E. Gayko,et al.  Development, evaluation and introduction of a lane keeping assistance system , 2004, IEEE Intelligent Vehicles Symposium, 2004.

[105]  M. Amaç Güvensan,et al.  Driver Behavior Analysis for Safe Driving: A Survey , 2015, IEEE Transactions on Intelligent Transportation Systems.

[106]  Eike Rehder,et al.  Head detection and orientation estimation for pedestrian safety , 2014, 17th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[107]  Richard Szeliski,et al.  High-accuracy stereo depth maps using structured light , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[108]  Ralf Kohlhaas,et al.  Towards a unified traffic situation estimation model — Street-dependent behaviour and motion models , 2015, 2015 18th International Conference on Information Fusion (Fusion).

[109]  Romain Dupont,et al.  A General Dense Image Matching Framework Combining Direct and Feature-Based Costs , 2013, 2013 IEEE International Conference on Computer Vision.

[110]  Miguel Ángel Sotelo,et al.  Fast Road Sign Detection Using Hough Transform for Assisted Driving of Road Vehicles , 2005, EUROCAST.

[111]  Konrad Schindler,et al.  Piecewise Rigid Scene Flow , 2013, 2013 IEEE International Conference on Computer Vision.

[112]  Timo Schönwald,et al.  Parallel matching-based estimation - a case study on three different hardware architectures , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[113]  Dariu M. Gavrila,et al.  Steering and evasion assist , 2012 .

[114]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[115]  Pietro Perona,et al.  Pedestrian Detection: An Evaluation of the State of the Art , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[116]  Martin Lauer,et al.  Robust scale estimation for monocular visual odometry using structure from motion and vanishing points , 2015, 2015 IEEE Intelligent Vehicles Symposium (IV).

[117]  Pierre Charbonnier,et al.  Evaluation of Road Marking Feature Extraction , 2008, 2008 11th International IEEE Conference on Intelligent Transportation Systems.

[118]  Olivier Aycard,et al.  Detection, classification and tracking of moving objects in a 3D environment , 2012, 2012 IEEE Intelligent Vehicles Symposium.

[119]  Julian Eggert,et al.  Block-matching stereo with relaxed fronto-parallel assumption , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[120]  Noah Snavely,et al.  NYC3DCars: A Dataset of 3D Vehicles in Geographic Context , 2013, 2013 IEEE International Conference on Computer Vision.

[121]  Andrew W. Fitzgibbon,et al.  SphereFlow: 6 DoF Scene Flow from RGB-D Pairs , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[122]  Daniel Cremers,et al.  Efficient Dense Scene Flow from Sparse or Dense Stereo Data , 2008, ECCV.

[123]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[124]  Christoph Stiller,et al.  Automated map generation from aerial images for precise vehicle localization , 2010, 13th International IEEE Conference on Intelligent Transportation Systems.

[125]  Julius Ziegler,et al.  Lanelets: Efficient map representation for autonomous driving , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[126]  Wolfgang Rosenstiel,et al.  STELLaR - A case-study on SysTEmaticaLLy embedding a Traffic Light Recognition , 2014, 17th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[127]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[128]  Yann LeCun,et al.  Road Scene Segmentation from a Single Image , 2012, ECCV.

[129]  Roberto Manduchi,et al.  Bilateral filtering for gray and color images , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[130]  Samuel Williams,et al.  The Landscape of Parallel Computing Research: A View from Berkeley , 2006 .

[131]  Henning Lategahn,et al.  Vision-Only Localization , 2014, IEEE Transactions on Intelligent Transportation Systems.

[132]  Tarak Gandhi,et al.  Pedestrian Protection Systems: Issues, Survey, and Challenges , 2007, IEEE Transactions on Intelligent Transportation Systems.

[133]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[134]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.