Efficient Road Scene Understanding for Intelligent Vehicles Using Compositional Hierarchical Models

In this paper, we present a novel compositional hierarchical framework for road scene understanding that allows for reliable estimation of scene topologies, such as the number, location, and width of lanes and the lane topology, i.e., parallel, splitting, or merging. In our approach, lanes and roads are represented in a hierarchical compositional model in which nodes represent parts of roads and edges represent probabilistic constraints between pairs of parts. A key benefit of our approach is the representation of lanes and roads as a set of common parts. This makes our approach applicable to scenes with rich topological diversity, while bringing along the much desired computational efficiency. To cope with the high-dimensional and continuous parameter space of our model and the non-Gaussian image evidence, we perform inference using nonparametric belief propagation. Based on this approximate inference algorithm, we introduce depth-first message passing for lane detection, which performs inference in several sweeps. Empirical results show that depth-first message passing requires significantly lower computation for performance comparable with classical belief propagation.

[1]  Inderjit S. Dhillon,et al.  Clustering on the Unit Hypersphere using von Mises-Fisher Distributions , 2005, J. Mach. Learn. Res..

[2]  Camillo J. Taylor,et al.  Stochastic Road Shape Estimation , 2001, ICCV.

[3]  W. Eric L. Grimson,et al.  Unsupervised Activity Perception by Hierarchical Bayesian Models , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Ernst D. Dickmanns,et al.  Recursive 3-D Road and Relative Ego-State Recognition , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Philip H. S. Torr,et al.  Combining Appearance and Structure from Motion Features for Road Scene Understanding , 2009, BMVC.

[6]  Jens Spehr,et al.  On hierarchical models for visual recognition and learning of objects, scenes, and activities , 2014 .

[7]  Sergiu Nedevschi,et al.  Probabilistic Lane Tracking in Difficult Road Scenarios Using Stereovision , 2009, IEEE Transactions on Intelligent Transportation Systems.

[8]  Antonio Torralba,et al.  Part and appearance sharing: Recursive Compositional Models for multi-view , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9]  Pierre Charbonnier,et al.  Evaluation of Road Marking Feature Extraction , 2008, 2008 11th International IEEE Conference on Intelligent Transportation Systems.

[10]  Antonio Torralba,et al.  Using the forest to see the trees: exploiting context for visual object detection and localization , 2010, CACM.

[11]  Erik B. Sudderth Graphical models for visual object recognition and tracking , 2006 .

[12]  William T. Freeman,et al.  Nonparametric belief propagation , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[13]  Alexander Zelinsky,et al.  Robust vision based lane tracking using multiple cues and particle filtering , 2003, IEEE IV2003 Intelligent Vehicles Symposium. Proceedings (Cat. No.03TH8683).

[14]  Luc Van Gool,et al.  What's going on? Discovering spatio-temporal dependencies in dynamic scenes , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15]  Mohan M. Trivedi,et al.  Video-based lane estimation and tracking for driver assistance: survey, system, and evaluation , 2006, IEEE Transactions on Intelligent Transportation Systems.

[16]  Bernt Schiele,et al.  Monocular 3D Scene Modeling and Inference: Understanding Multi-Object Traffic Scenes , 2010, ECCV.

[17]  Michael Isard,et al.  PAMPAS: real-valued graphical models for computer vision , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[18]  William T. Freeman,et al.  Correctness of Belief Propagation in Gaussian Graphical Models of Arbitrary Topology , 1999, Neural Computation.

[19]  Antonio Torralba,et al.  Using the Forest to See the Trees: A Graphical Model Relating Features, Objects, and Scenes , 2003, NIPS.

[20]  Michael I. Mandel,et al.  Distributed Occlusion Reasoning for Tracking with Nonparametric Belief Propagation , 2004, NIPS.

[21]  C. V. Jawahar,et al.  Scene Text Recognition using Higher Order Language Priors , 2009, BMVC.

[22]  Mohan M. Trivedi,et al.  This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS 1 Integrated Lane and Vehicle Detection, Localization, , 2022 .

[23]  P. J. Green,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[24]  Christoph Stiller,et al.  Kalman Particle Filter for lane recognition on rural roads , 2009, 2009 IEEE Intelligent Vehicles Symposium.

[25]  Friedrich M. Wahl,et al.  Hierarchical scene understanding for intelligent vehicles , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[26]  Lars C. Wolf,et al.  RoadGraph: High level sensor data fusion between objects and street network , 2011, 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[27]  Bernt Schiele,et al.  A Dynamic Conditional Random Field Model for Joint Labeling of Object and Scene Classes , 2008, ECCV.

[28]  Martin Lauer,et al.  3D Traffic Scene Understanding From Movable Platforms , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Ronen Lerner,et al.  Recent progress in road and lane detection: a survey , 2012, Machine Vision and Applications.

[30]  W. Eric L. Grimson,et al.  Unsupervised Activity Perception in Crowded and Complicated Scenes Using Hierarchical Bayesian Models , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Christoph Stiller,et al.  Efficient scene understanding for intelligent vehicles using a part-based road representation , 2013, 16th International IEEE Conference on Intelligent Transportation Systems (ITSC 2013).

[32]  Bernard W. Silverman,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[33]  Sidharth Bhatia,et al.  Tracking loose-limbed people , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[34]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[35]  U. Franks,et al.  Lane Recognition on Country Roads , 2007, 2007 IEEE Intelligent Vehicles Symposium.