Multimedia Fusion at Semantic Level in Vehicle Cooperactive Perception

The complete perception of the environment is an important prerequisite for realizing autonomous driving. However, due to the existence of obstructions and the limits of sensing range, perception ability of a single vehicle cannot meet the requirements of perception. In this paper, we attempt to expand the range of vehicle perception and eliminate blind areas by fusing multimedia based on vehicle network. Using a deep leaning method, we get a more in-depth understanding and extract critical information from vision system. We integrate multi-vehicle’s information into the ego-vehicle’s coordinate by merging unified bird-view maps, taking both graphic and semantic alignment as association. Furthermore, a transport protocol is designed to limit the information transmission amount. The experimental results on open data sets show that our approach can effectively fuse multimedia information in the multi-vehicle system, and improve the vehicle’s perception ability.

[1]  Xiaogang Wang,et al.  Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Emilio Frazzoli,et al.  Multivehicle Cooperative Driving Using Cooperative Perception: Design and Experimental Validation , 2015, IEEE Transactions on Intelligent Transportation Systems.

[3]  Fawzi Nashashibi,et al.  Cooperative Perception: Application in the Context of Outdoor Intelligent Vehicle Systems , 2012 .

[4]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  S. Ilgin Guler,et al.  Using connected vehicle technology to improve the efficiency of intersections , 2014 .

[6]  Marcus Obst,et al.  Multi-sensor data fusion for checking plausibility of V2V communications by vision-based multiple-object tracking , 2014, 2014 IEEE Vehicular Networking Conference (VNC).

[7]  Achim Kampker,et al.  Improving vehicle localization using semantic and pole-like landmarks , 2017, 2017 IEEE Intelligent Vehicles Symposium (IV).

[8]  Massimo Bertozzi,et al.  GOLD: a parallel real-time stereo vision system for generic obstacle and lane detection , 1998, IEEE Trans. Image Process..

[9]  Matthias Kranz,et al.  V2V Communications in Automotive Multi-Sensor Multi-Target Tracking , 2008, 2008 IEEE 68th Vehicular Technology Conference.