A Reinforcement Learning Approach of Data Forwarding in Vehicular Networks

As the basis of vehicle ad hoc networks, the method of forwarding data is one of the most important parts which ensures the stability and efficiency of network communication. However, the high-speed mobile vehicle nodes cause frequent changes of network topology and disconnections of network links, casting a big challenge to the performance of network data delivery. Data forwarding methods based on the prior knowledge of vehicle’s trajectory are difficult to adapt to the changing vehicle trajectory in real world applications, while getting destination vehicles’ positions in broadcast way are extremely costly. To solve the above problems, we have proposed an association state based optimized data forwarding method (ASODF) with the assistance of low loaded road side units (RSU). The proposed method maps the urban road network into a directed graph, utilizes the carry-forward mechanism and decomposes the data transmission into decision-making data forwarding at intersections and data delivery on roads. The vehicles carried data combine the destination nodes locations obtained by low loaded road side units and their locations into association states, and the association state optimization problem is formalized as a Reinforcement Learning problem with Markov Decision Process (MDP). We utilized the value iteration scheme to figure out the delay-optimal policy, which is further used to forward data packets to obtain the best delay of data transmission. Experiments based on a real vehicle trajectory data set demonstrate the effectiveness of our model ASODF.

[1]  Mohsen Guizani,et al.  An effective key management scheme for heterogeneous sensor networks , 2007, Ad Hoc Networks.

[2]  Jaehoon Jeong,et al.  TBD: Trajectory-Based Data Forwarding for Light-Traffic Vehicular Networks , 2009, 2009 29th IEEE International Conference on Distributed Computing Systems.

[3]  Joelle Pineau,et al.  Point-based value iteration: An anytime algorithm for POMDPs , 2003, IJCAI.

[4]  Jaehoon Jeong,et al.  Utilizing shared vehicle trajectories for data forwarding in vehicular networks , 2011, 2011 Proceedings IEEE INFOCOM.

[5]  Brad Karp,et al.  GPSR: greedy perimeter stateless routing for wireless networks , 2000, MobiCom '00.

[6]  Song Chong,et al.  Delay-optimal data forwarding in Vehicular Sensor Networks , 2012, 2013 11th International Symposium and Workshops on Modeling and Optimization in Mobile, Ad Hoc and Wireless Networks (WiOpt).

[7]  David A. Wagner,et al.  Security in wireless sensor networks , 2004, SASN '04.

[8]  Cecilia Mascolo,et al.  GeOpps: Geographical Opportunistic Routing for Vehicular Networks , 2007, 2007 IEEE International Symposium on a World of Wireless, Mobile and Multimedia Networks.

[9]  Davide Frey,et al.  Towards lightweight information dissemination in inter-vehicular networks , 2006, VANET '06.

[10]  Philippe Jacquet,et al.  Optimized Link State Routing Protocol (OLSR) , 2003, RFC.

[11]  Li Chen,et al.  MGF:Mobile Gateway Based Forwarding for Infrastructure-to-Vehicle Data Delivery in Vehicular Ad Hoc Networks , 2012 .

[12]  Li Xiao,et al.  SADV: Static-Node-Assisted Adaptive Data Dissemination in Vehicular Networks , 2010, IEEE Transactions on Vehicular Technology.

[13]  Xiaojiang Du,et al.  A survey of key management schemes in wireless sensor networks , 2007, Comput. Commun..

[14]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[15]  Jaehoon Jeong,et al.  TSF: Trajectory-Based Statistical Forwarding for Infrastructure-to-Vehicle Data Delivery in Vehicular Networks , 2010, 2010 IEEE 30th International Conference on Distributed Computing Systems.

[16]  Martin Mauve,et al.  A routing strategy for vehicular ad hoc networks in city environments , 2003, IEEE IV2003 Intelligent Vehicles Symposium. Proceedings (Cat. No.03TH8683).

[17]  Chase Qishi Wu,et al.  Secure cell relay routing protocol for sensor networks , 2005, PCCC 2005. 24th IEEE International Performance, Computing, and Communications Conference, 2005..

[18]  Charles E. Perkins,et al.  Highly Dynamic Destination-Sequenced Distance-Vector Routing (DSDV) for mobile computers , 1994, SIGCOMM.

[19]  Xiaojiang Du,et al.  Security in wireless sensor networks , 2008, IEEE Wireless Communications.

[20]  Bo Li,et al.  Trajectory improves data delivery in vehicular networks , 2011, 2011 Proceedings IEEE INFOCOM.

[21]  Mohsen Guizani,et al.  Transactions papers a routing-driven Elliptic Curve Cryptography based key management scheme for Heterogeneous Sensor Networks , 2009, IEEE Transactions on Wireless Communications.

[22]  Jing Zhao,et al.  VADD: Vehicle-Assisted Data Delivery in Vehicular Ad Hoc Networks , 2008, IEEE Trans. Veh. Technol..

[23]  Hassan Artail,et al.  Performance analysis of routing in VANETs using the RSU network , 2011, 2011 IEEE 7th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob).

[24]  Mohsen Guizani,et al.  Secure and Efficient Time Synchronization in Heterogeneous Sensor Networks , 2008, IEEE Transactions on Vehicular Technology.

[25]  Secure cell relay routing protocol for sensor networks: Research Articles , 2006 .

[26]  R. Bellman A Markovian Decision Process , 1957 .

[27]  David A. Maltz,et al.  DSR: the dynamic source routing protocol for multihop wireless ad hoc networks , 2001 .

[28]  Xu Li,et al.  Performance Evaluation of SUVnet With Real-Time Traffic Data , 2007, IEEE Transactions on Vehicular Technology.