论文信息 - Distributed Reinforcement Learning Approach for Vehicular Ad Hoc Networks

Distributed Reinforcement Learning Approach for Vehicular Ad Hoc Networks

In Vehicular Ad hoc Networks (VANETs), general purpose ad hoc routing protocols such as AODV cannot work efficiently due to the frequent changes in network topology caused by vehicle movement. This paper proposes a VANET routing protocol QLAODV (Q-Learning AODV) which suits unicast applications in high mobility scenarios. QLAODV is a distributed reinforcement learning routing protocol, which uses a Q-Learning algorithm to infer network state information and uses unicast control packets to check the path availability in a real time manner in order to allow Q-Learning to work efficiently in a highly dynamic network environment. QLAODV is favored by its dynamic route change mechanism, which makes it capable of reacting quickly to network topology changes. We present an analysis of the performance of QLAODV by simulation using different mobility models. The simulation results show that QLAODV can efficiently handle unicast applications in VANETs.

[1] Peter Dayan,et al. Q-learning , 1992, Machine Learning.

[2] Leslie Pack Kaelbling,et al. Mobilized ad-hoc networks: a reinforcement learning approach , 2004 .

[3] Mahmood Fathy,et al. Enhancing AODV routing protocol using mobility parameters in VANET , 2008, 2008 IEEE/ACS International Conference on Computer Systems and Applications.

[4] Lixin Gao,et al. Prediction-Based Routing for Vehicular Ad Hoc Networks , 2007, IEEE Transactions on Vehicular Technology.

[5] Chiu-Kuo Liang,et al. An ad hoc on-demand routing protocol with high packet delivery fraction , 2004, 2004 IEEE International Conference on Mobile Ad-hoc and Sensor Systems (IEEE Cat. No.04EX975).

[6] Rahul Jain,et al. Geographical routing using partial information for wireless ad hoc networks , 2001, IEEE Wirel. Commun..

[7] Charles E. Perkins,et al. Ad hoc On-Demand Distance Vector (AODV) Routing , 2001, RFC.

[8] Brad Karp,et al. GPSR: greedy perimeter stateless routing for wireless networks , 2000, MobiCom '00.

[9] Fei Xie,et al. TOPO: Routing in Large Scale Vehicular Networks , 2007, 2007 IEEE 66th Vehicular Technology Conference.

[10] Martin Mauve,et al. A survey on position-based routing in mobile ad hoc networks , 2001, IEEE Netw..

[11] Amit Kumar Saha,et al. Modeling mobility for vehicular ad-hoc networks , 2004, VANET '04.

[12] Jim Dowling,et al. Using feedback in collaborative reinforcement learning to adaptively optimize MANET routing , 2005, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[13] Hao Zhu,et al. MURU: A Multi-Hop Routing Protocol for Urban Vehicular Ad Hoc Networks , 2006, 2006 Third Annual International Conference on Mobile and Ubiquitous Systems: Networking & Services.

[14] Gang Lu,et al. Enhancing routing performance for inter-vehicle communication in city environment , 2006, PM2HW2N '06.

[15] Andrew G. Barto,et al. Reinforcement learning , 1998 .

[16] Hamid Menouar. An intelligent movement-based routing for VANETs , 2006 .

[17] M. Meincke,et al. Traffic Models for Inter-Vehicle Communications , 2006, 2006 2nd International Conference on Information & Communication Technologies.

[18] Ehssan Sakhaee,et al. A Stable Routing Protocol to Support ITS Services in VANET Networks , 2007, IEEE Transactions on Vehicular Technology.

[19] Onur Altintas,et al. Survey of Routing Protocols for Inter-Vehicle Communications , 2006, 2006 3rd Annual International Conference on Mobile and Ubiquitous Systems - Workshops.

[20] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[21] Alvin S. Lim,et al. Connectivity Aware Routing in Vehicular Networks , 2008, 2008 IEEE Wireless Communications and Networking Conference.

[22] Yacine Khaled,et al. Conditional Transmissions: Performance Study of a New Communication Strategy in VANET , 2007, IEEE Transactions on Vehicular Technology.

[23] Chris Watkins,et al. Learning from delayed rewards , 1989 .

[24] Ahmed Helmy,et al. IMPORTANT: a framework to systematically analyze the Impact of Mobility on Performance of Routing Protocols for Adhoc Networks , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[25] Subir Biswas,et al. Neighborhood Route Diffusion for Packet Salvaging in Networks with High Mobility , 2008, 2008 IEEE International Performance, Computing and Communications Conference.

[26] Michael L. Littman,et al. Packet Routing in Dynamically Changing Networks: A Reinforcement Learning Approach , 1993, NIPS.

[27] Yu Wang,et al. Routing in vehicular ad hoc networks: A survey , 2007, IEEE Vehicular Technology Magazine.