Multi-objective reinforcement learning based routing in cognitive radio networks: Walking in a random maze

The routing procedure in cognitive radio networks with dynamic spectrum activities is studied. The spectrum statistics are assumed to be unknown. Moreover, the performance is measured using multiple metrics like average delay and packet loss rate. To address the challenges of randomness, uncertainty and multiple metrics, the multi-objective reinforcement learning algorithm is applied for the routing in cognitive radio networks. The effectiveness of the learning procedure is demonstrated by numerical simulations.

[1]  Yang Yang,et al.  Reinforcement learning based spectrum-aware routing in multi-hop cognitive radio networks , 2009, 2009 4th International Conference on Cognitive Radio Oriented Wireless Networks and Communications.

[2]  Kang G. Shin,et al.  OS-MAC: An Efficient MAC Protocol for Spectrum-Agile Wireless Networks , 2008, IEEE Transactions on Mobile Computing.

[3]  Marwan Krunz,et al.  Probabilistic Path Selection in Opportunistic Cognitive Radio Networks , 2008, IEEE GLOBECOM 2008 - 2008 IEEE Global Telecommunications Conference.

[4]  Konkoly Thege Multi-criteria Reinforcement Learning , 1998 .

[5]  Joseph Mitola Cognitive Radio for Flexible Mobile Multimedia Communications , 2001, Mob. Networks Appl..

[6]  Ian F. Akyildiz,et al.  TP-CRAHN: a Transport Protocol for Cognitive Radio Ad-Hoc Networks , 2009, IEEE INFOCOM 2009.

[7]  Xiao Ma,et al.  Spectrum Aware Routing for Multi-Hop Cognitive Radio Networks with a Single Transceiver , 2008, 2008 3rd International Conference on Cognitive Radio Oriented Wireless Networks and Communications (CrownCom 2008).

[8]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[9]  Hang Su,et al.  Cross-Layer Based Opportunistic MAC Protocols for QoS Provisionings Over Cognitive Radio Wireless Networks , 2008, IEEE Journal on Selected Areas in Communications.

[10]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.